InfoQ Software Architects' Newsletter

A monthly overview of things you need to know as an architect or aspiring architect.

Enter your e-mail address

Select your country

We protect your privacy.

InfoQ Homepage Deep Learning Content on InfoQ

News

RSS Feed

Newer Older

Cloud

Azure AI Foundry Labs: a hub for the Latest AI Research and Experiments at Microsoft

Microsoft's Azure AI Foundry Labs revolutionizes AI development by bridging cutting-edge research with real-world applications. Offering experimental projects like Aurora and MatterSim empowers developers to prototype new technologies. With tools for dynamic learning and multimodal models, Azure Labs accelerates innovation and collaboration.

Steef-Jan Wiggers
on Mar 04, 2025
AI, ML & Data Engineering

Microsoft Releases BioEmu-1: a Deep Learning Model for Protein Structure Prediction

Microsoft Research has introduced BioEmu-1, a deep-learning model designed to predict the range of structural conformations that proteins can adopt. Unlike traditional methods that provide a single static structure, BioEmu-1 generates structural ensembles, offering a broader view of protein dynamics.

Robert Krzaczyński
on Feb 28, 2025
AI, ML & Data Engineering

AMD and Johns Hopkins Researchers Develop AI Agent Framework to Automate Scientific Research Process

Researchers from AMD and Johns Hopkins University have developed Agent Laboratory, an artificial intelligence framework that automates core aspects of the scientific research process. The system uses large language models to handle literature reviews, experimentation, and report writing, producing both code repositories and research documentation.

Vinod Goje
on Jan 31, 2025
AI, ML & Data Engineering

DeepSeek Open-Sources DeepSeek-V3, a 671B Parameter Mixture of Experts LLM

DeepSeek open-sourced DeepSeek-V3, a Mixture-of-Experts (MoE) LLM containing 671B parameters. It was pre-trained on 14.8T tokens using 2.788M GPU hours and outperforms other open-source models on a range of LLM benchmarks, including MMLU, MMLU-Pro, and GPQA.

Anthony Alford
on Jan 21, 2025
AI, ML & Data Engineering

QCon SF: Large Scale Search and Ranking Systems at Netflix

Moumita Bhattacharya spoke at QCon SF 2024 about state-of-the-art search and ranking systems. She gave an overview of the typical structure of these systems and followed with a deep dive into how Netflix created a single combined model to handle both tasks.

Anthony Alford
on Nov 19, 2024
AI, ML & Data Engineering

Meta AI Introduces Thought Preference Optimization Enabling AI Models to Think before Responding

Researchers from Meta FAIR, the University of California, Berkeley, and New York University have introduced Thought Preference Optimization (TPO), a new method aimed at improving the response quality of instruction-fine tuned LLMs.

Daniel Dominguez
on Nov 04, 2024
AI, ML & Data Engineering

PyTorch 2.5 Release Includes Support for Intel GPUs

The PyTorch Foundation recently released PyTorch version 2.5, which contains support for Intel GPUs. The release also includes several performance enhancements, such as the FlexAttention API, TorchInductor CPU backend optimizations, and a regional compilation feature which reduces compilation time. Overall, the release contains 4095 commits since PyTorch 2.4.

Anthony Alford
on Oct 29, 2024
AI, ML & Data Engineering

Meta Releases Llama 3.2 with Vision, Voice, and Open Customizable Models

Meta recently announced Llama 3.2, the latest version of Meta's open-source language model, which includes vision, voice, and open customizable models. This is the first multimodal version of the model, which will allow users to interact with visual data in ways like identifying objects in photos or editing images with natural language commands among other use cases.

Andrew Hoblitzell
on Oct 07, 2024
AI, ML & Data Engineering

Google Develops Voice Transfer AI for Restoring Voices

A team at Google Research developed a zero-shot voice transfer (VT) model that can be used to customize a text-to-speech (TTS) with a specific person's voice. This allows speakers who have lost their voice, for example from Parkinson's disease or ALS, to use a TTS device to replicate their original voice. The model also works across languages.

Anthony Alford
on Oct 01, 2024
AI, ML & Data Engineering

Google Announces Game Simulation AI GameNGen

A research team from Google recently published a paper on GameNGen, a generative AI model that can simulate the video game Doom. GameNGen can simulate the game at 20 frames-per-second (FPS) and in human evaluations was preferred only slightly less often than the actual game.

Anthony Alford
on Sep 10, 2024
AI, ML & Data Engineering

University Researchers Create New Type of Interpretable Neural Network

Researchers from Massachusetts Institute of Technology, California Institute of Technology, and Northeastern University created a new type of neural network: Kolmogorov–Arnold Networks (KAN). KAN models outperform larger perceptron-based models on physics modeling tasks and provide a more interpretable visualization.

Anthony Alford
on Aug 20, 2024
AI, ML & Data Engineering

University of Pennsylvania Researchers Develop Processorless Learning Circuitry

Researchers from the University of Pennsylvania have designed an electrical circuit, similar to a neural network, that can learn tasks such as nonlinear regression. The circuit operates at low power levels and can be trained without a computer.

Anthony Alford
on Aug 13, 2024
AI, ML & Data Engineering

Google's JEST Algorithm Automates AI Training Dataset Curation and Reduces Training Compute

Google DeepMind recently published a new algorithm for curating AI training datasets: multimodal contrastive learning with joint example selection (JEST), which uses a pre-trained model to score the learnability of batches of data. Google's experiments show that image-text models trained with JEST-curated data require 10x less computation than baseline methods.

Anthony Alford
on Jul 30, 2024
AI, ML & Data Engineering

Google Open Sources 27B Parameter Gemma 2 Language Model

Google DeepMind recently open-sourced Gemma 2, the next generation of their family of small language models. Google made several improvements to the Gemma architecture and used knowledge distillation to give the models state-of-the-art performance: Gemma 2 outperforms other models of comparable size and is competitive with models 2x larger.

Anthony Alford
on Jul 16, 2024
AI, ML & Data Engineering

OpenAI's CriticGPT Catches Errors in Code Generated by ChatGPT

OpenAI recently published a paper about CriticGPT, a version of GPT-4 fine-tuned to critique code generated by ChatGPT. When compared with human evaluators, CriticGPT catches more bugs and produces better critiques. OpenAI plans to use CriticGPT to improve future versions of their models.

Anthony Alford
on Jul 09, 2024

Newer News

Older News

InfoQ Software Architects' Newsletter

Login with:

Don't have an InfoQ account?

News