Google DeepMind Content on InfoQ
-
Google DeepMind Launches TxGemma: Advancing AI-Driven Drug Discovery and Development
TxGemma is designed to enhance the efficiency of drug discovery and clinical trial predictions. Built on the Gemma model family, it aims to streamline the drug development process and accelerate the discovery of new treatments.
-
Google DeepMind Unveils Gemini Robotics
Google DeepMind has introduced Gemini Robotics, an advanced AI model designed to enhance robotics by integrating vision, language, and action. This innovation, based on the Gemini 2.0 framework, aims to make robots smarter and more capable, particularly in real-world settings.
-
Google DeepMind Enhances AMIE for Long-Term Disease Management
Google DeepMind has extended the capabilities of its Articulate Medical Intelligence Explorer (AMIE) beyond diagnosis to support longitudinal disease management. The system is now designed to assist clinicians in monitoring disease progression, adjusting treatments, and adhering to clinical guidelines across multiple patient visits.
-
Google DeepMind’s AlphaGeometry2 AI Achieves Gold-Medal Math Olympiad Performance
Google DeepMind's AlphaGeometry2 (AG2) AI model solved 84% of the geometry problems from the last 25 years of the International Mathematical Olympiad (IMO), surpassing the average performance of human gold medalists.
-
AMD and Johns Hopkins Researchers Develop AI Agent Framework to Automate Scientific Research Process
Researchers from AMD and Johns Hopkins University have developed Agent Laboratory, an artificial intelligence framework that automates core aspects of the scientific research process. The system uses large language models to handle literature reviews, experimentation, and report writing, producing both code repositories and research documentation.
-
Google Releases Experimental AI Reasoning Model
Google has introduced Gemini 2.0 Flash Thinking Experimental, an AI reasoning model available in its AI Studio platform.
-
Google Releases PaliGemma 2 Vision-Language Model Family
Google DeepMind released PaliGemma 2, a family of vision-language models (VLMs). PaliGemma 2 is available in three sizes and three input image resolutions and achieves state-of-the-art performance on several vision-language benchmarks.
-
Google's JEST Algorithm Automates AI Training Dataset Curation and Reduces Training Compute
Google DeepMind recently published a new algorithm for curating AI training datasets: multimodal contrastive learning with joint example selection (JEST), which uses a pre-trained model to score the learnability of batches of data. Google's experiments show that image-text models trained with JEST-curated data require 10x less computation than baseline methods.
-
Recap of Google I/O 2024: Gemini 1.5, Project Astra, AI-powered Search Engine
Google recently hosted its annual developer conference, Google I/O 2024, where numerous announcements were made regarding Google’s apps and services. As anticipated, AI was a focal point of the event, being incorporated into almost all Google products. Here is a summary of the major announcements from the event.
-
Meta and IBM Lead Formation of AI Alliance to Drive Open-Source Innovation
A new consortium, led by Meta and IBM, has been formed to support open-source AI. The AI Alliance comprises numerous organizations from various sectors, including software, hardware, nonprofit, public, and academic, with the goal of collaboratively developing tools and programs to facilitate open development.
-
Google Announces New DeepMind Model, Lyria, to Generate High-Quality Music
Google has introduced Lyria, a Google DeepMind AI music generation model that can generate vocals, lyrics, and background tracks mimicking the style of popular artists. The model is available on YouTube through two AI experiments.
-
Google DeepMind Announces LLM-Based Robot Controller RT-2
Google DeepMind recently announced Robotics Transformer 2 (RT-2), a vision-language-action (VLA) AI model for controlling robots. RT-2 uses a fine-tuned LLM to output motion control commands. It can perform tasks not explicitly included in its training data and improves on baseline models by up to 3x on emergent skill evaluations.
-
Google Announces State-of-the-Art PaLM 2 Language Model Powering Bard
Google DeepMind recently announced PaLM 2, a large language model (LLM) powering Bard and over 25 other product features. PaLM 2 significantly outperforms the previous version of PaLM on a wide range of benchmarks, while being smaller and cheaper to run.
-
DeepMind Open-Sources AI Interpretability Research Tool Tracr
Researchers at DeepMind have open-sourced TRAnsformer Compiler for RASP (Tracr), a compiler that translates programs into neural network models. Tracr is intended for research in mechanistic interpretability of Transformer AI models such as GPT-3.
-
DeepMind Announces Minecraft-Playing AI DreamerV3
Researchers from DeepMind and the University of Toronto announced DreamerV3, a reinforcement-learning (RL) algorithm for training AI models for many different domains. Using a single set of hyperparameters, DreamerV3 outperforms other methods on several benchmarks and can train an AI to collect diamonds in Minecraft without human instruction.