PrefixRL: Nvidia's Deep-Reinforcement-Learning Approach to Design Better Circuits
Nvidia has developed PrefixRL, a reinforcement-learning (RL) approach to designing parallel-prefix circuits that are smaller and faster than those produced by state-of-the-art electronic-design-automation (EDA) tools.
-
Meta Open-Sources 200 Language Translation AI NLLB-200
Meta AI recently open-sourced NLLB-200, an AI model that can translate between any of over 200 languages. NLLB-200 is a 54.5B parameter Mixture of Experts (MoE) model that was trained on a dataset containing more than 18 billion sentence pairs. On benchmark evaluations, NLLB-200 outperforms other state-of-the-art models by up to 44%.
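The released NLLB-200 checkpoints can be loaded through HuggingFace Transformers. The snippet below is a minimal sketch, assuming the distilled 600M variant (facebook/nllb-200-distilled-600M) rather than the full 54.5B MoE model, and FLORES-200 language codes such as eng_Latn and fra_Latn:
```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# assumes the distilled 600M checkpoint; the full 54.5B MoE model is far larger
checkpoint = "facebook/nllb-200-distilled-600M"
tokenizer = AutoTokenizer.from_pretrained(checkpoint, src_lang="eng_Latn")
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

inputs = tokenizer("The weather is lovely today.", return_tensors="pt")
# force the decoder to start in the target language (here, French)
out = model.generate(
    **inputs,
    forced_bos_token_id=tokenizer.convert_tokens_to_ids("fra_Latn"),
    max_length=64,
)
print(tokenizer.batch_decode(out, skip_special_tokens=True)[0])
```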
-
Google AI Open-Sourced a New ML Tool for Conceptual and Subjective Queries over Images
Google AI open-sourced Mood Board Search, a new ML-powered tool for subjective or conceptual queries over images. Mood Board Search helps users define conceptual and subjective image queries, such as "peaceful" or "beautiful."
-
BigScience Releases 176B Parameter AI Language Model BLOOM
The BigScience research workshop released the BigScience Large Open-science Open-access Multilingual Language Model (BLOOM), an autoregressive language model based on the GPT-3 architecture. BLOOM was trained on data from 46 natural languages and 13 programming languages and is the largest publicly available open multilingual language model.
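BLOOM is available on the HuggingFace Hub. A minimal generation sketch follows, assuming the small 560M-parameter variant (bigscience/bloom-560m), since the full 176B checkpoint needs hundreds of gigabytes of memory:
```python
from transformers import pipeline

# assumes the 560M-parameter variant; the full 176B model does not fit on one machine
generator = pipeline("text-generation", model="bigscience/bloom-560m")
# BLOOM is multilingual, so prompts need not be in English
print(generator("El aprendizaje profundo es", max_new_tokens=30)[0]["generated_text"])
```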
-
Google's Image-Text AI LIMoE Outperforms CLIP on ImageNet Benchmark
Researchers at Google Brain recently trained Language-Image Mixture of Experts (LIMoE), a 5.6B parameter image-text AI model. In zero-shot learning experiments on ImageNet, LIMoE outperforms CLIP and performs comparably to state-of-the-art models while using fewer compute resources.
-
PyTorch 1.12 Release Includes Accelerated Training on Macs and New Library TorchArrow
The PyTorch team announced the release of version 1.12 of the open-source deep-learning framework, which includes support for GPU-accelerated training on Apple-silicon Macs and a new data-preprocessing library, TorchArrow, as well as updates to other libraries and APIs.
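Training on Apple-silicon GPUs uses the new MPS (Metal Performance Shaders) backend. A minimal sketch of selecting the device, falling back to CPU where MPS is unavailable:
```python
import torch

# the MPS backend is new in PyTorch 1.12; fall back to CPU elsewhere
device = torch.device("mps" if torch.backends.mps.is_available() else "cpu")

# any model and tensors moved to the device train on the Apple GPU
model = torch.nn.Linear(128, 10).to(device)
x = torch.randn(32, 128, device=device)
print(model(x).shape)  # torch.Size([32, 10])
```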
-
Google AI Developed a Language Model to Solve Quantitative Reasoning Problems
Google AI developed Minerva, a deep-learning language model that can solve quantitative mathematical problems. The researchers achieved state-of-the-art results by training the model on a large dataset of quantitative-reasoning content containing symbolic expressions, enabling Minerva to solve problems on STEM reasoning benchmarks.
-
OpenAI Releases Minecraft-Playing AI VPT
Researchers from OpenAI have open-sourced Video PreTraining (VPT), a semi-supervised learning technique for training game-playing agents. In a zero-shot setting, VPT performs tasks that agents cannot learn via reinforcement learning (RL) alone, and with fine-tuning is the first AI to craft a diamond pickaxe in Minecraft.
-
Adobe Researchers Open-Source Image Captioning AI CLIP-S
Researchers from Adobe and the University of North Carolina (UNC) have open-sourced CLIP-S, an image-captioning AI model that produces fine-grained descriptions of images. In evaluations with captions generated by other models, human judges preferred those generated by CLIP-S a majority of the time.
-
Stanford University Open-Sources Controllable Generative Language AI Diffusion-LM
Researchers at Stanford University have open-sourced Diffusion-LM, a non-autoregressive generative language model that allows for fine-grained control of the model's output text. When evaluated on controlled text generation tasks, Diffusion-LM outperforms existing methods.
-
DeepMind Trains 80 Billion Parameter AI Vision-Language Model Flamingo
DeepMind recently trained Flamingo, an 80B parameter vision-language model (VLM) AI. Flamingo combines separately pre-trained vision and language models and outperforms all other few-shot learning models on 16 vision-language benchmarks. Flamingo can also chat with users, answering questions about input images and videos.
-
Google's New Imagen AI Outperforms DALL-E on Text-to-Image Generation Benchmarks
Researchers from Google's Brain Team have announced Imagen, a text-to-image AI model that can generate photorealistic images of a scene given a textual description. Imagen outperforms DALL-E 2 on the COCO benchmark, and unlike many similar models, is pre-trained only on text data.
-
Meta Open-Sources 175 Billion Parameter AI Language Model OPT
Meta AI Research released Open Pre-trained Transformer (OPT-175B), a 175B parameter AI language model. The model was trained on a dataset containing 180B tokens and exhibits performance comparable to GPT-3, while requiring only 1/7th of GPT-3's training carbon footprint.
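The smaller OPT checkpoints are freely downloadable, while the 175B weights require a research-access request. A minimal sketch, assuming the public facebook/opt-1.3b checkpoint via HuggingFace Transformers:
```python
from transformers import pipeline

# assumes a small public checkpoint; OPT-175B itself is gated behind a research-access request
generator = pipeline("text-generation", model="facebook/opt-1.3b")
print(generator("Deep learning is", max_new_tokens=30)[0]["generated_text"])
```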
-
Allen Institute for AI Open-Sources AI Model Inspection Tool LM-Debugger
The Allen Institute for AI (AI2) open-sourced LM-Debugger, an interactive tool for interpreting and controlling the output of language model (LM) predictions. LM-Debugger supports any HuggingFace GPT-2 model and allows users to intervene in the text generation process by dynamically modifying updates in the hidden layers of the model's neural network.
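LM-Debugger builds on the view that each feed-forward (FFN) sub-layer adds an interpretable update to the token representation. The sketch below is not the LM-Debugger API; it illustrates the underlying mechanism with a plain PyTorch forward hook that scales one GPT-2 FFN update during generation (the layer index and 0.5 factor are arbitrary choices for illustration):
```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

def dampen_update(module, inputs, output):
    # shrink this layer's additive update to the residual stream
    return output * 0.5

# hook the MLP (FFN) sub-layer of transformer block 6
hook = model.transformer.h[6].mlp.register_forward_hook(dampen_update)
ids = tokenizer("The capital of France is", return_tensors="pt")
print(tokenizer.decode(model.generate(**ids, max_new_tokens=5)[0]))
hook.remove()
```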
-
New GraphWorld Tool Accelerates Graph Neural-Network Benchmarking
Google AI recently released GraphWorld, a tool to accelerate performance benchmarking in the area of graph neural networks (GNNs). GraphWorld is a configurable framework for generating graphs with a variety of structural properties, such as different node-degree distributions and Gini indices.
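GraphWorld populates its benchmarks by sweeping generator parameters. The snippet below is an illustrative stand-in using networkx rather than the GraphWorld API, sweeping the intra-community edge probability of a planted-partition graph and reporting the degree Gini index:
```python
import networkx as nx
import numpy as np

def degree_gini(g):
    # Gini coefficient of the node-degree distribution
    d = np.sort([deg for _, deg in g.degree()]).astype(float)
    n = len(d)
    return (2 * np.sum(np.arange(1, n + 1) * d)) / (n * d.sum()) - (n + 1) / n

# sweep a structural parameter to produce graphs with varying properties
for p_in in [0.05, 0.15, 0.3]:
    g = nx.planted_partition_graph(l=4, k=50, p_in=p_in, p_out=0.01, seed=0)
    print(f"p_in={p_in}: degree Gini = {degree_gini(g):.3f}")
```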