Deep Learning Content on InfoQ
-
TensorFlow DTensor: Unified API for Distributed Deep Network Training
Recently released TensorFlow v2.9 introduces a new API for model-, data-, and space-parallel (aka spatially tiled) deep network training. DTensor aims to decouple sharding directives from model code by providing higher-level utilities to partition the model and batch parameters across devices.
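As a rough illustration of the API style, the sketch below uses the tf.experimental.dtensor module shipped with TensorFlow 2.9 to build a one-dimensional device mesh and shard a tensor along its batch axis. The two virtual CPU devices and the tensor shape are assumptions chosen so the example runs on a single machine; they are not taken from the announcement.
```python
import tensorflow as tf
from tensorflow.experimental import dtensor

# Split the single physical CPU into two logical devices (assumption: no
# multi-device hardware is available) so the mesh below has two members.
cpu = tf.config.list_physical_devices("CPU")[0]
tf.config.set_logical_device_configuration(
    cpu, [tf.config.LogicalDeviceConfiguration()] * 2)

# A 1-D mesh whose "batch" dimension spans both logical CPUs (data parallelism).
mesh = dtensor.create_mesh([("batch", 2)], devices=["CPU:0", "CPU:1"])

# Shard the first tensor axis across the "batch" mesh dimension; the second
# axis stays replicated on every device.
layout = dtensor.Layout(["batch", dtensor.UNSHARDED], mesh)

# Create a distributed tensor with that layout and inspect how it was placed.
sharded = dtensor.call_with_layout(tf.zeros, layout, shape=(8, 4))
print(dtensor.fetch_layout(sharded))
```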
-
Amazon Releases 51-Language AI Training Dataset MASSIVE
Amazon Alexa AI's Natural Language Understanding group released Multilingual Amazon SLURP (SLU resource package) for Slot Filling, Intent Classification, and Virtual-Assistant Evaluation (MASSIVE), a dataset for training natural language understanding (NLU) AI models that contains one million annotated samples from 51 languages. The release also includes code and tools for using the data.
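For readers who want to experiment with the data, the short sketch below loads one locale of MASSIVE through the Hugging Face datasets library. The dataset identifier AmazonScience/massive, the en-US configuration name, and the field names are assumptions based on the public mirror rather than details from the announcement.
```python
# pip install datasets
from datasets import load_dataset

# Assumption: MASSIVE is mirrored on the Hugging Face Hub as
# "AmazonScience/massive" with per-locale configurations such as "en-US".
massive = load_dataset("AmazonScience/massive", "en-US", split="train")

example = massive[0]
print(example["utt"])        # the raw utterance
print(example["intent"])     # the annotated intent label
print(example["annot_utt"])  # the utterance with slot annotations
```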
-
LAION Releases Five Billion Image-Text Pair Dataset LAION-5B
The Large-scale Artificial Intelligence Open Network (LAION) released LAION-5B, an AI training dataset containing over five billion image-text pairs. LAION-5B contains images and captions scraped from the internet and is 14x larger than its predecessor LAION-400M, making it the largest freely available image-text dataset.
-
DeepMind Trains AI Controller for Nuclear Fusion Research Device
Researchers at Google subsidiary DeepMind and the Swiss Plasma Center at EPFL have developed a deep reinforcement learning (RL) AI that creates control algorithms for tokamak devices used in nuclear fusion research. The system learned control policies while interacting with a simulator, and when used to control a real device was able to achieve novel plasma configurations.
-
Serving Deep Networks in Production: Balancing Productivity vs Efficiency Tradeoff
A recently published work proposes an alternative approach to serving deep neural networks: running eager-mode model code directly in production workloads through embedded CPython interpreters. The goal is to reduce the engineering effort needed to bring models from the research stage to end users and to provide a proof-of-concept platform for migrating future numerical libraries.
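The summary does not name the tooling involved, but packaging eager-mode PyTorch code into a self-contained archive for serving is commonly done with torch.package; the sketch below is a generic illustration of that workflow under that assumption, and the model, archive name, and resource names are placeholders.
```python
import torch
from torch import nn
from torch.package import PackageExporter, PackageImporter

# Hypothetical eager-mode model standing in for research code.
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4)).eval()

# Bundle the eager-mode model into a self-contained archive; torch itself is
# marked extern because it will be present in the serving environment.
with PackageExporter("tiny_classifier.pt") as exporter:
    exporter.extern(["torch", "torch.**"])
    exporter.save_pickle("model", "model.pkl", model)

# A serving process can later load the archive and run the model eagerly.
importer = PackageImporter("tiny_classifier.pt")
restored = importer.load_pickle("model", "model.pkl")
print(restored(torch.randn(1, 16)).shape)
```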
-
NVIDIA Announces Next Generation AI Hardware H100 GPU and Grace CPU Superchip
At the recent GTC conference, NVIDIA announced their next generation processors for AI computing, the H100 GPU and the Grace CPU Superchip. Based on NVIDIA's Hopper architecture, the H100 includes a Transformer engine for faster training of AI models. The Grace CPU Superchip features 144 Arm cores and outperforms NVIDIA's current dual-CPU offering on the SPECrate 2017_int_base benchmark.
-
Google Trains 540 Billion Parameter AI Language Model PaLM
Google Research recently announced the Pathways Language Model (PaLM), a 540-billion-parameter AI natural language processing (NLP) model that surpasses average human performance on the BIG-bench benchmark. PaLM outperforms other state-of-the-art systems on many evaluation tasks, and shows strong results on tasks such as logical inference and joke explanation.
-
Google Announces AI-Generated Summaries for Google Docs
Google has announced a new feature for their Docs app that will automatically generate a summary of the document content. The summarization is powered by a natural language processing (NLP) AI model based on the Transformer architecture.
-
Ten Lessons from Three Generations of Tensor Processing Units
A recent report published by Google’s TPU group highlights ten takeaways from developing three generations of tensor processing units. The authors also discuss how their previous experience will affect the development of future tensor processing units.
-
Stanford University Publishes AI Index 2022 Annual Report
Stanford University’s Institute for Human-Centered Artificial Intelligence (HAI) has published its 2022 AI Index annual report. The report identifies top trends in AI, including advances in technical achievements, a sharp increase in private investment, and increasing attention on ethical issues.
-
EleutherAI Open-Sources 20 Billion Parameter AI Language Model GPT-NeoX-20B
Researchers from EleutherAI have open-sourced GPT-NeoX-20B, a 20-billion-parameter natural language processing (NLP) AI model similar to GPT-3. The model was trained on 825GB of publicly available text data and has performance comparable to similarly-sized GPT-3 models.
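As a rough illustration of how the released weights can be used, the snippet below loads them through the Hugging Face transformers library. The EleutherAI/gpt-neox-20b checkpoint name is an assumption based on the public mirror, and the full model needs tens of GB of memory to load.
```python
# pip install transformers accelerate
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumption: the released weights are mirrored on the Hugging Face Hub
# under "EleutherAI/gpt-neox-20b"; loading them requires substantial memory.
model_id = "EleutherAI/gpt-neox-20b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

inputs = tokenizer("Deep learning is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```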
-
Meta Announces Conversational AI Model Project CAIRaoke
Meta AI Research recently announced Project CAIRaoke, an end-to-end deep-learning model for digital assistants. Project CAIRaoke is currently being used in Meta's Portal device and outperforms a previous conversational model when evaluated on a reminder task.
-
University of Washington Open-Sources AI Fine-Tuning Algorithm WiSE-FT
A team of researchers from the University of Washington (UW), Google Brain, and Columbia University has open-sourced weight-space ensembles for fine-tuning (WiSE-FT), an algorithm for fine-tuning AI models that improves robustness under distribution shift. Experiments on several computer vision (CV) benchmarks show that WiSE-FT improves accuracy by up to 6 percentage points.
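The core of WiSE-FT is a linear interpolation, in weight space, between the original (zero-shot) model and its fine-tuned counterpart. The PyTorch sketch below illustrates that idea; it assumes both models share an identical architecture, and the toy Linear models and mixing coefficient are placeholders.
```python
import torch
from torch import nn

def wise_ft(zero_shot_model, fine_tuned_model, alpha=0.5):
    """Linearly interpolate two models' weights in weight space.

    alpha = 0.0 keeps the original (zero-shot) weights,
    alpha = 1.0 keeps the fully fine-tuned weights.
    """
    zs_state = zero_shot_model.state_dict()
    ft_state = fine_tuned_model.state_dict()
    return {
        name: (1.0 - alpha) * zs_state[name] + alpha * ft_state[name]
        for name in zs_state
    }

# Toy usage: two models with identical architecture stand in for the
# zero-shot and fine-tuned networks.
zero_shot = nn.Linear(4, 2)
fine_tuned = nn.Linear(4, 2)
merged = nn.Linear(4, 2)
merged.load_state_dict(wise_ft(zero_shot, fine_tuned, alpha=0.5))
print(merged.weight)
```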
-
Allen Institute Launches Updated Embodied AI Challenge
The Allen Institute for AI (AI2) has announced the 2022 version of their AI2-THOR Rearrangement Challenge. The challenge requires competitors to design an autonomous agent that can move objects in a virtual room, and this year brings several improvements, including a new dataset and faster training using the latest release of the AI2-THOR simulation platform.
-
Deep Learning Toolkit Intel OpenVINO Extends API, Improves Performance, and More
The latest release of Intel OpenVINO offers a cleaner API, expands support for natural language processing, and improves performance and portability thanks to its new AUTO plugin. InfoQ spoke with Matthew Formica, Intel OpenVINO senior AI director, to learn more.
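As a rough illustration of the updated Python API and the AUTO plugin, the snippet below compiles a model while letting the runtime pick the best available device; the model file name is a placeholder for any previously exported IR or ONNX model.
```python
from openvino.runtime import Core

core = Core()

# Placeholder path: any IR (.xml/.bin) or ONNX model exported beforehand.
model = core.read_model("model.xml")

# "AUTO" lets OpenVINO choose the best available device (CPU, GPU, ...) at runtime.
compiled = core.compile_model(model, device_name="AUTO")

# Inference would then be driven through an inference request, e.g.
# results = compiled.create_infer_request().infer({input_name: data})
infer_request = compiled.create_infer_request()
```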