Deep Learning Content on InfoQ
-
AMD Introduces Its Deep-Learning Accelerator Instinct MI200 Series GPUs
In its recent Accelerated Data Center Premiere Keynote, AMD unveiled its Instinct MI200 accelerator series: the high-end Instinct MI250X and the slightly lower-end Instinct MI250 GPUs. Designed with the CDNA 2 architecture and TSMC's 6nm FinFET lithography, the MI250X provides 47.9 TFLOPS of peak double-precision performance and memory capacity that allows training larger deep networks while minimizing model sharding.
-
Facebook Open-Sources GHN-2 AI for Fast Initialization of Deep-Learning Models
A team from Facebook AI Research (FAIR) and the University of Guelph has open-sourced an improved Graph HyperNetworks (GHN-2) meta-model that predicts initial parameters for deep-learning neural networks. GHN-2 executes in less than a second on a CPU and predicts values for computer vision (CV) networks that achieve up to 77% top-1 accuracy on CIFAR-10 with no additional training.
-
PyTorch 1.10 Release Includes CUDA Graphs APIs, Compiler Improvements, and Android NNAPI Support
PyTorch, Facebook's open-source deep-learning framework, announced the release of version 1.10 which includes an integration with CUDA Graphs APIs and JIT compiler updates to increase CPU performance, as well as beta support for the Android Neural Networks API (NNAPI). New versions of domain-specific libraries TorchVision and TorchAudio were also released.
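The CUDA Graphs integration lets a static training or inference step be captured once and then replayed with very little CPU launch overhead. A minimal sketch of the beta API, assuming a CUDA-capable GPU and a model with fixed input shapes (the model and sizes below are illustrative only):

```python
# Minimal sketch of PyTorch 1.10's beta CUDA Graphs API (torch.cuda.graph /
# torch.cuda.CUDAGraph); requires a CUDA-capable GPU.
import torch

model = torch.nn.Linear(64, 64).cuda()
static_input = torch.randn(8, 64, device="cuda")

# Warm up on a side stream before capture, as the PyTorch docs recommend.
s = torch.cuda.Stream()
s.wait_stream(torch.cuda.current_stream())
with torch.cuda.stream(s):
    for _ in range(3):
        model(static_input)
torch.cuda.current_stream().wait_stream(s)

# Capture the forward pass into a graph, then replay it with new data.
g = torch.cuda.CUDAGraph()
with torch.cuda.graph(g):
    static_output = model(static_input)

static_input.copy_(torch.randn(8, 64, device="cuda"))
g.replay()  # re-runs the captured kernels against the updated static_input
print(static_output.shape)
```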
-
QCon Plus ML Panel Discussion: ML in Production - What's Next?
The recent QCon Plus online conference featured a panel discussion titled "ML in Production - What's Next?" Some key takeaways were that many ML projects fail in production because of poor engineering infrastructure and a lack of communication across disciplines, and that both model explainability and ML for edge computing are important technologies that are not yet mature.
-
Roland Meertens on the Unreasonable Effectiveness of Zero Shot Learning
At the recent QCon Plus online conference, Roland Meertens gave a talk on developing AI-based applications titled "The Unreasonable Effectiveness of Zero Shot Learning." He demonstrated two examples of using foundation models and zero-shot learning to rapidly deploy prototype applications and gather feedback without needing to collect large datasets or train models.
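As a rough illustration of the approach (not the speaker's own demos), a pretrained natural-language-inference model can act as a classifier with no task-specific training; the Hugging Face pipeline below is one assumed way to prototype this:

```python
# Minimal zero-shot classification sketch using the Hugging Face
# "zero-shot-classification" pipeline; the text and labels are hypothetical,
# and a default pretrained model is downloaded on first use.
from transformers import pipeline

classifier = pipeline("zero-shot-classification")
result = classifier(
    "The package arrived two weeks late and the box was damaged.",
    candidate_labels=["shipping problem", "product quality", "billing question"],
)
print(result["labels"][0], round(result["scores"][0], 3))
```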
-
Francesca Lazzeri on What You Should Know before Deploying ML in Production
At the recent QCon Plus online conference, Dr. Francesca Lazzeri gave a talk on machine learning operations (MLOps) titled "What You Should Know before Deploying ML in Production." She covered four key topics: MLOps capabilities, open-source integrations, machine-learning pipelines, and the MLflow platform.
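For readers unfamiliar with MLflow, a minimal experiment-tracking sketch looks like the following; the parameter names and metric values are placeholders, not examples from the talk:

```python
# Minimal MLflow experiment-tracking sketch; runs are written to the local
# ./mlruns directory by default, and the values below are placeholders.
import mlflow

with mlflow.start_run(run_name="demo-run"):
    mlflow.log_param("learning_rate", 0.01)
    mlflow.log_param("epochs", 5)
    for epoch in range(5):
        # In a real pipeline this metric would come from model evaluation.
        mlflow.log_metric("val_accuracy", 0.70 + 0.05 * epoch, step=epoch)
```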
-
BigScience Research Workshop Releases AI Language Model T0
The BigScience Research Workshop released T0, a series of natural language processing (NLP) AI models specifically trained for researching zero-shot multitask learning. T0 can often outperform models 6x larger on the BIG-bench benchmark, and can outperform the 16x larger GPT-3 on several other NLP benchmarks.
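The models are published on the Hugging Face Hub, so zero-shot prompting follows the standard seq2seq pattern; a minimal sketch, assuming the "bigscience/T0_3B" checkpoint name and using a made-up prompt:

```python
# Minimal zero-shot prompting sketch with a T0 checkpoint from the Hugging
# Face Hub; "bigscience/T0_3B" is assumed to be the smallest released variant.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

name = "bigscience/T0_3B"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForSeq2SeqLM.from_pretrained(name)

prompt = "Is this review positive or negative? Review: the movie was a waste of time."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=10)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```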
-
Amazon Releases DL1 Instances Powered by Gaudi Accelerators
Amazon recently announced the general availability of the EC2 DL1 instances powered by Gaudi accelerators from Habana Labs. The new instances promise better price performance in training deep-learning models for use cases such as computer vision, natural language processing, autonomous vehicle perception, and recommendation engines.
-
Baidu Announces 11 Billion Parameter Chatbot AI PLATO-XL
Baidu recently announced PLATO-XL, an AI model for dialog generation, which was trained on over a billion samples collected from social media conversations in both English and Chinese. PLATO-XL achieves state-of-the-art performance on several conversational benchmarks, outperforming currently available commercial chatbots.
-
IBM Develops Hardware-Based Vector-Symbolic AI Architecture
IBM Research recently announced a memory-augmented neural network (MANN) AI system consisting of a neural network controller and phase-change memory (PCM) hardware. By performing analog in-memory computation on high-dimensional (HD) binary vectors, the system learns few-shot classification tasks on the Omniglot benchmark with only a 2.7% accuracy drop compared to 32-bit software implementations.
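As a rough, software-only illustration of the vector-symbolic idea (not IBM's in-memory hardware or training procedure), classes can be represented by bundling high-dimensional binary vectors and queried by Hamming distance:

```python
# Simplified, hypothetical sketch of hyperdimensional (vector-symbolic)
# classification: each class prototype is the element-wise majority
# ("bundle") of its examples' binary vectors, and a query is assigned to the
# prototype with the smallest Hamming distance.  IBM's system performs the
# analogous operations in analog phase-change memory.
import numpy as np

rng = np.random.default_rng(0)
DIM = 10_000  # high dimensionality makes random vectors nearly orthogonal

def random_hv(n):
    return rng.integers(0, 2, size=(n, DIM), dtype=np.uint8)

# Three classes with five stand-in example vectors each.
examples = {c: random_hv(5) for c in ["a", "b", "c"]}
prototypes = {c: (v.sum(axis=0) > v.shape[0] / 2).astype(np.uint8)
              for c, v in examples.items()}

# Classify a noisy copy of one class-"b" example by Hamming distance.
query = examples["b"][0].copy()
flip = rng.choice(DIM, size=DIM // 10, replace=False)  # 10% bit noise
query[flip] ^= 1
pred = min(prototypes, key=lambda c: int((query != prototypes[c]).sum()))
print("predicted class:", pred)
```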
-
Google's Gated Multi-Layer Perceptron Outperforms Transformers Using Fewer Parameters
Researchers at Google Brain have announced Gated Multi-Layer Perceptron (gMLP), a deep-learning model that contains only basic multi-layer perceptrons. Using fewer parameters, gMLP outperforms Transformer models on natural-language processing (NLP) tasks and achieves comparable accuracy on computer vision (CV) tasks.
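The core building block of gMLP is a spatial gating unit that replaces self-attention with a learned linear projection across token positions. A simplified PyTorch sketch of the idea follows; the dimensions are illustrative, not the paper's configuration:

```python
# Simplified sketch of gMLP's spatial gating unit (SGU): the channel
# dimension is split in half, one half is linearly mixed across the token
# (sequence) dimension, and the result gates the other half element-wise.
import torch
import torch.nn as nn

class SpatialGatingUnit(nn.Module):
    def __init__(self, d_ffn, seq_len):
        super().__init__()
        self.norm = nn.LayerNorm(d_ffn // 2)
        # Linear mixing across token positions (the "spatial" projection).
        self.spatial_proj = nn.Linear(seq_len, seq_len)
        nn.init.zeros_(self.spatial_proj.weight)  # near-identity gating at init
        nn.init.ones_(self.spatial_proj.bias)

    def forward(self, x):                         # x: (batch, seq_len, d_ffn)
        u, v = x.chunk(2, dim=-1)
        v = self.norm(v)
        v = self.spatial_proj(v.transpose(1, 2)).transpose(1, 2)
        return u * v                              # (batch, seq_len, d_ffn // 2)

x = torch.randn(2, 16, 256)
print(SpatialGatingUnit(d_ffn=256, seq_len=16)(x).shape)  # torch.Size([2, 16, 128])
```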
-
TensorFlow Similarity Supports Fast Query Search Index on Pre-trained Models
Francois Chollet and his team recently released TensorFlow Similarity, a Python library for TensorFlow. Similarity learning is the task of finding items that resemble each other, from matching similar clothing in images to identifying people from face photos. Deep-learning models have used a method called contrastive learning to improve the accuracy and efficiency of learning similarity between images.
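The sketch below illustrates the general idea behind an embedding-based similarity index (it is not TensorFlow Similarity's actual API): items are embedded, L2-normalized, and queried by cosine similarity.

```python
# Generic embedding-lookup sketch: a stand-in embedding function, an
# L2-normalized index, and a cosine-similarity query.  The random projection
# stands in for a model trained with a contrastive loss.
import numpy as np

rng = np.random.default_rng(0)
projection = rng.standard_normal((64, 16))

def embed(items):
    # Placeholder for a trained embedding model.
    return items @ projection

catalog = rng.standard_normal((500, 64))           # items to index
index = embed(catalog)
index /= np.linalg.norm(index, axis=1, keepdims=True)

query = embed(catalog[42:43])                       # query with a known item
query /= np.linalg.norm(query)
scores = index @ query.ravel()                      # cosine similarities
print("top matches:", np.argsort(-scores)[:3])      # should include item 42
```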
-
Google's Dev Library is a Curated Collection of Projects about Google Tech
Google has launched a new initiative aimed at creating a curated collection of open source projects related to Google technologies. Google's Dev Library will not only contain code repositories, but also articles, tools, and tutorials collected from various Internet sources.
-
MIT Researchers Open-Source Approximate Matrix Multiplication Algorithm MADDNESS
Researchers at MIT's Computer Science & Artificial Intelligence Lab (CSAIL) have open-sourced Multiply-ADDitioN-lESS (MADDNESS), an algorithm that speeds up machine learning using approximate matrix multiplication (AMM). MADDNESS requires zero multiply-add operations and runs 10x faster than other approximate methods and 100x faster than exact multiplication.
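To give a flavor of lookup-table-based approximate matrix multiplication, here is a deliberately simplified stand-in, not the MADDNESS algorithm itself; MADDNESS replaces the nearest-prototype encoding step below with a learned, multiplication-free hashing tree.

```python
# Hypothetical illustration of table-based approximate matrix multiplication:
# each row of A is encoded as the id of its nearest prototype, and products
# with B are read from a precomputed table rather than recomputed.
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((1000, 16))   # input rows to encode
B = rng.standard_normal((16, 8))      # fixed weight matrix

# Offline: pick K prototypes (here, random rows of A) and precompute their
# exact products with B.
K = 16
prototypes = A[rng.choice(len(A), K, replace=False)]
table = prototypes @ B                # (K, 8) lookup table

# Online: encode each row as its nearest prototype id, then "multiply" by
# table lookup only -- no multiply-adds involving B at query time.
dists = ((A[:, None, :] - prototypes[None, :, :]) ** 2).sum(-1)
codes = dists.argmin(axis=1)
approx = table[codes]

exact = A @ B
print("relative error:", np.linalg.norm(approx - exact) / np.linalg.norm(exact))
```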
-
Stanford Research Center Studies Impacts of Popular Pretrained Models
Stanford University recently announced a new research center, the Center for Research on Foundation Models (CRFM), devoted to studying the effects of the large pretrained deep networks (e.g., BERT, GPT-3, CLIP) now being adopted by a growing number of machine-learning research institutions and startups.