Deep Learning Content on InfoQ
-
PyTorch 1.8 Release Includes Distributed Training Updates and AMD ROCm Support
PyTorch, Facebook's open-source deep-learning framework, announced the release of version 1.8, which includes updated APIs, improvements for distributed training, and support for the ROCm platform for AMD's GPU accelerators. New versions of domain-specific libraries TorchVision, TorchAudio, and TorchText were also released.
-
Stanford Publishes AI Index 2021 Annual Report
Stanford University’s Institute for Human-Centered Artificial Intelligence (HAI) has published its AI Index annual report. The underlying data for this year's report has been expanded compared to the previous year's, and the report includes several perspectives on the COVID-19 pandemic's impact on AI research and development.
-
Google Open-Sources AutoML Algorithm Model Search
A team from Google Research has open-sourced Model Search, an automated machine learning (AutoML) platform for designing deep-learning models. Experimental results show that the system produces models that outperform the best human-designed models, with fewer training iterations and model parameters.
-
TensorFlow 3D: Deep Learning for Autonomous Cars’ 3D Perception
Google has released TensorFlow 3D, a library that adds 3D deep-learning capabilities to the TensorFlow machine-learning framework. The new library brings tools and resources that allow researchers to develop and deploy 3D scene understanding models.
-
Facebook Open-Sources AI Model to Predict COVID-19 Patient Outcomes
A team from Facebook AI Research (FAIR) and New York University (NYU) School of Medicine has developed deep-learning models that use chest X-rays to predict COVID-19 patient prognosis. In a comparison study, the models outperformed human radiologists, and could be used to help hospitals predict the demand for supplemental oxygen or intensive care.
-
NLP Library spaCy 3.0 Features Transformer-Based Models and Distributed Training
AI software makers Explosion announced version 3.0 of spaCy, their open-source natural-language processing (NLP) library. The new release includes state-of-the-art Transformer-based pipelines and pre-trained models for 17 languages.
-
Google Open-Sources Trillion-Parameter AI Language Model Switch Transformer
Researchers at Google Brain have open-sourced the Switch Transformer, a natural-language processing (NLP) AI model. The model scales up to 1.6T parameters and trains up to 7x faster than the T5 NLP model, with comparable accuracy.
-
OpenAI Announces GPT-3 Model for Image Generation
OpenAI has trained a 12B-parameter AI model based on GPT-3 that can generate images from textual descriptions. A description can specify many independent attributes, including the position of objects as well as image perspective, and the model can also synthesize combinations of objects that do not exist in the real world.
-
Facebook Open-Sources Multilingual Speech Recognition Deep-Learning Model
Facebook AI Research (FAIR) open-sourced Cross-Lingual Speech Recognition (XLSR), a multilingual speech recognition AI model. XLSR is trained on 53 languages and outperforms existing systems when evaluated on common benchmarks.
-
TensorFlow 2.4 Release Includes CUDA 11 Support and API Updates
The TensorFlow project announced the release of version 2.4.0 of the deep-learning framework, featuring support for CUDA 11, cuDNN 8, and NVIDIA's Ampere GPU architecture, as well as new strategies and profiling tools for distributed training. Other API updates include mixed-precision in Keras and a NumPy frontend.
-
AI Models from Google and Microsoft Exceed Human Performance on Language Understanding Benchmark
Research teams from Google and Microsoft have recently developed natural language processing (NLP) AI models which have scored higher than the human baseline score on the SuperGLUE benchmark. SuperGLUE measures a model's score on several natural language understanding (NLU) tasks, including question answering and reading comprehension.
-
DeepMind's AlphaFold2 AI Solves 50-Year-Old Biology Challenge
The Protein Structure Prediction Center announced that AlphaFold2, an AI system developed by DeepMind, has solved its Protein Structure Prediction challenge. AlphaFold2 achieved a median score of 92.4 on the Global Distance Test (GDT) metric, above the threshold considered competitive with traditional methods.
-
Molecular Dynamics Simulation Based on Machine Learning Wins Gordon Bell Prize
The 2020 Association for Computing Machinery (ACM) Gordon Bell Prize was awarded to a team of researchers from institutions in the USA and China for their project titled "Pushing the limit of molecular dynamics with ab initio accuracy to 100 million atoms with machine learning."
-
Caltech Open-Sources AI for Solving Partial Differential Equations
Researchers from Caltech's DOLCIT group have open-sourced Fourier Neural Operator (FNO), a deep-learning method for solving partial differential equations (PDEs). FNO outperforms other existing deep-learning techniques for solving PDEs and is three orders of magnitude faster than traditional solvers.
-
Trend Micro, Europol, and UNICRI Publish AI Misuse Report
Trend Micro, Europol’s European Cybercrime Centre (EC3), and the United Nations Interregional Crime and Justice Research Institute (UNICRI) have jointly produced a report on current and possible future criminal misuse of AI. The report also includes a set of preparedness recommendations for policymakers, law enforcement, and cybersecurity experts.