Artificial Intelligence Content on InfoQ
-
Generative AI Service AWS Bedrock Now Generally Available
After announcing Bedrock last April in preview, Amazon is now making its fully-managed service for generative AI apps generally available.
-
Multi-Modal LLM NExT-GPT Handles Text, Images, Videos, and Audio
The NExT Research Center at the National University of Singapore (NUS) recently open-sourced NExT-GPT, an "any-to-any" multi-modal large language model (LLM) that can handle text, images, videos, and audio as input or output. NExT-GPT is based on existing pre-trained models and only required updating 1% of its total parameters during training.
-
Hugging Face's Guide to Optimizing LLMs in Production
When it comes to deploying Large Language Models (LLMs) in production, the two major challenges stem from the huge number of parameters they require and the need to handle very long input sequences to represent contextual information. Hugging Face has documented a list of techniques to tackle those hurdles based on its experience serving such models.
-
Microsoft Announces Copilot Copyright Commitment to Address IP Infringement Concerns
Microsoft recently published the Copilot Copyright Commitment to address concerns about potential IP infringement claims from content produced by generative AI. Under this commitment, which covers various products, including GitHub Copilot, Microsoft will take responsibility for potential legal risks if a customer faces copyright challenges.
-
AI a “Must-Have” in GitLab’s 2023 Global DevSecOps Report
GitLab has released its 2023 Global DevSecOps AI report, with the key finding that AI and ML use is evolving from a "nice-to-have" to a "must-have". The report shows that 23% of organizations are already using AI in software development, and of those, 60% are using it daily. Furthermore, 65% of respondents said they are using AI and ML for testing now, or will be within the next three years.
-
Meta Open-Sources Multilingual Translation Foundation Model SeamlessM4T
Meta recently open-sourced Massively Multilingual & Multimodal Machine Translation (SeamlessM4T), a multilingual translation AI that can translate both speech audio and text data across nearly 100 languages. SeamlessM4T is trained on 1 million hours of audio data and outperforms the current state-of-the-art speech-to-text translation model.
-
Abu Dhabi Releases Largest Openly-Available Language Model Falcon 180B
The Abu Dhabi government's Technology Innovation Institute (TII) released Falcon 180B, currently the largest openly-available large language model (LLM). Falcon 180B contains 180 billion parameters and outperforms GPT-3.5 on the MMLU benchmark.
-
AI, ML, Data Engineering News Roundup: Stable Chat, Vertex AI, ChatGPT and Code Llama
The most recent update, covering developments through September 4, 2023, highlights significant announcements and accomplishments in artificial intelligence, machine learning, and data science. Developments from Stability AI, Google, OpenAI, and Meta were among this week's significant stories.
-
Weekly Update on Large Language Models: PointLLM, WALL-E, AskIt, and Jais
This weekly update compiles advanced research, inventive applications, and notable releases in the realm of Large Language Models (LLMs) for the week starting September 4th, 2023.
-
Google Announces Ray Support for Vertex AI to Boost Machine Learning Workflows
Google has announced that it is expanding its open-source support for Vertex AI, its machine learning platform, by adding support for Ray, an open-source unified compute framework. This move is aimed at efficiently scaling AI workloads and enhancing the productivity and operational efficiency of data science teams.
-
Duet AI Integration for Google Apigee API Management and Application Integration
At its Cloud Next event, Google announced the introduction of Duet AI into its cloud services Apigee API Management and Application Integration to support developers in building APIs, integration flows, and extensions that connect Vertex AI or ChatGPT to real-world data through APIs. These capabilities will come in a private preview through Google Cloud’s Trusted Tester program in the coming weeks.
-
Rider 2023.2 with C# Updates, UX/UI Enhancements and AI Assistant
Rider 2023.2 is now generally available. This version improves C# language support, with changes aimed at code readability and the programming experience, and adds some UI/UX enhancements. JetBrains also presented AI Assistant, which for now has limited access.
-
Google Cloud Unveils AlloyDB AI: Transforming PostgreSQL with Advanced Vector Embeddings and AI
During the recent Google Cloud Next, Google announced AlloyDB AI in preview as an integral part of AlloyDB for PostgreSQL, allowing developers to build generative AI (gen AI) applications that use large language models (LLMs) with their real-time operational data through built-in, end-to-end support for vector embeddings.
-
OpenAI Introduces ChatGPT Enterprise for Business-Focus
OpenAI is introducing ChatGPT Enterprise, which provides enterprise-grade protection and privacy, unlimited higher-speed GPT-4 access, extended context windows for processing longer inputs, additional data analysis capabilities, customization possibilities, and much more.
-
6 Tracks Not to Miss at QCon San Francisco, October 2-6, 2023: ML, Architecture, Resilience & More!
At InfoQ’s international software development conference, QCon San Francisco (October 2-6, 2023), senior software practitioners driving innovation and change in software development will explore real-world architectures, technology, and techniques to help you solve the challenges you face.