Artificial Intelligence Content on InfoQ

News

Cloud

AWS Launches General Availability of Amazon EC2 P5 Instances for AI/ML and HPC Workloads

AWS recently announced the general availability (GA) of Amazon EC2 P5 instances, powered by the latest NVIDIA H100 Tensor Core GPUs and aimed at users who require high performance and scalability for AI/ML and HPC workloads. The GA follows an earlier announcement that the infrastructure was in development.

Steef-Jan Wiggers
on Aug 03, 2023
Cloud

AWS Introduces a Generative AI-Powered Clinical Documentation Tool with HealthScribe in Preview

AWS recently announced AWS HealthScribe, a new HIPAA-eligible service in preview that uses speech recognition and generative AI (powered by Amazon Bedrock) to generate clinical documentation.

Steef-Jan Wiggers
on Aug 02, 2023
AI, ML & Data Engineering

Researchers Publish Attack Algorithm for ChatGPT and Other LLMs

Researchers from Carnegie Mellon University (CMU) have published LLM Attacks, an algorithm for constructing adversarial attacks on a wide range of large language models (LLMs), including ChatGPT, Claude, and Bard. The attacks are generated automatically and are successful 84% of the time on GPT-3.5 and GPT-4, and 66% of the time on PaLM-2.

Anthony Alford
on Aug 01, 2023
AI, ML & Data Engineering

Amazon Bedrock Unveils New Agents Feature

Amazon announced the release of agents for Amazon Bedrock, a new feature that allows developers to quickly create fully managed agents. By making API calls to enterprise systems, agents for Amazon Bedrock speed up the delivery of generative AI applications that can manage and carry out tasks.

Daniel Dominguez
on Jul 31, 2023
Cloud

Amazon Aurora PostgreSQL Adds pgvector to Support Embeddings from Generative AI

AWS recently announced that the PostgreSQL-compatible edition of Amazon Aurora now supports pgvector for vector storage and similarity search. Aurora is the latest managed PostgreSQL database supporting the open-source extension to store and search embeddings from machine learning models.

Renato Losio
on Jul 29, 2023
AI, ML & Data Engineering

Meta Open Sources New AI Model Llama 2

Meta is open-sourcing its large language model, Llama 2. The model’s code and weights are being made available free of charge for both research and commercial use. Llama 2 is the result of the expanded partnership between Meta and Microsoft, with the latter being the preferred partner for the new model.

Andrew Hoblitzell
on Jul 28, 2023
Web Development

LangChain - Working with Large Language Models, Made Easy

LangChain is a framework that simplifies working with large language models (LLMs) such as OpenAI GPT-4 or Google PaLM by providing abstractions for common use cases. It supports both JavaScript and Python.

Guy Nesher
on Jul 26, 2023
AI, ML & Data Engineering

Meta's Voicebox Outperforms State-of-the-Art Models on Speech Synthesis

Meta recently announced Voicebox, a speech generation model that can perform text-to-speech (TTS) synthesis in six languages, as well as edit and remove noise from speech recordings. Voicebox is trained on over 50k hours of audio data and outperforms previous state-of-the-art models on several TTS benchmarks.

Anthony Alford
on Jul 25, 2023
AI, ML & Data Engineering

AI, ML, Data Engineering News Round up: Claude 2, Stable Doodle, CM3leon, Llama 2, Azure and xAI

The most recent roundup, covering developments from the week of July 17th, 2023, highlights progress and announcements in data science, machine learning, and artificial intelligence. This week's focus is on Anthropic, Stability AI, Microsoft, Meta and xAI.

Daniel Dominguez
on Jul 25, 2023
AI, ML & Data Engineering

GitHub Details Key Prompt Engineering Practices Used to Build Copilot

Prompt engineering is key to creating effective LLM-based applications and does not require a PhD in machine learning or generative AI, say GitHub engineers Albert Ziegler and John Berryman, who also shared the lessons they learned developing GitHub Copilot.

Sergio De Simone
on Jul 24, 2023
Java

JetBrains Unveils AI Assistant for IntelliJ-Based IDEs and .NET Tools

JetBrains, the software development company known for creating IntelliJ IDEA, has announced a new AI Assistant in its Early Access Program (EAP) builds for all IntelliJ-based IDEs and .NET tools. The addition integrates generative AI and large language models into JetBrains' products.

A N M Bazlur Rahman
on Jul 24, 2023
Cloud

Microsoft Introduces the Public Preview of Vector Search Feature in Azure Cognitive Search

At its annual Inspire conference, Microsoft recently announced the public preview of vector search in Azure Cognitive Search, a new capability for indexing, storing, and retrieving vector embeddings from a search index. It is intended for building applications powered by large language models.

Steef-Jan Wiggers
on Jul 21, 2023
AI, ML & Data Engineering

Meta AI Reveals CM3leon, an Advanced Text-to-Image Generative Model

Meta AI has introduced CM3leon, a multimodal model that combines text and image generation. Meta describes it as the first model of its type, built with a recipe adapted from text-only language models to deliver strong results with high computational efficiency.

Daniel Dominguez
on Jul 20, 2023
Cloud

Microsoft Azure Managed Lustre for HPC and AI Workloads Now Generally Available

Microsoft recently announced the general availability (GA) of Azure Managed Lustre, a managed file system for high-performance computing (HPC) and AI workloads.

Steef-Jan Wiggers
on Jul 20, 2023
AI, ML & Data Engineering

Introduction to Mojo Programming Language

Mojo is a newly presented programming language that combines the simplicity of Python with the speed and memory safety of Rust. It is at an early stage of development and offers users an online playground to explore its features. Mojo targets data science and machine learning, aiming to provide a fast alternative to Python. There are plans to open-source it gradually.

Robert Krzaczyński
on Jul 19, 2023
