InfoQ Software Architects' Newsletter

A monthly overview of things you need to know as an architect or aspiring architect.

Enter your e-mail address

Select your country

We protect your privacy.

InfoQ Homepage Artificial Intelligence Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Microsoft and Tsinghua University Present DIFF Transformer for LLMs

Researchers from Microsoft AI and Tsinghua University have introduced a new architecture called the Differential Transformer (DIFF Transformer), aimed at improving the performance of large language models. This model enhances attention mechanisms by refining how models handle context and minimizing distractions from irrelevant information.

Daniel Dominguez
on Oct 20, 2024
.NET

Microsoft Releases Preview of AI Integration Libraries for .NET

Last week, Microsoft announced the preview release of two libraries: Microsoft.Extensions.AI.Abstractions and Microsoft.Extensions.AI. These packages, referred to as Unified AI Building Blocks, provide the .NET ecosystem with essential abstractions for integrating artificial intelligence (AI) services into .NET applications and libraries, along with middleware to enhance key capabilities.

Almir Vuk
on Oct 18, 2024
AI, ML & Data Engineering

NVIDIA Unveils NVLM 1.0: Open-Source Multimodal LLM with Improved Text and Vision Capabilities

NVIDIA unveiled NVLM 1.0, an open-source multimodal large language model (LLM) that performs strongly on both vision-language and text-only tasks. NVLM 1.0 shows improvements in text-based tasks after multimodal training, standing out among current models. The model weights are now available on Hugging Face, with the training code set to be released shortly.

Robert Krzaczyński
on Oct 11, 2024
AI, ML & Data Engineering

OpenAI Developer Day 2024 (SF) Announces Real-Time API, Vision Fine-Tuning, and More

On October 1, 2024, OpenAI SF DevDay unveiled innovative features, including a Real-Time API enabling instant voice interactions and function calling. Enhanced model distillation and vision fine-tuning empower developers to customize AI for diverse applications. Upcoming events in London and Singapore will further expand these capabilities.

Andrew Hoblitzell
on Oct 10, 2024
AI, ML & Data Engineering

Meta Unveils Movie Gen, a New AI Model for Video Generation

Meta has announced Movie Gen, a new AI model designed to create high-quality 1080p videos with synchronized audio. The system enables instruction-based video editing and allows for personalized content generation using user-supplied images.

Daniel Dominguez
on Oct 08, 2024
AI, ML & Data Engineering

Meta Releases Llama 3.2 with Vision, Voice, and Open Customizable Models

Meta recently announced Llama 3.2, the latest version of Meta's open-source language model, which includes vision, voice, and open customizable models. This is the first multimodal version of the model, which will allow users to interact with visual data in ways like identifying objects in photos or editing images with natural language commands among other use cases.

Andrew Hoblitzell
on Oct 07, 2024
DevOps

Intuit Engineering's Approach to Simplifying Kubernetes Management with AI

Intuit recently talked about how they managed the complexities of monitoring and debugging Kubernetes clusters using Generative AI (GenAI). The GenAI experiments were conducted to streamline detection, debugging, and remediation processes.

Aditya Kulkarni
on Sep 29, 2024
AI, ML & Data Engineering

PyTorch Conference 2024: PyTorch 2.4/Upcoming 2.5, and Llama 3.1

The PyTorch Conference 2024, held by The Linux Foundation, showcased groundbreaking advancements in AI, featuring insights on PyTorch 2.4, Llama 3.1, and open-source projects like OLMo. Key discussions on LLM deployment, ethical AI, and innovative libraries like Torchtune and TorchChat emphasized collaboration and responsible practices in the evolving landscape of generative AI.

Andrew Hoblitzell
on Sep 26, 2024
Culture & Methods

Measuring and Reducing the Environmental Impact of Software

Software applications often manage big amounts of data; most of them are internet-based applications, and incorporate artificial intelligence. According to Coral Calero, these three aspects improve the capabilities and functionalities provided by software but they have also increased the amount of energy needed. We need to measure energy consumption of software to control its environmental impact.

Ben Linders
on Sep 26, 2024
AI, ML & Data Engineering

Anthropic Unveils Contextual Retrieval for Enhanced AI Data Handling

Anthropic has announced Contextual Retrieval, a significant advancement in AI systems' interaction with extensive knowledge bases. This technique addresses the challenge of context loss in Retrieval-Augmented Generation (RAG) systems by enriching text chunks with contextual information before embedding or indexing.

Daniel Dominguez
on Sep 25, 2024
Cloud

Microsoft Launches Azure AI Inference SDK for .NET

Microsoft launched Azure AI Inference SDK for .NET, streamlining access to generative AI models in the Azure AI Studio model catalog. This catalog includes models from providers like Azure OpenAI Service, Mistral, Meta, Cohere, NVIDIA, and Hugging Face, organized into three collections: Curated by Azure AI, Azure OpenAI Models, and Open Models from Hugging Face Hub.

Robert Krzaczyński
on Sep 24, 2024
AI, ML & Data Engineering

Stability AI Announces Integration of Top Text-to-Image Models with Amazon Bedrock

Stability AI has introduced three new text-to-image models to Amazon Bedrock: Stable Image Ultra, Stable Diffusion 3 Large, and Stable Image Core. These models focus on improving performance in multi-subject prompts, image quality, and typography. They are designed to generate high-quality visuals for various use cases in marketing, advertising, media, entertainment, retail, and more.

Daniel Dominguez
on Sep 18, 2024
Cloud

AWS Announces General Availability of EC2 P5e Instances, Powered by NVIDIA H100 Tensor Core GPUs

Amazon Web Services (AWS) has launched EC2 P5e instances featuring NVIDIA H100 Tensor Core GPUs, substantially boosting AI and HPC performance. With enhanced memory bandwidth, these instances reduce latency for real-time applications. Ideal for tasks like LLM training and simulations, they offer improved scalability and cost-efficiency, making them pivotal for modern cloud computing.

Steef-Jan Wiggers
on Sep 18, 2024
AI, ML & Data Engineering

Leveraging the Transformer Architecture for Music Recommendation on YouTube

Google has described an approach to use transformer models, which ignited the current generative AI boom, for music recommendation. This approach, which is currently being applied experimentally on YouTube, aims to build a recommender that can understand sequences of user actions when listening to music to better predict user preferences based on their context.

Sergio De Simone
on Sep 06, 2024
Culture & Methods

The Value of Using Timeless Testing Tools

According to Benjamin Bischoff, developers find new tools much more interesting than old ones, as they offer an opportunity to learn new technologies and approaches and to expand their tool belt. Using tools that have been around for decades, however, can save time and budget. When evaluating tools, it is more important to understand the problem to be solved than to jump straight into the tools.

Ben Linders
on Sep 05, 2024

Newer News

Older News

InfoQ Software Architects' Newsletter

Login with:

Don't have an InfoQ account?

News