Nexa AI Unveils Omnivision: a Compact Vision-Language Model for Edge AI
Nexa AI unveiled Omnivision, a compact vision-language model tailored for edge devices. By cutting image tokens ninefold, from 729 to 81, Omnivision lowers latency and computational requirements while maintaining strong performance in tasks like visual question answering and image captioning.
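The announcement attributes the savings to compressing patch embeddings in the projection layer. Below is a minimal PyTorch sketch of one way such a 9x reduction can work, merging each 3x3 block of a 27x27 patch grid into a single token; the dimensions and the exact mechanism are illustrative assumptions rather than Nexa AI's published implementation.

```python
import torch

# Illustrative sketch: compress a 27x27 grid of image-patch embeddings
# (729 tokens) into a 9x9 grid (81 tokens) by concatenating each 3x3 block
# of neighbors and projecting back to the hidden size. Dimensions are assumed.
batch, dim = 1, 1152
patches = torch.randn(batch, 729, dim)          # 27 x 27 = 729 vision tokens

grid = patches.view(batch, 27, 27, dim)
blocks = grid.view(batch, 9, 3, 9, 3, dim)      # carve the grid into 3x3 blocks
blocks = blocks.permute(0, 1, 3, 2, 4, 5)       # (batch, 9, 9, 3, 3, dim)
merged = blocks.reshape(batch, 81, 9 * dim)     # 81 tokens, each 9x wider

projector = torch.nn.Linear(9 * dim, dim)       # map back to the model's hidden size
compressed = projector(merged)                  # (batch, 81, dim)
print(compressed.shape)                         # torch.Size([1, 81, 1152])
```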
-
QCon SF 2024 - Ten Reasons Your Multi-Agent Workflows Fail
At QCon SF 2024, Victor Dibia from Microsoft Research explored the complexities of multi-agent systems powered by generative AI. Highlighting common pitfalls like inadequate prompts and poor orchestration, he shared strategies for enhancing reliability and scalability. Dibia emphasized the need for meticulous design and oversight to unlock the full potential of these innovative systems.
-
QCon SF 2024 - Scaling Large Language Model Serving Infrastructure at Meta
At QCon SF 2024, Ye (Charlotte) Qi of Meta tackled the complexities of scaling large language model (LLM) infrastructure, highlighting the "AI Gold Rush" challenge. She emphasized efficient hardware integration, latency optimization, and production readiness, alongside Meta's innovative approaches like hierarchical caching and automation to enhance AI performance and reliability.
-
Meta Releases NotebookLlama: Open-Source PDF to Podcast Toolkit
Meta has released NotebookLlama, an open-source toolkit designed to convert PDF documents into podcasts, providing developers with a structured, accessible PDF-to-audio workflow. As an open-source alternative to Google’s NotebookLM, NotebookLlama guides users through a four-step process that converts PDF text into audio content.
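The four steps roughly cover text extraction, transcript writing, a dramatization pass, and text-to-speech. The sketch below outlines that flow in Python; the function names are illustrative stand-ins rather than NotebookLlama's actual API, since the toolkit itself ships as a series of notebooks.

```python
from pypdf import PdfReader

def extract_text(pdf_path: str) -> str:
    # Step 1: pre-process the PDF into plain text.
    return "\n".join(page.extract_text() or "" for page in PdfReader(pdf_path).pages)

def write_transcript(text: str) -> str:
    # Step 2: prompt an LLM to draft a podcast transcript from the text.
    return text  # placeholder for an LLM call

def dramatize(transcript: str) -> str:
    # Step 3: rewrite the transcript as a livelier two-speaker dialogue.
    return transcript  # placeholder for a second LLM pass

def synthesize(dialogue: str, out_path: str) -> None:
    # Step 4: run text-to-speech over each speaker turn and stitch the audio.
    ...

if __name__ == "__main__":
    synthesize(dramatize(write_transcript(extract_text("paper.pdf"))), "episode.wav")
```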
-
Native Vector Support in Azure SQL Database in Public Preview
Azure SQL Database now supports native vector storage and processing, streamlining AI development by integrating vector search with standard SQL queries. The update simplifies database management and improves performance by removing the need to move data into a separate vector store, enabling context-aware features such as semantic search and recommendations in sectors like e-commerce and healthcare.
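A minimal sketch of what the preview enables from Python via pyodbc: store embeddings in a native VECTOR column and rank rows by cosine distance directly in T-SQL. The connection string and table are placeholders, and the VECTOR / VECTOR_DISTANCE syntax reflects the public preview, so it may change before general availability.

```python
import json
import pyodbc

conn = pyodbc.connect("DSN=AzureSqlDb")  # placeholder connection
cur = conn.cursor()

# Embeddings live next to relational columns in a native VECTOR type.
cur.execute("""
    CREATE TABLE dbo.Products (
        id INT PRIMARY KEY,
        name NVARCHAR(200),
        embedding VECTOR(3)   -- real workloads typically use e.g. 1536 dimensions
    )
""")
cur.execute(
    "INSERT INTO dbo.Products VALUES (1, N'widget', CAST(? AS VECTOR(3)))",
    [json.dumps([0.1, 0.9, 0.0])],
)
conn.commit()

# Vector search is an ordinary SQL query: no data leaves the database.
query_vec = json.dumps([0.2, 0.8, 0.1])
cur.execute("""
    SELECT TOP (5) id, name,
           VECTOR_DISTANCE('cosine', embedding, CAST(? AS VECTOR(3))) AS dist
    FROM dbo.Products
    ORDER BY dist
""", [query_vec])
print(cur.fetchall())
```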
-
OSI Releases New Definition for Open Source AI, Setting Standards for Transparency and Accessibility
The Open Source Initiative (OSI) released Version 1.0 of its Open Source AI Definition (OSAID) after two years of development with contributions from global experts. The OSAID sets criteria defining open-source AI, aiming to bring clarity to the concept and establish benchmarks for transparency and accessibility in AI.
-
Amazon SageMaker JumpStart Expands Portfolio with Bria AI's Text-to-Image Models
Amazon Web Services has integrated Bria AI's latest text-to-image foundation models into Amazon SageMaker JumpStart, marking a significant expansion of its enterprise-grade generative AI capabilities. The addition includes three variants: Bria 2.3, Bria 2.2 HD, and Bria 2.3 Fast, each designed to address specific enterprise needs in visual content generation.
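Deployment follows the usual JumpStart pattern with the SageMaker Python SDK, sketched below. The model_id value is a guess at the catalog identifier and the payload shape is assumed; check the JumpStart model catalog and the Bria model card for the exact names before running.

```python
from sagemaker.jumpstart.model import JumpStartModel

# Hypothetical catalog identifier; look up the real one in SageMaker JumpStart.
model = JumpStartModel(model_id="bria-2-3")
predictor = model.deploy(accept_eula=True)   # Bria models are license-gated

# Assumed payload shape for a text-to-image request.
response = predictor.predict({"prompt": "a watercolor city skyline at dusk"})

predictor.delete_endpoint()                  # avoid idle-endpoint charges
```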
-
Rhymes AI Unveils Aria: Open-Source Multimodal Model with Development Resources
Rhymes AI has introduced Aria, an open-source multimodal native Mixture-of-Experts (MoE) model capable of processing text, images, video, and code effectively. In benchmarking tests, Aria has outperformed other open models and demonstrated competitive performance against proprietary models such as GPT-4o and Gemini-1.5.
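Since the weights are open, the model can be tried directly from Hugging Face with the transformers library. The sketch below assumes the repo id rhymes-ai/Aria and the remote-code loading pattern typical of such releases; consult the model card for the exact prompt format.

```python
import torch
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "rhymes-ai/Aria"  # assumed repo id for the open release
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto", trust_remote_code=True
)

image = Image.open("chart.png")
messages = [{"role": "user", "content": [
    {"type": "image"},
    {"type": "text", "text": "Summarize what this chart shows."},
]}]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=200)
print(processor.decode(output[0], skip_special_tokens=True))
```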
-
AI and ML Tracks at QCon San Francisco 2024 – a Deep Dive into GenAI & Practical Applications
At QCon San Francisco 2024, explore two AI/ML-focused tracks highlighting real-world applications and innovations. Learn from industry experts on deploying LLMs, GenAI, and recommendation systems, gaining practical strategies for integrating AI into software development.
-
NVIDIA Unveils NVLM 1.0: Open-Source Multimodal LLM with Improved Text and Vision Capabilities
NVIDIA unveiled NVLM 1.0, an open-source multimodal large language model (LLM) that performs strongly on both vision-language and text-only tasks. NVLM 1.0 shows improvements in text-based tasks after multimodal training, standing out among current models. The model weights are now available on Hugging Face, with the training code set to be released shortly.
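A hedged sketch of pulling the published weights down with transformers is shown below; the repo id nvidia/NVLM-D-72B is assumed from the release, loading requires trust_remote_code, and a 72B-parameter model needs multiple GPUs. Prompt formatting and image handling come from the repo's custom code, so follow the Hugging Face model card for the documented chat interface.

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_id = "nvidia/NVLM-D-72B"  # assumed repo id for the released weights
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModel.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",          # shard the 72B weights across available GPUs
    trust_remote_code=True,
).eval()
# Generation and image inputs are exposed by the repo's custom modeling code;
# see the model card for the supported chat/generate calls.
```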
-
Hugging Face Upgrades Open LLM Leaderboard v2 for Enhanced AI Model Comparison
Hugging Face has recently released Open LLM Leaderboard v2, an upgraded version of their benchmarking platform for large language models. Hugging Face created the Open LLM Leaderboard to provide a standardized evaluation setup for reference models, ensuring reproducible and comparable results.
-
Meta Releases Llama 3.2 with Vision, Voice, and Open Customizable Models
Meta recently announced Llama 3.2, the latest version of Meta's open-source language model, which includes vision, voice, and open customizable models. This is the first multimodal version of the model, allowing users to interact with visual data, for example by identifying objects in photos or editing images with natural-language commands.
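The vision-instruct variants are available through the transformers library; the sketch below assumes the gated repo meta-llama/Llama-3.2-11B-Vision-Instruct and requires accepting Meta's license on Hugging Face.

```python
import torch
from PIL import Image
from transformers import AutoProcessor, MllamaForConditionalGeneration

model_id = "meta-llama/Llama-3.2-11B-Vision-Instruct"  # gated checkpoint
processor = AutoProcessor.from_pretrained(model_id)
model = MllamaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

image = Image.open("street.jpg")
messages = [{"role": "user", "content": [
    {"type": "image"},
    {"type": "text", "text": "What objects can you see in this photo?"},
]}]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(image, prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=100)
print(processor.decode(output[0], skip_special_tokens=True))
```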
-
OpenAI Releases Stable Version of .NET Library with GPT-4o Support and API Enhancements
OpenAI has released the stable version of its official .NET library, following June's beta launch. Available as a NuGet package, it supports the latest models such as GPT-4o and GPT-4o mini, as well as the full OpenAI REST API. The release includes both sync and async APIs, streaming chat completions, and key breaking changes for improved API consistency.
-
PyTorch Conference 2024: PyTorch 2.4, Upcoming 2.5, and Llama 3.1
The PyTorch Conference 2024, hosted by the Linux Foundation, showcased groundbreaking advancements in AI, featuring insights on PyTorch 2.4, Llama 3.1, and open-source projects like OLMo. Key discussions on LLM deployment, ethical AI, and innovative libraries like Torchtune and TorchChat emphasized collaboration and responsible practices in the evolving landscape of generative AI.
-
Microsoft Launches Azure AI Inference SDK for .NET
Microsoft has launched the Azure AI Inference SDK for .NET, streamlining access to generative AI models in the Azure AI Studio model catalog. This catalog includes models from providers like Azure OpenAI Service, Mistral, Meta, Cohere, NVIDIA, and Hugging Face, organized into three collections: Curated by Azure AI, Azure OpenAI Models, and Open Models from Hugging Face Hub.