InfoQ Software Architects' Newsletter

A monthly overview of things you need to know as an architect or aspiring architect.

Enter your e-mail address

Select your country

We protect your privacy.

InfoQ Homepage Artificial Intelligence Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

QCon SF 2024 - Scaling Large Language Model Serving Infrastructure at Meta

At QCon SF 2024, Ye (Charlotte) Qi of Meta tackled the complexities of scaling large language model (LLM) infrastructure, highlighting the "AI Gold Rush" challenge. She emphasized efficient hardware integration, latency optimization, and production readiness, alongside Meta's innovative approaches like hierarchical caching and automation to enhance AI performance and reliability.

Andrew Hoblitzell
on Nov 26, 2024
AI, ML & Data Engineering

Vercel Expands AI Toolkit with AI SDK 4.0 Update

Vercel has announced version 4.0 of its open-source AI SDK toolkit designed for building AI applications in JavaScript and TypeScript. The update introduces key features like PDF support, computer use integration, and a new xAI Grok API.

Daniel Dominguez
on Nov 24, 2024
AI, ML & Data Engineering

Meta Releases NotebookLlama: Open-Source PDF to Podcast Toolkit

Meta has released NotebookLlama, an open-source toolkit designed to convert PDF documents into podcasts, providing developers with a structured, accessible PDF-to-audio workflow. As an open-source alternative to Google’s NotebookLM, NotebookLlama guides users through a four-step process that converts PDF text into audio content.

Robert Krzaczyński
on Nov 17, 2024
Development

GitHub Universe 2024 Unveils AI Innovations and Developer-Centric Tools

GitHub Universe 2024 unveiled groundbreaking updates emphasizing developer autonomy and AI capabilities. With multi-model support for Copilot, the introduction of AI-driven GitHub Spark, enhanced security features, and improved workflows in popular IDEs, GitHub aims to democratize coding and empower developers, regardless of skill level, to harness the full potential of artificial intelligence.

Steef-Jan Wiggers
on Nov 17, 2024
DevOps

Thoughtworks Technology Radar Oct 2024 - From Coding Assistance to AI Evolution

Thoughtworks recently published their Technology Radar Volume 31, providing an opinionated guide to the current technology landscape. As per the Technology Radar, Generative AI and Large Language Models (LLMs) dominate, with a focus on their responsible use in software development. AI-powered coding tools are evolving, necessitating a balance between AI assistance and human expertise.

Aditya Kulkarni
on Nov 15, 2024
AI, ML & Data Engineering

Decart and Etched Release Oasis, a New AI Model Transforming Gaming Worlds

Decart.ai and Etched.ai recently introduced Oasis, an AI-driven model that generates a fully interactive, real-time open-world experience inspired by Minecraft.

Daniel Dominguez
on Nov 10, 2024
Cloud

Microsoft Unveils Enhanced AI Tools for Developers at GitHub Universe

At GitHub Universe, Microsoft unveiled powerful integrations of Azure AI with GitHub and VS Code, empowering developers with context-aware tools like GitHub Copilot for Azure and AI App Templates. These innovations streamline workflows, enhance resource management, and simplify deployment, providing developers with robust features for efficient, secure application building and experimentation.

Steef-Jan Wiggers
on Nov 09, 2024
AI, ML & Data Engineering

OSI Releases New Definition for Open Source AI, Setting Standards for Transparency and Accessibility

The Open Source Initiative (OSI) released Version 1.0 of its Open Source AI Definition (OSAID) after two years of development with contributions from global experts. The OSAID sets criteria defining open-source AI, aiming to bring clarity to the concept and establish benchmarks for transparency and accessibility in AI.

Robert Krzaczyński
on Nov 07, 2024
AI, ML & Data Engineering

Amazon SageMaker JumpStart Expands Portfolio with Bria AI's Text-to-Image Models

Amazon Web Services has integrated Bria AI's latest text-to-image foundation models into Amazon SageMaker JumpStart, marking a significant expansion of its enterprise-grade generative AI capabilities. The addition includes three variants - Bria 2.3, Bria 2.2 HD, and Bria 2.3 Fast, each designed to address specific enterprise needs in visual content generation.

Vinod Goje
on Nov 06, 2024
AI, ML & Data Engineering

xAI Unveils a New API Service for Grok Models

Elon Musk’s xAI has launched a public beta for its API service, enabling developers to integrate xAI's large language models (LLMs) into their applications.

Daniel Dominguez
on Nov 05, 2024
AI, ML & Data Engineering

Meta AI Introduces Thought Preference Optimization Enabling AI Models to Think before Responding

Researchers from Meta FAIR, the University of California, Berkeley, and New York University have introduced Thought Preference Optimization (TPO), a new method aimed at improving the response quality of instruction-fine tuned LLMs.

Daniel Dominguez
on Nov 04, 2024
AI, ML & Data Engineering

Rhymes AI Unveils Aria: Open-Source Multimodal Model with Development Resources

Rhymes AI has introduced Aria, an open-source multimodal native Mixture-of-Experts (MoE) model capable of processing text, images, video, and code effectively. In benchmarking tests, Aria has outperformed other open models and demonstrated competitive performance against proprietary models such as GPT-4o and Gemini-1.5.

Robert Krzaczyński
on Oct 28, 2024
AI, ML & Data Engineering

AI and ML Tracks at QCon San Francisco 2024 – a Deep Dive into GenAI & Practical Applications

At QCon San Francisco 2024, explore two AI/ML-focused tracks highlighting real-world applications and innovations. Learn from industry experts on deploying LLMs, GenAI, and recommendation systems, gaining practical strategies for integrating AI into software development.

Artenisa Chatziou
on Oct 25, 2024
DevOps

Meta Optimizes Data Center Sustainability with Reinforcement Learning

In a recent blog post, Meta describes how its engineers use reinforcement learning (RL), to optimize environmental controls in Meta’s data centers, reducing energy consumption and water usage while addressing broader challenges such as climate change.

Claudio Masolo
on Oct 25, 2024
Cloud

Microsoft Unveils Azure Cobalt 100-Based Virtual Machines: Enhanced Performance and Sustainability

Microsoft's Azure Cobalt 100 VMs are now generally available. They deliver up to 50% improved price performance with energy-efficient Arm architecture. Tailored for diverse workloads, these VMs offer various configurations, including general-purpose and memory-optimized options. Their release supports sustainable computing, aligning with Microsoft's commitment to lower carbon footprints.

Steef-Jan Wiggers
on Oct 24, 2024

Newer News

Older News

InfoQ Software Architects' Newsletter

Login with:

Don't have an InfoQ account?

News