InfoQ Homepage AIOps Content on InfoQ

News

RSS Feed

Newer Older

DevOps

NVIDIA Dynamo Planner Brings SLO-Driven Automation to Multi-Node LLM Inference

Microsoft and NVIDIA have released Part 2 of their collaboration on running NVIDIA Dynamo for large language model inference on Azure Kubernetes Service (AKS). The first announcement aimed for a raw throughput of 1.2 million tokens per second on distributed GPU systems.

Claudio Masolo
on Jan 31, 2026
DevOps

CNCF Launches Certified Kubernetes AI Conformance Program to Standardise Workloads

The CNCF has launched the Certified Kubernetes AI Conformance program to standardise artificial intelligence workloads. By establishing a technical baseline for GPU management, networking, and gang scheduling, the initiative ensures portability across cloud providers. It aims to reduce technical debt and prevent vendor lock-in as enterprises move generative AI models into production.

Mark Silvester
on Dec 30, 2025
AI, ML & Data Engineering

SIMA 2 Uses Gemini and Self-Improvement to Generalize across Unseen 3D and Photorealistic Worlds

Google DeepMind researchers introduced SIMA 2 (Scalable Instructable Multiworld Agent), a generalist agent built on the Gemini foundation model that can understand and act across multiple 3D virtual game environments. The SIMA 2 architecture uses a Gemini Flash-Lite model trained on a mixture of gameplay and Gemini pretraining data.

Vinod Goje
on Dec 29, 2025
DevOps

Michelin Drives Pragmatic Path to AIOps without a Grand Vision

Michelin's China operations group have written about how they implemented an AIOps platform. It details the missteps and organisational resistance that were overcome on the way to eventual alignment with their global IT governance, and explains how enterprises can move past vendor pitches to get to a practical deployment.

Matt Saunders
on Dec 29, 2025
AI, ML & Data Engineering

Meta Details GEM Ads Model Using LLM-Scale Training, Hybrid Parallelism, and Knowledge Transfer

Meta released details about its Generative Ads Model (GEM), a foundation model designed to improve ads recommendation across its platforms. The model addresses core challenges in recommendation systems (RecSys) by processing billions of daily user-ad interactions where meaningful signals such as clicks and conversions are very sparse.

Vinod Goje
on Dec 22, 2025
AI, ML & Data Engineering

Private AI Compute Enables Google Inference with Hardware Isolation and Ephemeral Data Design

Google announced Private AI Compute, a system designed to process AI requests using Gemini cloud models while aiming to keep user data private. The announcement positions Private AI Compute as Google's approach to addressing privacy concerns while providing cloud-based AI capabilities, building on what the company calls privacy-enhancing technologies it has developed for AI use cases.

Vinod Goje
on Nov 30, 2025
Web Development

Introducing Evalite: the TypeScript Testing Tool for AI Powered Apps

Evalite is a TypeScript-native eval runner designed for AI applications, enabling developers to create reproducible evals with rich outputs. Featuring first-class trace capture, scoring, and a user-friendly web UI, Evalite enhances testing ergonomics and iteration speed. Open-source under MIT, it seamlessly integrates with any LLM, ensuring complete data control and fostering rapid development.

Daniel Curtis
on Nov 29, 2025
AI, ML & Data Engineering

Amazon Adds A2A Protocol to Bedrock AgentCore for Interoperable Multi-Agent Workflows

Amazon announced support for the Agent-to-Agent (A2A) protocol in Amazon Bedrock AgentCore Runtime, enabling communication between agents built on different frameworks. The protocol allows agents developed with Strands Agents, OpenAI Agents SDK, LangGraph, Google ADK, or Claude Agents SDK to "share context, capabilities, and reasoning in a common, verifiable format."

Vinod Goje
on Nov 28, 2025
DevOps

Nexla Launches Express: a Conversational Platform for AI Data Engineering

Nexla recently introduced Express, a conversational data engineering platform designed to dramatically lower the barrier for building data pipelines for AI applications.

Craig Risi
on Nov 22, 2025
AI, ML & Data Engineering

Kimi's K2 Opensource Language Model Supports Dynamic Resource Availability and New Optimizer

Kimi released K2, a Mixture-of-Experts large language model with 32 billion activated parameters and 1.04 trillion total parameters, trained on 15.5 trillion tokens. The release introduces MuonClip, a new optimizer that builds on the Muon optimizer by adding a QK-clip technique designed to address training instability, which the team reports resulted in "zero loss spike" during pre-training.

Vinod Goje
on Nov 17, 2025
AI, ML & Data Engineering

Anthropic Adds Sandboxing and Web Access to Claude Code for Safer AI-Powered Coding

Anthropic released sandboxing capabilities for Claude Code and launched a web-based version of the tool that runs in isolated cloud environments. The company introduced these features to address security risks that arise when Claude Code writes, tests, and debugs code with broad access to developer codebases and files.

Vinod Goje
on Nov 14, 2025
AI, ML & Data Engineering

KubeCon NA 2025 - Salesforce’s Approach to Self-Healing Using AIOps and Agentic AI

AIOps and Agentic AI technologies can help in developing solutions to intelligently analyze Kubernetes cluster health, automatically diagnose problems, and orchestrate issue resolutions with minimal human intervention. Vikram Venkataraman and Srikanth Rajan spoke at KubeCon + CloudNativeCon NA 2025 Conference about Salesforce’s approach to self-healing systems using AIOps and AI Agents.

Srini Penchikala
on Nov 12, 2025
AI, ML & Data Engineering

New Claude Haiku 4.5 Model Promises Faster Performance at One-Third the Cost

Anthropic released Claude Haiku 4.5, making the model available to all users as its latest entry in the small, fast model category. The company positions the new model as delivering performance levels comparable to Claude Sonnet 4, which launched five months ago as a state-of-the-art model, but at "one-third the cost and more than twice the speed."

Vinod Goje
on Nov 12, 2025
AI, ML & Data Engineering

Claude Sonnet 4.5 Ranked Safest LLM from Open-Source Audit Tool Petri

Claude Sonnet 4.5 has emerged as the best-performing model in ‘risky tasks’, narrowly edging out GPT-5 in early evaluations by Petri --- Anthropic’s new open-source AI auditing tool.

Matt Foster
on Oct 07, 2025
DevOps

DORA Report Finds AI Is an Amplifier in Software Development, But Trust Remains Low

Nearly 90% of technology professionals now use artificial intelligence in their work. But according to the 2025 DORA State of AI-assisted Software Development report, there's still a significant gap in trust between developers and the tools they increasingly rely upon. The report findings found that while AI adoption has become "nearly universal," there are still some organisational challenges.

Matt Saunders
on Sep 29, 2025

Newer News

Older News

InfoQ Software Architects' Newsletter

News