InfoQ Homepage Large language models Content on InfoQ

News

RSS Feed

Newer Older

DevOps

Datadog Integrates Google Agent Development Kit into LLM Observability Tools

Datadog recently announced that its LLM Observability platform now provides automatic instrumentation for applications built with Google's Agent Development Kit (ADK), offering deeper visibility into the behavior, performance, cost, and safety of AI-driven agentic systems.

Craig Risi
on Feb 06, 2026
AI, ML & Data Engineering

Vercel Introduces Skills.sh, an Open Ecosystem for Agent Commands

Vercel has released Skills.sh, an open-source tool designed to provide AI agents with a standardized way to execute reusable actions, or skills, through the command line.

Daniel Dominguez
on Feb 04, 2026
AI, ML & Data Engineering

Agent Trace: Cursor Proposes an Open Specification for AI Code Attribution

Cursor has published Agent Trace, a draft open specification aimed at standardizing how AI-generated code is attributed in software projects. Released as a Request for Comments (RFC), the proposal defines a vendor-neutral format for recording AI contributions alongside human authorship in version-controlled codebases.

Robert Krzaczyński
on Feb 04, 2026
AI, ML & Data Engineering

MongoDB Introduces Embedding and Reranking API on Atlas

MongoDB has recently announced the public preview of its Embedding and Reranking API on MongoDB Atlas. The new API gives developers direct access to Voyage AI’s search models within the managed cloud database, enabling them to create features such as semantic search and AI-powered assistants within a single integrated environment, with consolidated monitoring and billing.

Renato Losio
on Feb 03, 2026
AI, ML & Data Engineering

Open Responses Specification Enables Unified Agentic LLM Workflows

OpenAI's Open Responses standardizes agentic AI workflows, tackling API fragmentation and enabling seamless transitions between proprietary and open-source models. Supported by partners like Hugging Face and Vercel, this specification enhances reasoning visibility and tool execution, streamlining complex workflows and boosting productivity for developers. Empower your AI integration today!

Hien Luu
on Feb 02, 2026
Cloud

Cloudflare's Matrix Homeserver Demo Sparks Debate over AI-Generated Code Claims

A Cloudflare blog post claiming a "production-grade" Matrix homeserver on Workers didn't survive community scrutiny. Missing federation, incomplete encryption, and TODO comments in authentication logic pointed to unreviewed AI output. Matrix's Matthew Hodgson welcomed the effort but noted the implementation "doesn't yet constitute a functional Matrix server."

Steef-Jan Wiggers
on Feb 02, 2026
AI, ML & Data Engineering

OpenAI Launches Prism, a Free LaTeX-Native Workspace with Integrated GPT-5.2

OpenAI has released Prism, a free, cloud-based LaTeX workspace designed for academic writing and collaboration, with GPT-5.2 integrated directly into the authoring environment. The platform combines document editing, compilation, citation management, and AI-assisted revision in a single web-based workspace, aimed at researchers producing long-form scientific documents.

Robert Krzaczyński
on Jan 31, 2026
AI, ML & Data Engineering

Anthropic Releases Updated Constitution for Claude

Anthropic has published an updated constitution for Claude, its AI assistant, providing a structured framework that guides behavior, reasoning, and training. The constitution combines explicit principles with contextual guidance, making it a practical tool for improving alignment, safety, and reliability in real-world interactions.

Robert Krzaczyński
on Jan 30, 2026
AI, ML & Data Engineering

Google DeepMind Introduces ATLAS Scaling Laws for Multilingual Language Models

Google DeepMind researchers have introduced ATLAS, a set of scaling laws for multilingual language models that formalize how model size, training data volume, and language mixtures interact as the number of supported languages increases.

Robert Krzaczyński
on Jan 29, 2026
AI, ML & Data Engineering

Google Introduces TranslateGemma Open Models for Multilingual Translation

Google has released TranslateGemma, a set of open translation models based on the Gemma 3 architecture, offering 4B, 12B, and 27B parameter variants designed to support machine translation across 55 languages and to run on platforms ranging from mobile and edge devices to consumer hardware and cloud accelerators.

Daniel Dominguez
on Jan 28, 2026
Mobile

Google Releases Gemma 3 270M Variant Optimized for Function Calling on Mobile and Edge Devices

FunctionGemma is a new, lightweight version of the Gemma 3 270M model, fine-tuned to translate natural language into structured function and API calls, enabling AI agents to "do more than just talk" and act.

Sergio De Simone
on Jan 26, 2026
Architecture & Design

DoorDash Applies AI to Safety across Chat and Calls, Cutting Incidents by 50%

DoorDash deploys SafeChat, an AI-driven safety system for moderating chat, images, and voice calls between Dashers and customers. Using a layered text moderation architecture, machine learning models, and human review, SafeChat detects unsafe content in real time, enabling immediate actions and reducing low- and medium-severity safety incidents by roughly 50 percent.

Leela Kumili
on Jan 23, 2026
AI, ML & Data Engineering

How CyberArk Protects AI Agents with Instruction Detectors and History-Aware Validation

To prevent agents from obeying malicious instructions hidden in external data, all text entering an agent's context must be treated as untrusted, says Niv Rabin, principal software architect at AI-security firm CyberArk. His team developed an approach based on instruction detection and history-aware validation to protect against both malicious input data and context-history poisoning.

Sergio De Simone
on Jan 20, 2026
AI, ML & Data Engineering

MIT's Recursive Language Models Improve Performance on Long-Context Tasks

Researchers at MIT's CSAIL published a design for Recursive Language Models (RLM), a technique for improving LLM performance on long-context tasks. RLMs use a programming environment to recursively decompose and process inputs, and can handle prompts up to 100x longer than base LLMs.

Anthony Alford
on Jan 20, 2026
AI, ML & Data Engineering

Hugging Face Releases FineTranslations, a Trillion-Token Multilingual Parallel Text Dataset

Hugging Face has released FineTranslations, a large-scale multilingual dataset containing more than 1 trillion tokens of parallel text across English and 500+ languages. The dataset was created by translating non-English content from the FineWeb2 corpus into English using Gemma3 27B, with the full data generation pipeline designed to be reproducible and publicly documented.

Robert Krzaczyński
on Jan 18, 2026

Newer News

Older News

InfoQ Software Architects' Newsletter

News