InfoQ Homepage Large language models Content on InfoQ
-
Datadog Integrates Google Agent Development Kit into LLM Observability Tools
Datadog recently announced that its LLM Observability platform now provides automatic instrumentation for applications built with Google's Agent Development Kit (ADK), offering deeper visibility into the behavior, performance, cost, and safety of AI-driven agentic systems.
-
Vercel Introduces Skills.sh, an Open Ecosystem for Agent Commands
Vercel has released Skills.sh, an open-source tool designed to provide AI agents with a standardized way to execute reusable actions, or skills, through the command line.
-
Agent Trace: Cursor Proposes an Open Specification for AI Code Attribution
Cursor has published Agent Trace, a draft open specification aimed at standardizing how AI-generated code is attributed in software projects. Released as a Request for Comments (RFC), the proposal defines a vendor-neutral format for recording AI contributions alongside human authorship in version-controlled codebases.
-
MongoDB Introduces Embedding and Reranking API on Atlas
MongoDB has recently announced the public preview of its Embedding and Reranking API on MongoDB Atlas. The new API gives developers direct access to Voyage AI’s search models within the managed cloud database, enabling them to create features such as semantic search and AI-powered assistants within a single integrated environment, with consolidated monitoring and billing.
-
Open Responses Specification Enables Unified Agentic LLM Workflows
OpenAI's Open Responses standardizes agentic AI workflows, tackling API fragmentation and enabling seamless transitions between proprietary and open-source models. Supported by partners like Hugging Face and Vercel, this specification enhances reasoning visibility and tool execution, streamlining complex workflows and boosting productivity for developers. Empower your AI integration today!
-
Cloudflare's Matrix Homeserver Demo Sparks Debate over AI-Generated Code Claims
A Cloudflare blog post claiming a "production-grade" Matrix homeserver on Workers didn't survive community scrutiny. Missing federation, incomplete encryption, and TODO comments in authentication logic pointed to unreviewed AI output. Matrix's Matthew Hodgson welcomed the effort but noted the implementation "doesn't yet constitute a functional Matrix server."
-
OpenAI Launches Prism, a Free LaTeX-Native Workspace with Integrated GPT-5.2
OpenAI has released Prism, a free, cloud-based LaTeX workspace designed for academic writing and collaboration, with GPT-5.2 integrated directly into the authoring environment. The platform combines document editing, compilation, citation management, and AI-assisted revision in a single web-based workspace, aimed at researchers producing long-form scientific documents.
-
Anthropic Releases Updated Constitution for Claude
Anthropic has published an updated constitution for Claude, its AI assistant, providing a structured framework that guides behavior, reasoning, and training. The constitution combines explicit principles with contextual guidance, making it a practical tool for improving alignment, safety, and reliability in real-world interactions.
-
Google DeepMind Introduces ATLAS Scaling Laws for Multilingual Language Models
Google DeepMind researchers have introduced ATLAS, a set of scaling laws for multilingual language models that formalize how model size, training data volume, and language mixtures interact as the number of supported languages increases.
-
Google Introduces TranslateGemma Open Models for Multilingual Translation
Google has released TranslateGemma, a set of open translation models based on the Gemma 3 architecture, offering 4B, 12B, and 27B parameter variants designed to support machine translation across 55 languages and to run on platforms ranging from mobile and edge devices to consumer hardware and cloud accelerators.
-
Google Releases Gemma 3 270M Variant Optimized for Function Calling on Mobile and Edge Devices
FunctionGemma is a new, lightweight version of the Gemma 3 270M model, fine-tuned to translate natural language into structured function and API calls, enabling AI agents to "do more than just talk" and act.
-
DoorDash Applies AI to Safety across Chat and Calls, Cutting Incidents by 50%
DoorDash deploys SafeChat, an AI-driven safety system for moderating chat, images, and voice calls between Dashers and customers. Using a layered text moderation architecture, machine learning models, and human review, SafeChat detects unsafe content in real time, enabling immediate actions and reducing low- and medium-severity safety incidents by roughly 50 percent.
-
How CyberArk Protects AI Agents with Instruction Detectors and History-Aware Validation
To prevent agents from obeying malicious instructions hidden in external data, all text entering an agent's context must be treated as untrusted, says Niv Rabin, principal software architect at AI-security firm CyberArk. His team developed an approach based on instruction detection and history-aware validation to protect against both malicious input data and context-history poisoning.
-
MIT's Recursive Language Models Improve Performance on Long-Context Tasks
Researchers at MIT's CSAIL published a design for Recursive Language Models (RLM), a technique for improving LLM performance on long-context tasks. RLMs use a programming environment to recursively decompose and process inputs, and can handle prompts up to 100x longer than base LLMs.
-
Hugging Face Releases FineTranslations, a Trillion-Token Multilingual Parallel Text Dataset
Hugging Face has released FineTranslations, a large-scale multilingual dataset containing more than 1 trillion tokens of parallel text across English and 500+ languages. The dataset was created by translating non-English content from the FineWeb2 corpus into English using Gemma3 27B, with the full data generation pipeline designed to be reproducible and publicly documented.