AI, ML & Data Engineering Content on InfoQ

AI, ML & Data Engineering

InfoQ AI, ML and Data Engineering Trends Report - 2025

This InfoQ Trends Report offers readers a comprehensive overview of emerging trends and technologies in the areas of AI, ML, and Data Engineering. It summarizes the views of the InfoQ editorial team and external guests on current trends in AI and ML technologies and what to look out for in the next 12 months.

Srini Penchikala Savannah Kunovsky Anthony Alford Daniel Dominguez Vinod Goje
on Sep 24, 2025

AI, ML & Data Engineering

Effective Practices for Architecting a RAG Pipeline

Hybrid search, smart chunking, and domain-aware indexing are key to building effective RAG pipelines. Context window limits and prompt quality critically affect LLM response accuracy. This article provides lessons learned from setting up a RAG pipeline.

Glenn Engstrand
on Sep 03, 2025

DevOps

How Causal Reasoning Addresses the Limitations of LLMs in Observability

Large language models excel at converting observability telemetry into clear summaries but struggle with accurate root cause analysis in distributed systems. LLMs often hallucinate explanations and confuse symptoms with causes. This article suggests how causal reasoning models with Bayesian inference offer more reliable incident diagnosis.

Dhairya Dalal
on Sep 02, 2025

AI, ML & Data Engineering

MCP: the Universal Connector for Building Smarter, Modular AI Agents

In this article, the authors discuss Model Context Protocol (MCP), an open standard designed to connect AI agents with the tools and data they need. They also cover how MCP supports agent development and its adoption in leading open-source frameworks.

Sanjay Surendranath Girija Lakshit Arora Shashank Kapoor
on Aug 29, 2025

AI, ML & Data Engineering

The Missing Layer in AI Infrastructure: Aggregating Agentic Traffic

In this article, author Eyal Solomon discusses AI Gateways, the outbound proxy servers that intercept and manage AI-agent-initiated traffic in real time to enforce policies and provide central management.

Eyal Solomon
on Aug 22, 2025

Java

Infusing AI into Your Java applications

Equip yourself with the basic AI knowledge and skills you need to start building intelligent and responsive Enterprise Java applications. With the help of our simple chatbot application for booking interplanetary space trips, see how Java frameworks like LangChain4j with Quarkus make it easy and efficient to interact with LLMs and create satisfying applications for end-users.

Don Bourne Michal Broz Laura Cowen Daniel Oh Kevin Dubois
on Aug 15, 2025

AI, ML & Data Engineering

Building Reproducible ML Systems with Apache Iceberg and SparkSQL: Open Source Foundations

Traditional data lakes are great for storing massive amounts of data, but they're terrible at the transactional guarantees and versioning that ML workloads need. Apache Iceberg and SparkSQL bring database-like reliability to your data lake. Time travel, schema evolution, and ACID transactions help support reproducible machine learning experiments.

Anant Kumar
on Jul 31, 2025

Development

A First-Timer’s Guide to Curating a Technical Conference Track

One first-time track host shares the process, constraints, and takeaways from building a track from scratch at QCon London 2025.

Erica Pisani
on Jul 30, 2025

Architecture & Design

Optimizing Search Systems: Balancing Speed, Relevance, and Scalability

This article highlights key strategies for optimizing search performance in dynamic environments, drawn from the authors' QCon San Francisco 2024 presentation. It addresses challenges that platforms like Uber Eats face in data indexing and retrieval, with the goal of delivering fast, relevant results as datasets keep growing.

Janani Narayanan Karthik Ramasamy
on Jul 16, 2025

Architecture & Design

Agentic AI Architecture Framework for Enterprises

To deploy agentic AI responsibly and effectively in the enterprise, organizations must progress through a three-tier architecture (Foundation tier, Workflow tier, and Autonomous tier) in which trust, governance, and transparency precede autonomy.

Subash Natarajan Ahilan Ponnusamy
on Jul 11, 2025

Development

Effective Practices for Coding with a Chat-Based AI

In this article, we explore how AI agents are reshaping software development and the impact they have on a developer’s workflow. We introduce a practical approach to staying in control while working with these tools by adopting key best practices from the discipline of software architecture, including defining an implementation plan, splitting tasks, and so on.

Enrico Piccinin
on Jul 04, 2025

DevOps

Why Is My Docker Image So Big? A Deep Dive with ‘dive’ to Find the Bloat

AI images typically bloat from massive library installations and base OS components, with large Docker images slowing AI development and increasing costs. Chirag Agrawal demonstrates how to diagnose bloat using Docker's history and the interactive 'dive' tool to examine each layer in detail. The article shows how effective diagnosis leads to targeted optimizations.

Chirag Agrawal
on Jun 30, 2025
