AI, ML & Data Engineering Content on InfoQ
-
Efficient Incremental Processing with Netflix Maestro and Apache Iceberg
Jun He discusses how to use an incremental processing solution (IPS) to build more reliable, efficient, and scalable data pipelines, unlocking new data processing patterns.
-
Taking LLMs out of the Black Box: A Practical Guide to Human-in-the-Loop Distillation
Ines Montani discusses practical solutions for using the latest LLMs in real-world applications and explores how to distill knowledge into smaller and faster components.
-
Scale out Batch Inference with Ray
Cody Yu discusses how to build a scalable and efficient batch inference stack using Ray.
-
Data Mesh Architecture Applied to Complex Organizations
Nandakumar Heble looks at the basic construct of a data mesh and how one might go about applying it.
-
Why Most Machine Learning Projects Fail to Reach Production and How to Beat the Odds
Wenjie Zi discusses common pitfalls that cause these failures, such as the inherent uncertainty of machine learning, misaligned optimization objectives, and skill gaps among practitioners.
-
High Performance Time-Series Database Design with QuestDB
Vlad Ilyushchenko discusses geographical data distribution, simplifying data pipelines with HA writes, data visualization with SQL extensions, and providing data scientists with scalable data access.
-
Dataflow-Based Query Caching with Readyset
Alana Marzoev discusses the fundamentals of streaming dataflow and the architecture of ReadySet, a streaming dataflow system designed for operational workloads.
-
Trends in InfoSec: Data Minimization, Autoclassification, and Ethical AI
Rachael Greaves provides a summary of the requirements for data lifecycle management, the technology approaches, and the risks, and includes a Data Minimization Best Practice Checklist.
-
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik discusses best practices in model optimization, serving, and monitoring, with practical tips and real case studies.
-
Improving Developer Experience Using Automated Data CI/CD Pipelines
Noémi Ványi and Simona Pencea discuss a code and data branching strategy that allows your data to follow your code.
-
Launching AI Agents across Europe at Breakneck Speed with an Agent Computing Platform
Arun Joseph and Patrick Whelan discuss patterns for productionizing LLM-powered applications, the AI agent development lifecycle, and the toolkits, components, and building blocks of an AI agent computing platform.
-
The Latest in the World of Web Engineering (Featuring AI)
Tejas Kumar gives an overview of web engineering in relation to AI, covering AI engineering, intelligent answering engines, updates on CSS, HTML, and JavaScript, and personal health and productivity.