BT

InfoQ Software Architects' Newsletter

A monthly overview of things you need to know as an architect or aspiring architect.

View an example

We protect your privacy.

Facilitating the Spread of Knowledge and Innovation in Professional Software Development

Write for InfoQ

Logo - Back to homepage

News Articles Presentations Podcasts Guides

Topics

Development

Featured in Development

Rebuilding Prime Video UI with Rust and WebAssembly

Alexandru Ene features details of a new UI SDK in Rust for Prime Video that targets living room devices.

All in development

Architecture & Design

Featured in Architecture & Design

How GitHub Copilot Serves 400 Million Completion Requests a Day

David Cheney discusses the intricate architecture of GitHub Copilot's code completion service, explaining the challenges of achieving low-latency responses for millions of daily requests. He delves into HTTP/2 optimizations, global scaling strategies, and the critical role of their internal proxy.

All in architecture-design

AI Infrastructure

Featured in AI, ML & Data Engineering

AI Trends Disrupting Software Teams

In this article, author Bilgin Ibryam discusses various AI trends disrupting the overall software development process and tools, and how these trends are influencing different IT teams like developers, operations, technical writers, and SaaS service providers.

All in ai-ml-data-eng

Culture & Methods

Featured in Culture & Methods

A Platform Engineering Journey: Copy and Paste Deployments to Full GitOps

Jemma Hussein Allen explains practical approaches to CI/CD, GitOps, and team collaboration, aimed at enhancing the software development lifecycle. She highlights the benefits of automation, the importance of clear responsibilities, and the positive impact of psychological safety on team performance and project outcomes.

All in culture-methods

DevOps

Featured in DevOps

Checklist for Kubernetes in Production: Best Practices for SREs

This article provides SREs with a checklist for managing Kubernetes in production environments. It identifies common challenges including resource management, workload placement, high availability, health probes, storage, monitoring, and cost optimization. By implementing consistent GitOps automation across these areas, teams can significantly reduce complexity, and prevent downtime.

All in devops

Events

Helpful links

Choose your language

Discover emerging trends, insights, and real-world best practices in software development & tech leadership. Join now.

InfoQ Dev Summit Boston

Learn how senior software developers are solving the challenges you face. Register now with early bird tickets.

InfoQ Dev Summit Munich

Learn practical solutions to today's most pressing software challenges. Register now with early bird tickets.

QCon San Francisco

Explore insights, real-world best practices and solutions in software development & leadership. Register now.

InfoQ Homepage Machine Learning Content on InfoQ

Articles

RSS Feed

Newer Older

AI, ML & Data Engineering

Beyond Notebook: Building Observable Machine Learning Systems

In this article, the author discusses a machine learning pipeline with observability built-in for credit card fraud detection use case, with tools like MLflow, FastAPI, Streamlit, Apache Kafka, Prometheus, Grafana, and Evidently AI.

Lakshmithejaswi Narasannagari
on Mar 14, 2025
AI, ML & Data Engineering

Secure AI-Powered Early Detection System for Medical Data Analysis & Diagnosis

In this article, the author discusses the techniques for securing AI applications in healthcare with an use case of early detection system for medical data analysis & diagnosis. The proposed layered architecture includes application components to support secure computation, ai modeling, governance and compliance, and monitoring and auditing.

Mahesh Vaijainthymala Krishnamoorthy
on Mar 03, 2025
AI, ML & Data Engineering

Building Trust in AI: Security and Risks in Highly Regulated Industries

Explore the transformative power of responsible AI across industries, emphasizing security, MLOps, and compliance. As AI drives innovation—from predicting hurricanes to enhancing legal workflows—organizations must prioritize ethical practices, transparency, and robust governance to safeguard sensitive data while navigating an evolving regulatory landscape.

Stefania Chaplin Azhir Mahmood
on Feb 10, 2025
AI, ML & Data Engineering

A Framework for Building Micro Metrics for LLM System Evaluation

LLM accuracy is a challenging topic to address and is much more multi-dimensional than a simple accuracy score. Denys Linkov introduces a framework for creating micro metrics to evaluate LLM systems, focusing on goal-aligned metrics that improve performance and reliability. By adopting an iterative "crawl, walk, run" methodology, teams can incrementally develop observability.

Denys Linkov
on Jan 21, 2025
Culture & Methods

Reaching Your Automatic Testing Goals by Enhancing Your Test Architecture

If you have automatic end-to-end tests, you have test architecture, even if you’ve never given it a thought. Test architecture encompasses everything from code to more theoretical concerns like enterprise architecture, but with concrete, immediate consequences. Let's explore how you can achieve the goals you have for your automatic testing effort.

James Westfall
on Dec 04, 2024
AI, ML & Data Engineering

Efficient Resource Management with Small Language Models (SLMs) in Edge Computing

Small Language Models (SLMs) bring AI inference to the edge without overwhelming the resource-constrained devices. In this article, author Suruchi Shah dives into how SLMs can be used in edge computing applications for learning and adapting to patterns in real-time, reducing the computational burden and making edge devices smarter.

Suruchi Shah
on Nov 11, 2024
AI, ML & Data Engineering

Article Series: Practical Applications of Generative AI

Generative AI (GenAI) has become a major component of the artificial intelligence (AI) and machine learning (ML) industry. However, using GenAI comes with challenges and risks. In the InfoQ "Practical Applications of Generative AI" article series, we present real-world solutions and hands-on practices from leading GenAI practitioners.

Anthony Alford
on Sep 17, 2024
AI, ML & Data Engineering

Llama 3 in Action: Deployment Strategies and Advanced Functionality for Real-World Applications

This article details the enhanced capabilities of the open-source Llama 3 LLM, and how businesses can adopt the model in their applications. The author gives step-by-step instructions for deploying Llama 3 in the cloud or on-premise, and how to leverage fine-tuned versions for specific tasks.

Tingyi Li
on Sep 17, 2024
AI, ML & Data Engineering

InfoQ AI, ML and Data Engineering Trends Report - September 2024

InfoQ editorial staff and friends of InfoQ are discussing the current trends in the domain of AI, ML and Data Engineering as part of the process of creating our annual trends report.

Srini Penchikala Mandy Gu Namee Oberst Roland Meertens Anthony Alford Daniel Dominguez
on Sep 06, 2024
AI, ML & Data Engineering

Adding a Natural Language Interface to Your Application

In this article, author Ashley Davis discusses how to add a natural language interface to a chatbot application using OpenAI REST API. He also shows how to extend the chatbot by adding voice commands using MediaRecorder API and OpenAI's speech transcription API.

Ashley Davis
on Apr 02, 2024
AI, ML & Data Engineering

Unpacking How Ad Ranking Works at Pinterest

Aayush Mudgal describes how Pinterest serves advertisements. He discussed in detail how Machine Learning is used to serve ads at large scale. He went over ads marketplaces and the ad delivery funnel, the ad serving architecture, and two of the main problems: ad retrieval and ranking. Finally, he discussed some of the challenges and solutions for training and serving large models.

Anthony Alford
on Mar 26, 2024
Culture & Methods

Testing Machine Learning: Insight and Experience from Using Simulators to Test Trained Functionality

When testing machine learning systems, we must apply existing test processes and methods differently. Machine Learning applications consist of a few lines of code, with complex networks of weighted data points that form the implementation. The data used in training is where the functionality is ultimately defined, and that is where you will find your issues and bugs.

Martin Karsberg
on Mar 07, 2024

Newer Articles

Older Articles

BT