AI, ML & Data Engineering Content on InfoQ

Articles

AI, ML & Data Engineering

AI Trends Disrupting Software Teams

In this article, author Bilgin Ibryam discusses various AI trends disrupting the overall software development process and tools, and how these trends are influencing different IT teams like developers, operations, technical writers, and SaaS service providers.

Bilgin Ibryam
on Mar 21, 2025
AI, ML & Data Engineering

Beyond Notebook: Building Observable Machine Learning Systems

In this article, the author discusses a machine learning pipeline with built-in observability for a credit card fraud detection use case, using tools like MLflow, FastAPI, Streamlit, Apache Kafka, Prometheus, Grafana, and Evidently AI.

Lakshmithejaswi Narasannagari
on Mar 14, 2025
AI, ML & Data Engineering

Secure AI-Powered Early Detection System for Medical Data Analysis & Diagnosis

In this article, the author discusses techniques for securing AI applications in healthcare, with a use case of an early detection system for medical data analysis & diagnosis. The proposed layered architecture includes application components to support secure computation, AI modeling, governance and compliance, and monitoring and auditing.

Mahesh Vaijainthymala Krishnamoorthy
on Mar 03, 2025
AI, ML & Data Engineering

Prompt Engineering: Challenges, Strengths, and Its Place in Software Development's Future

Prompt engineering is evolving as a crucial skill that bridges AI communication and programming, blending creativity and precision to shape the future of software development. That future might involve a synergistic blend of prompt engineering and traditional programming: prompt engineering can accelerate prototyping and enhance interactivity, while traditional programming ensures robustness.

Hien Luu
on Feb 24, 2025
Architecture & Design

2025 Article Contest: Win Your Conference Ticket

The InfoQ Team is excited to invite you to participate in our annual article writing competition. Authors of top-rated articles will win complimentary tickets to prominent software development conferences such as QCon and InfoQ Dev Summit.

InfoQ
on Feb 17, 2025
AI, ML & Data Engineering

Eclipse LMOS: Launching AI Agents across Europe at Breakneck Speed

In this talk, the authors share some of their company’s key learnings in developing customer-facing LLM-powered applications deployed across Europe. They used multi-agent architecture and systems design to create an open-source set of tools, a framework, and a full-fledged platform to accelerate the development of AI agents. This is a summary of a presentation from InfoQ Dev Summit Boston 2024.

Arun Joseph, Patrick Whelan
on Feb 17, 2025
AI, ML & Data Engineering

Building Trust in AI: Security and Risks in Highly Regulated Industries

Explore the transformative power of responsible AI across industries, emphasizing security, MLOps, and compliance. As AI drives innovation—from predicting hurricanes to enhancing legal workflows—organizations must prioritize ethical practices, transparency, and robust governance to safeguard sensitive data while navigating an evolving regulatory landscape.

Stefania Chaplin, Azhir Mahmood
on Feb 10, 2025
AI, ML & Data Engineering

Launching GenAI Productivity Tools: Insights and Lessons

In this article, based on a talk at QCon San Francisco 2024, author Mandy Gu shares some of the ways her company uses GenAI to enhance productivity and the lessons they learned along the way, including failed bets and features that were rolled back because of low user adoption. Most important, they learned to focus on building tools that were aligned with business goals.

Mandy Gu
on Feb 06, 2025
AI, ML & Data Engineering

Prompt Injection for Large Language Models

This article covers two common attack vectors against large language models and the tools based on them: prompt injection and prompt stealing. It also introduces three approaches to making your LLM-based systems and tools less vulnerable to these kinds of attacks (fine-tuning, adversarial detectors, and system prompt hardening) and reviews their benefits and limitations.

Georg Dresler
on Feb 03, 2025
Architecture & Design

The End of the Bronze Age: Rethinking the Medallion Architecture

A shift left approach to data processing relies on data products that form the basis of data communication across the business. This addresses many flaws in traditional data processing and makes data more relevant, complete, and trustworthy.

Adam Bellemare
on Jan 29, 2025
AI, ML & Data Engineering

Elevate Developer Experience with Generative AI Capabilities on AWS

This is a summary of a talk I gave at InfoQ Dev Summit Munich 2024, in which I discussed the transformative potential of generative AI in enhancing developer experiences, particularly through AWS. In this article, I introduce key tools like Amazon Bedrock, Code Review Assistant, Agentic Code Generation, and Code Summarization.

Olalekan Elesin
on Jan 27, 2025
AI, ML & Data Engineering

A Framework for Building Micro Metrics for LLM System Evaluation

LLM accuracy is a challenging topic to address and is much more multi-dimensional than a simple accuracy score. Denys Linkov introduces a framework for creating micro metrics to evaluate LLM systems, focusing on goal-aligned metrics that improve performance and reliability. By adopting an iterative "crawl, walk, run" methodology, teams can incrementally develop observability.

Denys Linkov
on Jan 21, 2025
