Apache Kafka Content on InfoQ
-
Managing 238M Memberships at Netflix
In this article, Surabhi Diwan shares how the Netflix membership team builds distributed systems: the architecture bets, technology choices, and operational semantics that serve the needs of Netflix’s ever-growing member base.
-
Building Kafka Event-Driven Applications with KafkaFlow
KafkaFlow, an open-source .NET project, simplifies the development of Kafka-based event-driven applications with features such as middleware for message processing, improving maintainability and customization while letting developers focus on business logic.
-
Tales of Kafka at Cloudflare: Lessons Learnt on the Way to 1 Trillion Messages
Cloudflare uses Kafka clusters to decouple microservices and to communicate the creation, change, or deletion of various resources in a fault-tolerant manner via protobuf, a common data format. The authors suggest investing in metrics for problem detection, prioritizing clear SDK documentation, and balancing flexibility and simplicity in standardized pipelines.
-
Article Series: Developing Apache Kafka applications on Kubernetes
Apache Kafka has integrations with most of the languages in use these days, but in this article series we cover its integration with Java, as sketched below. The series also discusses how to provision, configure, and secure an Apache Kafka cluster on a Kubernetes cluster.
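As a quick taste of the Java integration the series covers, here is a minimal producer sketch; the localhost:9092 broker address and the greetings topic are assumptions for a local setup, not details from the series itself.

```java
import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.common.serialization.StringSerializer;

public class GreetingProducer {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092"); // assumed local broker
        props.put("key.serializer", StringSerializer.class.getName());
        props.put("value.serializer", StringSerializer.class.getName());

        // try-with-resources flushes and closes the producer on exit
        try (KafkaProducer<String, String> producer = new KafkaProducer<>(props)) {
            producer.send(new ProducerRecord<>("greetings", "key-1", "hello from Java"));
        }
    }
}
```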
-
Securing a Kafka Cluster in Kubernetes Using Strimzi
Deploying an Apache Kafka cluster to Kubernetes is easy if you use Strimzi, but that’s only the first step: you also need to secure the communication between Kafka and its consumers and producers, provide RBAC for access to topics, and distribute secrets correctly to Kafka Connect components, all in a Kubernetes GitOps way.
-
Building & Operating High-Fidelity Data Streams
At QCon Plus in November 2021, Sid Anand, chief architect at Datazoom and PMC member of Apache Airflow, presented on building high-fidelity nearline data streams as a service within a lean team. In this talk, Anand provides a master class on building high-fidelity data streams from the ground up.
-
Moving Kafka and Debezium to Kubernetes Using Strimzi - the GitOps Way
Deploying an Apache Kafka cluster to Kubernetes is not an easy task: there are a lot of pieces to configure, such as ZooKeeper, the Kafka cluster itself, topics, and users. Strimzi is a Kubernetes operator that makes deploying Kafka child’s play. Moreover, Strimzi lets you manage Kafka using a GitOps methodology, as everything is defined in Kubernetes YAML files.
-
Debezium and Quarkus: Change Data Capture Patterns to Avoid Dual-Writes Problems
It’s common in microservices to write data to two places: first to a database, and then to send the content on to another microservice. This pattern, known as dual writes, may lose data because of concurrent writes or partial failures. Debezium is an open-source change data capture project that uses a log-scanner approach to avoid dual writes and communicate persisted data correctly between services.
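To make the log-scanner idea concrete, here is a minimal sketch using Debezium’s embedded engine, which reads committed changes from the database log and hands them to a callback; the PostgreSQL connector, file-based offset store, and all names are assumptions for a hypothetical setup, and shutdown handling is omitted.

```java
import io.debezium.engine.ChangeEvent;
import io.debezium.engine.DebeziumEngine;
import io.debezium.engine.format.Json;
import java.util.Properties;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class ChangeCapture {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.setProperty("name", "orders-capture"); // hypothetical engine name
        props.setProperty("connector.class",
            "io.debezium.connector.postgresql.PostgresConnector"); // assuming PostgreSQL
        props.setProperty("offset.storage",
            "org.apache.kafka.connect.storage.FileOffsetBackingStore");
        props.setProperty("offset.storage.file.filename", "/tmp/offsets.dat");
        // database.hostname, database.user, etc. are environment-specific and omitted

        // Every committed row change arrives as a JSON change event; a real
        // application would forward it to another service instead of printing it
        DebeziumEngine<ChangeEvent<String, String>> engine =
            DebeziumEngine.create(Json.class)
                          .using(props)
                          .notifying(event -> System.out.println(event.value()))
                          .build();

        ExecutorService executor = Executors.newSingleThreadExecutor();
        executor.execute(engine); // runs until engine.close() is called
    }
}
```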
-
Kafka Streams and Quarkus: Real-Time Processing Events
Consuming Kafka messages is simple: you receive them as they are produced, but nothing more. If you need real-time processing of the data (filtering, joining, or otherwise manipulating events), the plain Kafka consumer API may not be the best approach, as the resulting code quickly becomes complex. Kafka Streams and Quarkus are a perfect match for processing Kafka events in real time.
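As a rough illustration of the contrast, here is a sketch of a Kafka Streams topology in the shape Quarkus expects (a CDI-produced Topology bean, picked up by the quarkus-kafka-streams extension); the events and alerts topic names and the filter criterion are hypothetical.

```java
import jakarta.enterprise.context.ApplicationScoped;
import jakarta.enterprise.inject.Produces;
import org.apache.kafka.common.serialization.Serdes;
import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.Topology;
import org.apache.kafka.streams.kstream.Consumed;
import org.apache.kafka.streams.kstream.Produced;

@ApplicationScoped
public class AlertTopology {

    // The quarkus-kafka-streams extension discovers and runs this Topology
    @Produces
    public Topology buildTopology() {
        StreamsBuilder builder = new StreamsBuilder();
        builder.stream("events", Consumed.with(Serdes.String(), Serdes.String()))
               // keep only events marked as alerts (hypothetical criterion)
               .filter((key, value) -> value.contains("\"alert\":true"))
               .to("alerts", Produced.with(Serdes.String(), Serdes.String()));
        return builder.build();
    }
}
```

The same filter written against the plain consumer API would need an explicit poll loop, manual offset management, and a separate producer to forward the results.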
-
Getting Started to Quarkus Reactive Messaging with Apache Kafka
How data is processed and consumed nowadays differs from how it was once practiced. In the past, data was stored in a database and batch-processed for analytics. Apache Kafka is a distributed event store and stream-processing platform for storing, consuming, and processing data streams in real time. In this post, we’ll learn how to produce and consume data using Apache Kafka and Quarkus.
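As a flavor of what the post covers, here is a minimal SmallRye Reactive Messaging processor as used in Quarkus; the channel names are hypothetical and would be mapped to Kafka topics in application.properties (e.g. mp.messaging.incoming.prices-in.connector=smallrye-kafka).

```java
import jakarta.enterprise.context.ApplicationScoped;
import org.eclipse.microprofile.reactive.messaging.Incoming;
import org.eclipse.microprofile.reactive.messaging.Outgoing;

@ApplicationScoped
public class PriceConverter {

    // Consumes each message from the "prices-in" channel and publishes the
    // converted value to "prices-out"; Quarkus wires both channels to Kafka
    @Incoming("prices-in")
    @Outgoing("prices-out")
    public double convert(int priceInCents) {
        return priceInCents / 100.0;
    }
}
```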
-
Turning Microservices Inside-Out
Turning microservices inside-out means moving past a single request/response API to designing microservices with an inbound API for queries and commands, an outbound API to emit events, and a meta API to describe them both. A database can be supplemented with Apache Kafka via connective tissue such as Debezium.
-
Building an SQL Database Audit System Using Kafka, MongoDB and Maxwell's Daemon
In this article, the author discusses the importance of building a database audit logging system outside of the database’s traditional built-in data replication, using technologies such as Kafka, MongoDB, and Maxwell's Daemon.
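One plausible shape for such a pipeline, sketched under assumptions: Maxwell's Daemon tails the MySQL binlog and publishes each change as JSON to Kafka (its default topic is maxwell), and a small consumer persists the events to a MongoDB audit collection. The broker address and the database and collection names below are hypothetical.

```java
import com.mongodb.client.MongoClients;
import com.mongodb.client.MongoCollection;
import java.time.Duration;
import java.util.List;
import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.bson.Document;

public class AuditSink {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092"); // assumed local broker
        props.put("group.id", "audit-sink");
        props.put("key.deserializer",
            "org.apache.kafka.common.serialization.StringDeserializer");
        props.put("value.deserializer",
            "org.apache.kafka.common.serialization.StringDeserializer");

        MongoCollection<Document> audit =
            MongoClients.create("mongodb://localhost:27017")
                        .getDatabase("audit").getCollection("events"); // assumed names

        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props)) {
            consumer.subscribe(List.of("maxwell")); // Maxwell's default Kafka topic
            while (true) {
                for (ConsumerRecord<String, String> record :
                         consumer.poll(Duration.ofSeconds(1))) {
                    // each record is one binlog change event serialized as JSON
                    audit.insertOne(Document.parse(record.value()));
                }
            }
        }
    }
}
```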