InfoQ Homepage Apache Flink Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Uber Freight Near-Real-Time Analytics Architecture

Uber Freight is the Uber platform dedicated to connecting shippers with carriers. Providing reliable service to shippers is crucial for Uber Freight. This is why the Carrier Scorecard was developed, with several metrics including on-time pickup/delivery, tracking automation, and late cancellations.

Claudio Masolo
on Nov 08, 2022
Java

Apache InLong: Integration Framework for Massive Data

Apache InLong, an integration framework designed for massive data, was originally built at Tencent, where it was used in production for more than eight years, to support massive data reporting services in big data scenarios. The project officially graduated as an Apache top-level project three years after the introduction of the project in the Apache Incubator.

Andrea Messetti
on Oct 12, 2022
Architecture & Design

Netflix Builds a Custom High-Throughput Priority Queue Backed by Redis, Kafka and Elasticsearch

Netflix recently published how it built Timestone, a custom high-throughput, low-latency priority queueing system. They built it using open-source components such as Redis, Apache Kafka, Apache Flink and Elasticsearch. Engineers state that they made Timestone since they could not find an off-the-shelf solution that met all of its requirements.

Eran Stiller
on Oct 11, 2022
AI, ML & Data Engineering

Next Generation of Data Movement and Processing Platform at Netflix

Netflix engineering recently published in a tech blog how they used data mesh architecture and principles as the next generation of data platform and processing to unleash more business use cases and opportunities. Data mesh is the new paradigm shift in data management that enables users to easily import and use data without transporting it to a centralized location like a data lake.

Reza Rahimi
on Aug 29, 2022
Architecture & Design

Netflix Studio Search: Using Elasticsearch and Apache Flink to Index Federated GraphQL Data

Netflix engineers recently published how they built Studio Search, using Apache Kafka streams, an Apache Flink-based Data Mesh process, and Elasticsearch to manage the index. They designed the platform to take a portion of Netflix's federated GraphQL graph and make it searchable. Today, Studio Search powers a significant portion of the user experience for many applications within the organisation.

Eran Stiller
on Apr 19, 2022
Architecture & Design

Real-Time Exactly-Once Event Processing at Uber with Apache Flink, Kafka, and Pinot

Uber faced some challenges after introducing ads on UberEats. The events they generated had to be processed quickly, reliably and accurately. These requirements were fulfilled by a system based on Apache Flink, Kafka, and Pinot that can process streams of ad events in real-time with exactly-once semantics. An article describing its architecture was published recently in the Uber Engineering blog.

Vasco Veloso
on Nov 12, 2021
AI, ML & Data Engineering

ApacheCon 2019 Keynote: Google Cloud Enhances Big-Data Processing with Kubernetes

At ApacheCon North America, Christopher Crosbie gave a keynote talk title "Yet Another Resource Negotiator for Big Data? How Google Cloud is Enhancing Data Lake Processing with Kubernetes." He highlighted Google's efforts to make Apache big-data software "cloud native" by developing open-source Kubernetes Operators to provide control planes for running Apache software in a Kubernetes cluster.

Anthony Alford
on Sep 13, 2019
Cloud

Netflix Keystone Real-Time Stream Processing Platform

Netflix recently published a post in their tech blog discussing the design considerations and insights of Keystone, their Real-time stream processing platform. Keystone has been operational since December 2015 and has grown significantly over the years as Netflix subscribers have grown from 65 to over 130 million in the past 3 years. This article follows on the latest state of Keystone platform...

Alex Giamas
on Sep 30, 2018
Cloud

Data Artisans Announces Serializable ACID Transactions on Streaming Data

Data Artisans has announced the general availability of Streaming Ledger, which extends Apache Flink with capabilities to perform serializable ACID transactions across tables, keys, and event streams. The patent-pending technology is a proprietary add-on for Flink and allows going beyond the current standard where operations could only consistently work on a single key at a time.

Eldert Grootenboer
on Sep 12, 2018
AI, ML & Data Engineering

Julien Le Dem on the Future of Column-Oriented Data Processing with Apache Arrow

Julien Le Dem, the PMC chair of the Apache Arrow project, presented on Data Eng Conf NY on the future of column-oriented data processing. Apache Arrow is an open-source standard for columnar in-memory execution. InfoQ interviewed Le Dem to find out the differences between Arrow and Parquet.

Alexandre Rodrigues
on Dec 08, 2016
AI, ML & Data Engineering

Microservices and Stream Processing Architecture at Zalando Using Apache Flink

Javier Lopez and Mihail Vieru spoke at Reactive Summit 2016 Conference about cloud-based data integration and distribution platform used for stream processing in business intelligence use cases. Their solution is based on technologies such as Flink, Kafka and Elasticsearch.

Srini Penchikala
on Oct 31, 2016
AI, ML & Data Engineering

Stream Processing and Lambda Architecture Challenges

Lambda architecture has been a popular solution that combines batch and stream processing. Kartik Paramasivam at LinkedIn wrote about how his team addressed stream processing and Lambda architecture challenges using Apache Samza for data processing. The challenges described are the late arrival of events and the processing of duplicated messages.

Alexandre Rodrigues
on Oct 19, 2016
AI, ML & Data Engineering

Data Streaming Architecture with Apache Flink

Jamie Grier recently spoke at OSCON 2016 Conference about data streaming architecture using Apache Flink. He talked about the building blocks of data streaming applications and stateful stream processing with code examples of Flink applications and monitoring.

Srini Penchikala
on Jun 09, 2016
AI, ML & Data Engineering

Apache Flink 1.0.0 is Released

InfoQ's Rags Srinivas caught up with Stephan Ewen, a project committer for Apache Flink about the 1.0.0 Release and the roadmap

Rags Srinivas
on Mar 24, 2016
AI, ML & Data Engineering

Yahoo! Benchmarks Apache Flink, Spark and Storm

Yahoo! has benchmarked three of the main stream processing frameworks: Apache Flink, Spark and Storm.

Abel Avram
on Dec 23, 2015

Newer News

Older News

InfoQ Software Architects' Newsletter

News