InfoQ Homepage Event Stream Processing Content on InfoQ
-
Running Apache Flink Applications on AWS KDA: Lessons Learnt at Deliveroo
Deliveroo introduced Apache Flink into its technology stack for enriching and merging events consumed from Apache Kafka or Kinesis Streams. The company opted to use AWS Kinesis Data Analytics (KDA) service to manage Apache Flink clusters on AWS and shared its experiences from running Flink applications on KDA.
-
Grab Reduces Traffic Cost for Kafka Consumers on AWS to Zero
Grab took advantage of the ability of Apache Kafka consumers to connect to the broker node in the same availability zone (AZ) introduced in Kafka 2.3 and reduced the traffic cost on AWS to zero for reconfigured consumers. The change has substantially reduced overall infrastructure costs for running Apache Kafka on AWS.
-
Grammarly Replaces its in-House Data Lake with Databricks Platform Using Medallion Architecture
Grammarly adopted the medallion architecture while migrating from their in-house data lake, storing Parquet files in AWS S3, to the Delta Lake lakehouse. The company created a new event store for over 6000 event types from 40 internal and external clients and, in the process, improved data quality and reduced the data-delivery time by 94%.
-
Public Preview of JSON Schema Support in Azure Event Hubs Schema Registry for Kafka Applications
Microsoft recently announced that the Azure Event Hubs schema registry now includes JSON schema support, providing Kafka applications with a centralized repository for schema documents used in messaging-centric and event-driven applications. The JSON schema support is currently in public preview.
-
Amazon EventBridge Pipes Support Point-to-Point Integrations between Event Producers and Consumers
At re:Invent, AWS introduced Amazon EventBridge Pipes, a new feature in Amazon EventBridge providing developers a more straightforward way to connect events from multiple services.
-
AWS Introduces Amazon EventBridge Scheduler
AWS recently introduced Amazon EventBridge Scheduler, a new capability from Amazon EventBridge that allows organizations to create, run, and manage scheduled tasks at scale.
-
Microsoft Releases Stream Analytics No-Code Editor into General Availability
During the Ignite Conference, Microsoft released Azure Stream Analytics no-code editor, a drag-and-drop canvas for developing jobs for stream processing scenarios such as streaming ETL, ingestion, and materializing data to data into general availability. The no-code editor is hosted in the company’s big-data streaming platform and event ingestion service, Azure Event Hubs.
-
Confluent Ships Stream Designer Democratizing Data Streams
Confluent recently released Stream Designer, a visual interface that lets developers quickly build and deploy streaming data pipelines.
-
Fitting Presto to Large-Scale Apache Kafka at Uber
The need for ad-hoc real-time data analysis has been growing at Uber. They run a large Apache Kafka deployment and need to analyse data going through the many workflows it supports. Solutions like stream processing and OLAP datastores were deemed unsuitable. An article was published recently detailing why Uber chose Presto for this purpose and what it had to do to make it performant at scale.
-
Amazon Rekognition Introduces Streaming Video Events
AWS recently announced the general availability of Streaming Video Events, a new feature of Amazon Rekognition to provide real-time alerts on live video streams.
-
Quine Aims to Simplify Event Processing on Data in Motion
Developed at thatDot, Quine is an open source streaming graph solution aimed at high-volume event processing. Quine combines graph data and streaming technologies to enable the creation of real-time, complex event processing workflows at scale, says thatDot.
-
AWS Launches Amazon Kinesis Data Streams On-Demand
Amazon Kinesis Data Streams is a fully-managed, serverless service on AWS for real-time processing of streamed data at a massive scale. Recently, the company released a new capacity mode On-demand for the service, which eliminates capacity provisioning and management for streaming workloads.
-
Real-Time Exactly-Once Event Processing at Uber with Apache Flink, Kafka, and Pinot
Uber faced some challenges after introducing ads on UberEats. The events they generated had to be processed quickly, reliably and accurately. These requirements were fulfilled by a system based on Apache Flink, Kafka, and Pinot that can process streams of ad events in real-time with exactly-once semantics. An article describing its architecture was published recently in the Uber Engineering blog.
-
Microsoft Announces Event Hubs Premium in Preview
Azure Event Hubs is Microsoft’s managed real-time event ingestion service designed to serve demanding big data streaming and event ingestion needs in the Cloud. Microsoft announced the public preview of Event Hubs Premium during the annual Build conference as a new product SKU tailor-made for high-end event streaming scenarios requiring elastic, superior performance with predictable latency.
-
Hazelcast Jet 4.4 Released - the Four-Year Anniversary Release as Seen by Scott McMahon
Hazelcast Jet recently celebrated its four-year anniversary with the release of version 4.4. Besides the normal bug fixes and performance enhancements, this new version ships with new features such as the unified file connector and the first beta version of the SQL interface. InfoQ spoke to Scott McMahon, technical director of field engineering at Hazelcast, about this new release.