InfoQ Homepage Data Analysis Content on InfoQ
-
Real-time Stream Computing & Analytics @Uber
Sudhir Tonse discusses using stream processing at Uber: indexing and querying of geospatial data, aggregation and computing of streaming data, extracting patterns, TimeSeries analyses and predictions.
-
Rethinking Streaming Analytics for Scale
Helena Edelson addresses new architectures emerging for large scale streaming analytics based on Spark, Mesos, Akka, Cassandra and Kafka (SMACK) or Apache Flink or GearPump.
-
Insights from History of Rock Music via Machine Learning
Ali Kheyrollahi uses clustering and network analysis algorithms to analyze the publicly available Wiki data on rock music to find mathematical relationship between artists, trends and subgenres.
-
Supercharging Operations and Analytics: Using Spring XD to Support Analytics and CEP
Joseph Paulchell discusses the journey from batch-oriented processes using databases to a real-time data streaming solution and the significant benefits achieved as well as the challenges encountered.
-
IoT Realized - The Connected Car v2
Phil Berman and Michael T Minella present a solution developed with Spring XD to stream real-time analytics from a moving car using open standards.
-
Powering the Industrial Enterprise: Introducing the IOT Platform-as-a-Service
Jesus Rodriguez explores the characteristics of the IOT PaaS vs. predecessor PaaS architectures, focusing on device management, event driven integration, real-time analytics and offline communication.
-
Data Driven Product Development
Lyndon Maher, Paul McManus discuss data driven development, how to collect data, getting feedback, tools to use, and how to integrate a data-driven mentality into the team.
-
Reporting and the First Law of Holes
Sebastian von Conrad advises on reporting: capturing the right data at the right time, best practices and cleaning up reporting debts in code bases.
-
Pulsar: Real-time Analytics at Scale
Sharad Murthy & Tony Ng present Pulsar, a real-time streaming system which can scale to millions of events per second with high availability and 4GL language support.
-
Exploratory Data Analysis with R
Matthew Renze introduces the R programming language and demonstrates how R can be used for exploratory data analysis.
-
Interactive Analytics at Scale with Druid
Julien Lavigne du Cadet discusses how Criteo uses Druid: an open-source, real-time data store designed to power interactive applications at scale, covering Druid's architecture and internals.
-
Product thru the Looking Glass
Chris Matts discusses how to manage product mastery, how do we decide whether to use analysis or product management techniques, and what does an end-to-end process looks like.