InfoQ Homepage Data Analysis Content on InfoQ
-
Using Data Effectively: beyond Art and Science
Hilary Parker talks about approaches and techniques to collect the most useful data, analyze it in a scientific way, and use it most effectively to drive actions and decisions.
-
How to Use Data Responsibly
Emma Prest and Clare Kitching discuss practical, pragmatic and ethical data science, talking about real world experience from the work of DataKind UK.
-
How Machines Help Humans Root Case Issues @ Netflix
Seth Katz discusses ways to build tools designed to enhance the cognitive ability of humans through automated analysis to speed root cause detection in distributed systems.
-
Gimel: PayPal’s Analytics Data Platform
Deepak Chandramouli introduces and demos Gimel, a unified analytics data platform which provides access to any storage through a single unified data API and SQL.
-
Forecasting Using Data
Troy Magennis discusses the top three reasons forecasts fail to match reality, and challenges the assumption that work complexity and effort correlates with delivery time.
-
A Cloud-centric Ecosystem Approach to Ease IoT Development
Yujing Wu discusses two use cases of a cloud-based IoT ecosystem that enables IoT device communication across silos and interoperability across different vendors.
-
Enabling High Performance Real-time Analytics for IoT Environments
Mahish Singh discusses how to use methodologies during design, development, deployment and operation for delivery of analytics platforms which offer real-time SLAs.
-
Scaling up Near Real-Time Analytics @Uber &LinkedIn
Chinmay Soman and Yi Pan discuss how Uber and LinkedIn use Apache Samza, Calcite and Pinot along with the analytics platform AthenaX to transform data to make it available for querying in minutes.
-
Stream Processing & Analytics with Flink @Uber
Danny Yuan discusses how Uber builds its next generation of stream processing system to support real-time analytics as well as complex event processing.
-
Data Cleansing and Understanding Best Practices
Casey Stella talks about discovering missing values, values with skewed distributions and likely errors within data, as well as a novel approach to finding data interconnectedness.
-
Elastic Data Analytics Platform @Datadog
Doug Daniels discusses the cloud-based platform they have built at DataDog and how it differs from a traditional datacenter-based analytics stack, pros and cons and the tooling built.
-
Streaming Live Data and the Hadoop Ecosystem
Oleg Zhurakousky discusses the Hadoop ecosystem – Hadoop, HDFS, Yarn-, and how projects such as Hive, Atlas, NiFi interact and integrate to support the variety of data used for analytics.