InfoQ Homepage Data Analysis Content on InfoQ
-
LLMs in the Real World: Structuring Text with Declarative NLP
Adam Azzam discusses why building machine learning pipelines to extract structured data from unstructured text is a popular problem within an unpopular development lifecycle.
-
Performance and Scale - Domain-Oriented Objects vs Tabular Data Structures
Donald Raab and Rustam Mehmandarov discuss three library solutions for managing data based on an example of high-performance CSV processing.
-
Speed of Apache Pinot at the Cost of Cloud Object Storage with Tiered Storage
Neha Pawar discusses how to query data on the cloud directly with sub-seconds latencies, diving into data fetch and optimization strategies, challenges faced and learnings.
-
Evolving Analytics in the Data Platform
Blanca Garcia-Gil discusses the BBC’s analytics platform architecture, the failure modes they designed for, and the investigation of the new unknowns and how they automated them away.
-
Qualitative Analysis for Digital Transformation
John Willis discusses how Computer Assisted Qualitative Data Analysis and a QDA approach can be used to analyze group, leadership interviews to better understand Digital Transformation outcomes.
-
Real-Time Stream Analysis in Functional Reactive Programming
Riccardo Terrell discusses about a reactive approach to application design, and how to account for handling events in near real time employing the Functional Reactive Programming paradigm.
-
Putting the Spark in Functional Fashion Tech Analytics
Gareth Rogers shows how his team used Clojure to provide a solid platform to connect and manage an AWS hosted analytics pipeline and the pitfalls they encountered on the way.
-
Streaming Log Analytics with Kafka
Kresten Thorup discusses how and why they use Kafka internally and demos how they utilize it as a straightforward event-sourcing model for distributed deployments.
-
Using Data Effectively: beyond Art and Science
Hilary Parker talks about approaches and techniques to collect the most useful data, analyze it in a scientific way, and use it most effectively to drive actions and decisions.
-
How to Use Data Responsibly
Emma Prest and Clare Kitching discuss practical, pragmatic and ethical data science, talking about real world experience from the work of DataKind UK.
-
How Machines Help Humans Root Case Issues @ Netflix
Seth Katz discusses ways to build tools designed to enhance the cognitive ability of humans through automated analysis to speed root cause detection in distributed systems.
-
Gimel: PayPal’s Analytics Data Platform
Deepak Chandramouli introduces and demos Gimel, a unified analytics data platform which provides access to any storage through a single unified data API and SQL.