Data Warehousing Content on InfoQ
Presentations
Back to Basics: Scalable, Portable ML in Pure SQL
Evan Miller walks through the architecture of Eppo's portable, performant, privacy-preserving, multi-warehouse regression engine, and discusses the challenges of implementing it.
-
Customer Insight, from Data to Information
Thore Thomassen draws on experience to show how to combine structured data in a data warehouse (DWH) with unstructured data in NoSQL, and how to use parallel data warehouse appliances to boost analytical capabilities.
-
Agile Data Warehouse - The Final Frontier
Terry Bunio discusses applying Agile principles to building a data warehouse based on a data model and making use of visualization tools.
-
Facebook’s Petabyte Scale Data Warehouse using Hive and Hadoop
Ashish Thusoo and Namit Jain explain how Facebook analyzes 12 TB of new compressed data every day with the help of Hive, an open source data warehousing framework built on Hadoop.
-
Transforming the Reconciliation Process
Brian Oliver explains what the Reconciliation Process is, why the current client-server approach to reconciliation is no longer suitable, and how data grids and events might help.