InfoQ Homepage Global Data Science Conference 2016 Content on InfoQ
Presentations
RSS Feed-
Structuring Data for Self-Serve Customer Insights
Jim Porzak discusses creating an analyst ready data mart that is complete at different levels of abstraction and models customer decision points in order to be able to understand customers.
-
Monitoring and Troubleshooting Real-Time Data Pipelines
Alan Ngai and Premal Shah discuss best practices on monitoring distributed real-time data processing frameworks and how DevOps can gain control and visibility over these data pipelines.
-
Solving Business Problems with Data Science
The panelists discuss some of the unique problems that only data science can solve, the pitfalls and the success rate of data science projects.
-
Hybrid Artificial Intelligence
Manuel Ebert explores how hybrid AI works, its impact on businesses, using it in existing businesses, and what we can expect from hybrid artificial intelligence in the years to come.
-
Building a Predictive Intelligence Engine
Viral Bajaria explains a formula for reaching the B2B buyer early in the sales cycle by tying together billions of rows of customer data and overlaying predictive intelligence technology.
-
The Future of Data Science
The panelists discuss some of the trends in data science today, the job of a data scientist, the tools and other related issues.
-
Detecting Anomalies in Streaming Data, Evaluating Algorithms for Real-World Use
Alexander Lavin introduces the Numenta Anomaly Benchmark (NAB), a framework for evaluating anomaly detection algorithms on streaming data.
-
Predicting the Future: Surprising Revelations trom Truly Big Data
Pushpraj Shukla discusses how Microsoft Bing predicts the future based on aggregate human behavior using one of the largest scale data sets, and recent progress in large scale deep learnt models.
-
Netflix Keystone - How We Built a 700B/day Stream Processing Cloud Platform in a Year
Peter Bakas presents in detail how Netflix has used Kafka, Samza, Docker, and Linux to implement a multi-tenant pipeline processing 700B events/day in the Amazon AWS cloud.
-
Hunting Criminals with Hybrid Analytics
David Talby demos using Python libraries to build a ML model for fraud detection, scaling it up to billions of events using Spark, and what it took to make the system perform and ready for production.
-
Big-Data Analytics Misconceptions
Irad Ben-Gal discusses Big Data analytics misconceptions, presenting a technology predicting consumer behavior patterns that can be translated into wins, revenue gains, and localized assortments.
-
How Comcast Uses Data Science and ML to Improve the Customer Experience
Jan Neumann presents how Comcast uses machine learning and big data processing to facilitate search for users, for capacity planning, and predictive caching.