Runaway Complexity in Big Data, and a Plan to Stop It

Summary

Nathan Marz outlines several sources of complexity in data systems (lack of human fault-tolerance, conflation of data and queries, and schemas done wrong) and explains what can be done to avoid them.

Bio

Nathan Marz is an engineer at Twitter. Previously, Nathan was the lead engineer at BackType, which was acquired by Twitter in July 2011. Nathan has been involved in the Big Data space for more than four years. He is the author of Cascalog, a high-level abstraction for MapReduce, and Storm, a distributed and fault-tolerant real-time computation system. He blogs at http://nathanmarz.com.

About the conference

Strange Loop is a multi-disciplinary conference that aims to bring together the developers and thinkers building tomorrow's technology in fields such as emerging languages, alternative databases, concurrency, distributed systems, mobile development, and the web. Strange Loop was created in 2009 by software developer Alex Miller and is now run by a team of St. Louis-based friends and developers under Strange Loop LLC, a for-profit but not particularly profitable venture.

Recorded at:

Oct 25, 2012
