Distributed Data Analysis with Hadoop and R
Summary
Jonathan Seidman and Ramesh Venkataramaiah explain how they run R on Hadoop to perform distributed analysis on large data sets, and discuss some alternatives to their solution.
Bio
Jonathan Seidman is Lead Engineer on the Business Intelligence/Big Data team at Orbitz Worldwide, co-founder and organizer of the Chicago Hadoop User Group, and founder of the Chicago Big Data User Group. Ramesh Venkataramaiah is a member of the Operations and Engineering Team at Orbitz Worldwide, where he focuses on the analysis of distributed, high-availability systems in the travel data domain.
About the conference
Strange Loop is a multi-disciplinary conference that aims to bring together the developers and thinkers building tomorrow's technology in fields such as emerging languages, alternative databases, concurrency, distributed systems, mobile development, and the web.