InfoQ Homepage Presentations A Taste of Random Decision Forests on Apache Spark
A Taste of Random Decision Forests on Apache Spark
Summary
Sean Owen introduces Spark, Scala and random decision forests, and demonstrates the process of analyzing a real-world data set with them.
Bio
Sean Owen is Director of Data Science at Cloudera in London. Before Cloudera, he founded Myrrix Ltd (now known as the Oryx project) to commercialize large-scale real-time recommenders on Apache Hadoop. He is an Apache Spark contributor, and was a committer and VP for Apache Mahout.
About the conference
Software is Changing the World. QCon empowers software development by facilitating the spread of knowledge and innovation in the developer community. A practitioner-driven conference, QCon is designed for technical team leads, architects, engineering directors, and project managers who influence innovation in their teams.