InfoQ Homepage Presentations Docker Data Science Pipeline
Docker Data Science Pipeline
Summary
Lennard Cornelis explains why they chose OpenShift and Docker to connect to the Hadoop environment, and also how to set up a Docker container running a data science model using Hive, Python, and Spark.
Bio
Lennard Cornelis is a senior big data engineer who has a great passion for technology. He is a hands-on person and who loves to solve difficult and challenging problems. Knowledge sharing is very important to him as he loves the role of mentoring colleagues.
About the conference
Big Data Conference Vilnius is a three-day conference with technical talks in the fields of Big Data, High Load, Data Science, Machine Learning and AI.Conference brings together developers, IT professionals and users to share their experience, discuss best practices, describe use cases and business applications related to their successes.