Accueil InfoQ Présentations Apache Spark : a practical feedback after implementing a data analysis workflow

Apache Spark : a practical feedback after implementing a data analysis workflow

Voir la Présentation

Speed:

Télécharger

41:43

Résumé

Within a few months, we have rewritten the complete workflow for a data analysis engine: eXenGine. We'll give our feedback about using Apache Spark for implementing a proprietary matrix factorization method and analyzing Wikipedia for textual content, links and meta-data. Focus will be on the nice things we have found about Spark.

Bio

Founder, Chief Scientist @ eXenSa : #recsys and #textmining for #BigData. #MachineLearning, Startups. http://blog.guillaume-pitel.fr Paris · wikinsights.org

A propos de cette Conférence

Let's get together and chat about machine-learning, natural language processing, large scale data analytics using open source tools such as Hadoop MapReduce, Shark, NoSQL databases, the semantic web and linked data.

Enregistré à :

09 mai 2014

par

Guillaume Pitel

Contenu éditorial lié

Débloquez l'expérience InfoQ complète

Vous n'avez pas encore de compte InfoQ ?

Sujets

Comment Utiliser Le Chiffrement Pour La Défense En Profondeur Dans Les Apps Natives Et Navigateurs

Manipulation De Données Avec Programmation Fonctionnelle Et Requêtes Dans Ballerina

Les Prédictions De Temps Chez Uber Eats

Les Processus De Tests Individuels Ne Peuvent Convenir A Tout Le Monde.

Pourquoi La Gouvernance DevOps Est Cruciale Pour Permettre La Vélocité Des Développeurs

Liens utiles

Sélectionner votre région

Apache Spark : a practical feedback after implementing a data analysis workflow

Résumé

Bio

A propos de cette Conférence

Ce contenu est dans le sujet Java

Sujets liés

Sponsored Content

Contenu éditorial lié

Contenu sponsorisé lié

La Nouvelle Version D'Asahi Linux Prend En Charge Les Processeurs Apple M1 Ultra Et M2

PostgreSQL 14 Casse Les Pilotes .NET Et Java Pour PostgreSQL

Docker Desktop 4.6 Pour Mac Améliore Les Performances De Partage

Comment Eviter Le Verrouillage Des Fournisseurs Sans Serveurs Avec Design Patterns ?

Manipulation De Données Avec Programmation Fonctionnelle Et Requêtes Dans Ballerina

Ballerina : Un Langage De Programmation Orienté Données

La Dette Technique Est Quantifiable En Tant Que Dette Financière : Impossible Pour Les Développeurs

Les Tests De Performance Doivent S'Appuyer Sur Les Tendances

Les Processus De Tests Individuels Ne Peuvent Convenir A Tout Le Monde.

Grab A Partagé Son Experience Sur La Conception De Plate-formes De Données Distribuées

Microsoft Research Développe un Nouveau Système de Language-Vision : VinVL

Les Prédictions De Temps Chez Uber Eats

Les Facteurs Clés De La "MFA Fatigue" Dont A Ete Victime Uber

Adoption D'Environnements De Développement À Distance Chez Slack

Pourquoi La Gouvernance DevOps Est Cruciale Pour Permettre La Vélocité Des Développeurs

QCon London

QCon AI Boston

QCon San Francisco