login

Data Devroom Program

Saturday 5th February from 13:00 to 19:00:

  • 13h00-13h15 Introduction
  • 13h15-13h45 Hadoop Pig: Mapping & reducing the easy way, Nathan Bijnens

  • 13h45-14h15 Introduction to Clustering with Mahout, Frank Scholten

  • 14h15-14h45 Mapping Wikileaks' Cablegate using Python, Mongo.DB and Gephi, Elias Showk and Julian Bilcke

  • 14h45-15h00 Tools and Methods for Web Data Extraction, Nils Grünwald

  • 15h00-15h15 Datalift, A catalyser for the Web of data, Francois Scharffe

  • 15h15-15h30 Break

  • 15h30-15h45 GDL - GNU Data Language, Sylwester Arabas

  • 15h45-16h00 PimPy : Indexing Multimedia with Python, Sébastien Campion

  • 16h00-16h15 scikits.learn, machine learning in python, Fabian Pedregosa

  • 16h15-16h30 PyF: a python framework for dataflow processing, mining, transforming and reporting, Jonathan Schemoul

  • 16h30-16h45 The My.Media.Lite Recommender System Library, Zeno Gantner

  • 16h45-17h00 Break

  • 17h00-17h30 Free Culture, Free Data - How we use Data to Drive at Wikipedia

  • 17h30-18h00 A real-time search engine with Lucene and S4, Michaël Figuière

  • 18h00-18h15 How Seeks let you do your Web search at home, Emmanuel Benazera

  • 18h15-18h30 Graph databases, the Web of Data storage engines, Pere Urbón-Bayes

  • 18h30-19h00 Comparing Scalable NOSQL Databases: Functionality and Measurements, Thibault Dory

Abstracts

Additional Info

  • This programme is still subject to change
  • Room AW1.124 with 59 seats
  • Practical details on the official FOSDEM conference (Feb 5-6 2011) website
  • Twitter hashtag : #datadevroom

Contact