The Disco MapReduce Framework
Chris Mueller from Life Technologies introduces us to Disco, a MapReduce framework built in Python and Erlang.
Showing that Hadoop is not alone in the MapReduce world, Chris reviews the basic MapReduce paradigm, its dataflow, and how files and jobs are distributed. He then explains the Disco Distributed Filesystem (DDFS) and closes with use cases from next-generation genomic sequencing.
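For readers new to the paradigm, here is a minimal single-process sketch of the map, shuffle, and reduce dataflow using a word count. It does not use Disco's API and is not taken from the talk; the function names (`map_words`, `shuffle`, `reduce_counts`, `run_job`) are hypothetical and chosen only for illustration. A real framework such as Disco would run the map and reduce steps on many workers and read its inputs from distributed storage.

```python
from collections import defaultdict


def map_words(line):
    """Map phase: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle phase: group all emitted values by key.

    A framework normally does this step between map and reduce.
    """
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(word, counts):
    """Reduce phase: collapse the values for one key into a single total."""
    return word, sum(counts)


def run_job(lines):
    """Run the three phases in order, in one process (illustration only)."""
    mapped = (pair for line in lines for pair in map_words(line))
    grouped = shuffle(mapped)
    return dict(reduce_counts(word, counts) for word, counts in grouped.items())


if __name__ == "__main__":
    print(run_job(["disco runs jobs", "disco stores data"]))
```

The map and reduce functions only see one line or one key at a time, which is what lets a framework spread them across machines without changing the logic.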
This post is part of the 2012 PyData Workshop Videos.