Changeset [e1b2052a447676ba48555dae0b3cdf3d2648c2f6] by Shayan Pooya

March 18th, 2014 @ 12:50 PM

Hdfs: Add support for saving the results of the jobs to HDFS.

This support is not complete yet and should not be used in production. An
example has been added for a job that reads the input from hdfs and writes the
data in hdfs.
https://github.com/discoproject/disco/commit/e1b2052a447676ba48555d...

Committed by Shayan Pooya

  • A examples/util/wordcount_hdfs.py
  • A master/src/hdfs.erl
  • M lib/disco/worker/task_io.py
  • M master/src/ddfs/ddfs_http.erl
  • M master/src/ddfs/ddfs_node.erl
  • M master/src/worker_runtime.erl
New-ticket Create new ticket

Create your profile

Help contribute to this project by taking a few moments to create your personal profile. Create your profile ยป

Disco is an open-source implementation of the Map-Reduce framework for distributed computing. As the original framework, Disco supports parallel computations over large data sets on unreliable cluster of computers.

Shared Ticket Bins