Changeset [eceb7123fb9e1a143e46f1ac197dc53cbe3038ed] by Shayan Pooya
March 20th, 2014 @ 12:42 PM
Examples: Re-implement the kclustering example with the pipelines.
In this implementation, we tried to stick to the original model
and did not
use any of the capabilities of the pipelining.
https://github.com/discoproject/disco/commit/eceb7123fb9e1a143e46f1...
Committed by Shayan Pooya
- A examples/datamining/kclustering_pipeline.py
Create your profile
Help contribute to this project by taking a few moments to create your personal profile. Create your profile ยป
Disco is an open-source implementation of the Map-Reduce framework for distributed computing. As the original framework, Disco supports parallel computations over large data sets on unreliable cluster of computers.