4949

Distributed Pattern Recognition in RapidMiner

Alexander Arimond, Christian Kofler, Faisal Shafait

RapidMiner Community Meeting and Conference RapidMiner Community Meeting and Conference (RCOMM-10), September 13-16, Dortmund, Germany , Online , 2010
RapidMiner already provides easy to use interfaces for developing and evaluating Pattern Recognition and Machine Learning applications. However, it has only limited support for parallelization and it lacks functionality to spread long-running computations over multiple machines. A solution to this is distributed computing with paradigms like MapReduce. In this paper, we present a system called DisPaRe, which integrates distributed computing frameworks into RapidMiner. A special focus is put on utilizing MapReduce as a programming model. The frameworks GridGain and Oracle Coherence are reviewed and evaluated with respect to their suitability to fit into the context of RapidMiner. The system provides effective means for transparently utilizing these frameworks and enabling RapidMiner processes to parallelize their computations within a distributed environment.

Show BibTex:

@inproceedings {
       abstract = {RapidMiner already provides easy to use interfaces for developing and evaluating Pattern Recognition
and Machine Learning applications. However, it has only
limited support for parallelization and it lacks functionality
to spread long-running computations over multiple machines.
A solution to this is distributed computing with paradigms
like MapReduce. In this paper, we present a system called
DisPaRe, which integrates distributed computing frameworks
into RapidMiner. A special focus is put on utilizing MapReduce
as a programming model. The frameworks GridGain and
Oracle Coherence are reviewed and evaluated with respect
to their suitability to fit into the context of RapidMiner. The
system provides effective means for transparently utilizing these
frameworks and enabling RapidMiner processes to parallelize
their computations within a distributed environment.},
       number = {}, 
       month = {9}, 
       year = {2010}, 
       title = {Distributed Pattern Recognition in RapidMiner}, 
       journal = {}, 
       volume = {}, 
       pages = {}, 
       publisher = {Online}, 
       author = {Alexander Arimond, Christian Kofler, Faisal Shafait}, 
       keywords = {},
       url = {http://www.dfki.de/web/forschung/publikationen/renameFileForDownload?filename=Arimond-DisPaRe-RCOMM10.pdf&file_id=uploads_782}
}