Skip to content

Latest commit

 

History

History
35 lines (29 loc) · 1015 Bytes

README.md

File metadata and controls

35 lines (29 loc) · 1015 Bytes

Hadopy - Easy parallel map-reduce command line tool

License: GPL v3 Python Versions PyPI version

If you want to map reduce parallel but hadoop is overkill, with Hadopy you can run map reduce in python.

Installing

To get Hadopy, either install from PyPi:

$ pip install hadopy 

or clone this github project and install:

$ pip install .

Usage

Hadopy was programmed with ease-of-use in mind. To run it use one of the following command:

Linux / MacOS

$ cat example.txt | hadopy --mapper "python mapper.py" --reducer "python reducer.py"

Windows

$ type example.txt | hadopy --mapper "python mapper.py" --reducer "python reducer.py"

For more information use

$ hadopy --help