Unable to run MapReduce using Python in Hadoop?
Question

I have written a mapper and a reducer in Python for a word count program, and they work fine locally. Here is a sample:

    echo "hello hello world here hello here world here hello" | wordmapper.py | sort -k1,1 | wordreducer.py
    hello 4
    here 3
    world 2

Now when I submit a Hadoop job for a large file with the command below, I get errors:

    hadoop jar share/hadoop/tools/sources/hadoop-*streaming*.jar -file wordmapper.py -mapper wordmapper.py -file wordreducer.py -reducer wordreducer.py -input /data/1jrl.pdb -output /output/py_jrl
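For context, the two scripts are not shown in the question. The following is only a minimal sketch of what a streaming-style word count pair might look like, assuming tab-separated `word<TAB>count` records; the function names `map_words`, `reduce_counts`, and `run_local` are hypothetical and are not taken from the original `wordmapper.py` or `wordreducer.py`.

```python
from itertools import groupby


def map_words(lines):
    """Mapper step: emit one 'word<TAB>1' record per whitespace-separated word."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_counts(sorted_records):
    """Reducer step: sum the counts of consecutive records that share a key.

    This relies on the input being sorted by key, which is what
    `sort -k1,1` does locally and what the shuffle phase does in Hadoop.
    """
    pairs = (record.rstrip("\n").split("\t", 1) for record in sorted_records)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        yield f"{word}\t{sum(int(count) for _, count in group)}"


def run_local(lines):
    """Simulate `mapper | sort | reducer` in a single process (hypothetical helper)."""
    return list(reduce_counts(sorted(map_words(lines))))
```

In an actual streaming job each step would live in its own executable script that reads `sys.stdin` and prints records to standard output. Calling `run_local` on the sample sentence from the question gives the same counts as the shell pipeline (hello 4, here 3, world 2).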