I'm building a Hadoop (0.20.1) MapReduce job that uses HBase (0.20.1) as both the data source and data sink. I would like to write the job in Python, which has required me t
The project linked below seems to do what I want, but it's not part of the Hadoop distribution. Any other suggestions or comments are still welcome.
http://github.com/wanpark/hadoop-hbase-streaming
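For anyone landing here: whatever supplies the records, the Python side of a streaming job is just a script that reads lines from stdin and writes tab-separated key/value lines to stdout. Here is a minimal sketch of such a mapper. The input layout (row key, a tab, then a value) and the names `map_line` and `run` are my own assumptions for illustration; I have not confirmed what format the project above actually hands to the script, so check its README before relying on this.

```python
import sys


def map_line(line):
    """Turn one input line into a list of (key, value) pairs."""
    # ASSUMPTION: each line looks like "<row key>\t<value>".
    # This layout is hypothetical, not a documented HBase streaming format.
    row_key, sep, value = line.rstrip("\n").partition("\t")
    if not sep:
        return []  # no tab found: skip the line
    # Placeholder logic: emit the length of the value for each row key.
    return [(row_key, str(len(value)))]


def run(stdin=sys.stdin, stdout=sys.stdout):
    # Hadoop Streaming feeds records on stdin and reads
    # tab-separated key/value lines back from stdout.
    for line in stdin:
        for key, value in map_line(line):
            stdout.write("%s\t%s\n" % (key, value))


if __name__ == "__main__":
    run()
```

Keeping the per-line logic in its own function means it can be tested locally by piping a sample file through the script before submitting anything to the cluster.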