Need help in writing Map/Reduce job to find average
问题 I'm fairly new to Hadoop Map/Reduce. I'm trying to write a Map/Reduce job to find average time taken by n processes, given an input text file as below: ProcessName Time process1 10 process2 20 processn 30 I went through few tutorials but I'm still not able to get a thorough understanding. What should my mapper and reducer classes do for this problem? Will my output always be a text file or is it possible to directly store the average in some sort of a variable? Thanks. 回答1: Your Mappers read