Where Mapper output in Hadoop is saved?
问题 I am interested in efficiently manage the Hadoop shuffling traffic and utilize the network bandwidth effectively. To do this I want to know how much shuffling traffic generated by each Datanodes ? Shuffling traffic is nothing but the output of mappers. So where this mapper output is saved ? How can i get the size of mapper output from each datanodes in a real time ? Appreciate your help. I have created a directory to store this mapper output as below. <property> <name>mapred.local.dir</name>