hadoop: difference between 0 reducer and identity reducer?

前端 未结 4 872
故里飘歌
故里飘歌 2020-12-01 05:50

I am just trying to confirm my understanding of difference between 0 reducer and identity reducer.

  • 0 reducer means reduce step will be skipped and mapper outp
4条回答
  •  执念已碎
    2020-12-01 06:04

    You understanding is correct. I would define it as following: If you do not need sorting of map results - you set 0 reduced,and the job is called map only.
    If you need to sort the mapping results, but do not need any aggregation - you choose identity reducer.
    And to complete the picture we have a third case : we do need aggregation and, in this case we need reducer.

提交回复
热议问题