Configure Map Side join for multiple mappers in Hadoop Map/Reduce

前端 未结 2 1775
别跟我提以往
别跟我提以往 2020-12-17 05:01

I have a question about configuring Map/Side inner join for multiple mappers in Hadoop. Suppose I have two very large data sets A and B, I use the same partition and sort a

2条回答
  •  半阙折子戏
    2020-12-17 05:38

    I think you're missing the point. You don't control the number of mappers. It's the number of reducers that you have control over. Simply emit the correct keys from your mapper. Then run 10 reducers.

提交回复
热议问题