Configure Map Side join for multiple mappers in Hadoop Map/Reduce

前端未结

关注

 2  1775

别跟我提以往 2020-12-17 05:01

I have a question about configuring Map/Side inner join for multiple mappers in Hadoop. Suppose I have two very large data sets A and B, I use the same partition and sort a

2条回答

半阙折子戏 (楼主)

2020-12-17 05:38

I think you're missing the point. You don't control the number of mappers. It's the number of reducers that you have control over. Simply emit the correct keys from your mapper. Then run 10 reducers.

0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...