Hadoop shuffle uses which protocol?

Posted by 假如想象 on 2019-12-07 18:52:08

Question


During Hadoop's shuffle stage, the map output is transferred across the nodes of the cluster according to the partitions assigned to each reducer. What protocol does Hadoop use to shuffle this data across nodes for the reduce stage?


Answer 1:


I really laughed the first time I saw it, but the shuffle transfer is done over plain HTTP, served by an HttpServlet.

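You can see this in the TaskTracker source code, in the nested class MapOutputServlet. It receives an HTTP request carrying the IDs of the job and the tasks, reads the matching map output from the local filesystem on disk, and streams it back in the HTTP response. The reduce task that sent the request then copies the incoming stream into memory or onto its own local disk before merging.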
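To make the request/response shape concrete, here is a minimal sketch of the same idea using only the JDK's built-in `com.sun.net.httpserver` package rather than Hadoop's servlet classes. It is not Hadoop's code: the class `ShuffleSketch`, the on-disk layout `<root>/<job>/<map>/part-<reduce>`, and the `/mapOutput` path with `job`, `map` and `reduce` query parameters are assumptions chosen for illustration, and the handler does no validation of the IDs it receives.

```java
import com.sun.net.httpserver.HttpExchange;
import com.sun.net.httpserver.HttpServer;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.InetSocketAddress;
import java.net.URI;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Map;

public class ShuffleSketch {

    // Parse "job=j1&map=m0&reduce=0" into a map (no URL decoding; IDs are assumed plain ASCII).
    static Map<String, String> parseQuery(String query) {
        Map<String, String> params = new HashMap<>();
        if (query == null) return params;
        for (String pair : query.split("&")) {
            int eq = pair.indexOf('=');
            if (eq > 0) params.put(pair.substring(0, eq), pair.substring(eq + 1));
        }
        return params;
    }

    // Map side: serve the file <root>/<job>/<map>/part-<reduce> in response to an HTTP GET.
    // The layout is hypothetical and only stands in for a real map output file.
    static HttpServer startServer(Path root) throws IOException {
        HttpServer server = HttpServer.create(new InetSocketAddress("127.0.0.1", 0), 0);
        server.createContext("/mapOutput", (HttpExchange ex) -> {
            Map<String, String> p = parseQuery(ex.getRequestURI().getQuery());
            Path file = root.resolve(p.getOrDefault("job", ""))
                            .resolve(p.getOrDefault("map", ""))
                            .resolve("part-" + p.getOrDefault("reduce", ""));
            if (!Files.isRegularFile(file)) {
                ex.sendResponseHeaders(404, -1); // -1: no response body
                ex.close();
                return;
            }
            ex.sendResponseHeaders(200, Files.size(file));
            try (OutputStream out = ex.getResponseBody()) {
                Files.copy(file, out); // stream the partition from local disk to the requester
            }
        });
        server.start();
        return server;
    }

    // Reduce side: fetch one partition of one map task's output over HTTP.
    static byte[] fetch(int port, String job, String map, int reduce) throws IOException {
        URI uri = URI.create("http://127.0.0.1:" + port + "/mapOutput?job=" + job
                + "&map=" + map + "&reduce=" + reduce);
        HttpURLConnection conn = (HttpURLConnection) uri.toURL().openConnection();
        try {
            if (conn.getResponseCode() != 200) {
                throw new IOException("fetch failed: HTTP " + conn.getResponseCode());
            }
            try (InputStream in = conn.getInputStream()) {
                return in.readAllBytes();
            }
        } finally {
            conn.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        // Fake one map output partition on local disk, then fetch it as a reducer would.
        Path root = Files.createTempDirectory("shuffle-sketch");
        Path dir = Files.createDirectories(root.resolve("job_1").resolve("map_0"));
        Files.writeString(dir.resolve("part-0"), "apple\t3\n");
        HttpServer server = startServer(root);
        try {
            byte[] data = fetch(server.getAddress().getPort(), "job_1", "map_0", 0);
            System.out.print(new String(data, StandardCharsets.UTF_8));
        } finally {
            server.stop(0);
        }
    }
}
```

The point of the sketch is the direction of the data flow: the node that ran the map task acts as the HTTP server and reads from its own disk, while the reducer is the client that pulls the partition it needs.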



Source: https://stackoverflow.com/questions/8285217/hadoop-shuffle-uses-which-protocol
