发表新帖

发表新帖

In Hadoop where does the framework save the output of the Map task in a normal Map-Reduce Application?

后端未结

关注

 3  1916

爱一瞬间的悲伤 2021-02-06 08:38

I am trying to find out where does the output of a Map task is saved to disk before it can be used by a Reduce task.

Note: - version used is Hadoop 0.20

3条回答

南旧 (楼主)

2021-02-06 09:20

Task tracker starts a separate JVM process for every Map or Reduce task.

Mapper output (intermediate data) is written to the Local file system (NOT HDFS) of each mapper slave node. Once data transferred to Reducer, We won’t be able to access these temporary files.

If you what to see your Mapper output, I suggest using IdentityReducer?

0 讨论(0)

查看其它3个回答
发布评论:

提交评论
- 加载中...

热议问题