I have a bunch of small files in an HDFS directory. Although the volume of the files is relatively small, the amount of processing time per file is huge. Thus, I want to decrease the split size so that each file is handled by several mappers and I can utilize more nodes of the cluster. Is there a way to set the split size per job, rather than for the whole cluster?
The parameter mapred.max.split.size, which can be set individually per job, is what you're looking for. Don't change dfs.block.size, because that is global for HDFS and changing it can lead to problems.
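For example, here is a minimal driver sketch (class and job names are placeholders, and the mapper/reducer setup is elided) showing how you might cap the split size for a single job. Note that on newer Hadoop versions the property was renamed to mapreduce.input.fileinputformat.split.maxsize, and FileInputFormat.setMaxInputSplitSize(job, size) is a convenience method that sets the same thing:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class SmallSplitJob {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Cap each input split at 1 MB so each file is split across
        // more mappers. This only affects this job, not the cluster.
        // (Classic key; newer Hadoop uses
        //  mapreduce.input.fileinputformat.split.maxsize.)
        conf.setLong("mapred.max.split.size", 1024 * 1024);

        Job job = Job.getInstance(conf, "small-split-job");
        job.setJarByClass(SmallSplitJob.class);
        // ... set your mapper/reducer/output classes here ...
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

If your driver uses ToolRunner/GenericOptionsParser, you can also pass it on the command line without recompiling, e.g. -D mapred.max.split.size=1048576.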