Hadoop chunk size vs split vs block size
问题 I am little bit confused about Hadoop concepts. What is the difference between Hadoop Chunk size , Split size and Block size ? Thanks in advance. 回答1: Block size & Chunk Size are same. Split size may be different to Block/Chunk size. Map Reduce algorithm does not work on physical blocks of the file. It works on logical input splits. Input split depends on where the record was written. A record may span two mappers. The way HDFS has been set up, it breaks down very large files into large