How gzip file gets stored in HDFS
问题 HDFS storage support compression format to store compressed file. I know that gzip compression doesn't support splinting. Imagine now the file is a gzip-compressed file whose compressed size is 1 GB. Now my question is: How this file will get stored in HDFS (Block size is 64MB) From this link I came to know that The gzip format uses DEFLATE to store the compressed data, and DEFLATE stores data as a series of compressed blocks. But I couldn't understand it completely and looking for broad