I am new to Hadoop and am trying to process a Wikipedia dump. It's a 6.7 GB gzip-compressed XML file. I read that Hadoop supports gzip-compressed files, but they can only be processed by a single mapper, since gzip is not a splittable format.
Why not ungzip it and recompress with splittable LZO instead?
http://blog.cloudera.com/blog/2009/11/hadoop-at-twitter-part-1-splittable-lzo-compression/
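As a rough sketch, the recompression and indexing steps could look like this (the dump filename, HDFS paths, and hadoop-lzo jar location are placeholders; `LzoIndexer` is the indexer class from the hadoop-lzo project described in the linked post):

```shell
# Decompress the gzip dump, then recompress with LZO
# (lzop produces enwiki-dump.xml.lzo alongside the input).
# Filename is a placeholder -- use your actual dump file.
gunzip enwiki-dump.xml.gz
lzop enwiki-dump.xml

# Copy the .lzo file into HDFS.
hadoop fs -put enwiki-dump.xml.lzo /wikipedia/

# Build the block index that makes the .lzo file splittable,
# so multiple mappers can process it in parallel.
hadoop jar /path/to/hadoop-lzo.jar \
    com.hadoop.compression.lzo.LzoIndexer \
    /wikipedia/enwiki-dump.xml.lzo
```

Without the index, the LZO file is still readable but will be handled by a single mapper; the indexing step is what enables the splits.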