inputformatter

Implementation for CombineFileInputFormat Hadoop 0.20.205

丶灬走出姿态 submitted on 2019-11-27 20:19:23
Can someone please point out where I could find an implementation for CombineFileInputFormat (org. using Hadoop 0.20.205? This is to create large splits from very small log files (lines of text) on EMR. It is surprising that Hadoop does not have a default implementation for this class, which was made specifically for this purpose, and from googling it looks like I'm not the only one confused by this. I need to compile the class and bundle it in a jar for hadoop-streaming; with limited knowledge of Java, this is something of a challenge. Edit: I already tried the yetitrails example, with the necessary imports but I

Implementation for CombineFileInputFormat Hadoop 0.20.205

烈酒焚心 submitted on 2019-11-26 20:22:02
Question: Can someone please point out where I could find an implementation for CombineFileInputFormat (org. using Hadoop 0.20.205? This is to create large splits from very small log files (lines of text) on EMR. It is surprising that Hadoop does not have a default implementation for this class, which was made specifically for this purpose, and from googling it looks like I'm not the only one confused by this. I need to compile the class and bundle it in a jar for hadoop-streaming, with a limited knowledge of Java