Hadoop MapReduce: providing nested directories as job input
Question

I'm working on a job that processes a nested directory structure containing files on multiple levels:

    one/
    ├── three/
    │   └── four/
    │       ├── baz.txt
    │       ├── bleh.txt
    │       └── foo.txt
    └── two/
        ├── bar.txt
        └── gaa.txt

When I add one/ as an input path, no files are processed, since none are immediately available at the root level.

I read about job.addInputPathRecursively(..), but this seems to have been deprecated in the more recent releases (I'm using Hadoop 1.0.2). I've written some code to walk the