My map tasks need some configuration data, which I would like to distribute via the Distributed Cache.
The Hadoop MapReduce Tutorial shows the usage of the Distribut
The new DistributedCache API for YARN/MR2 is found in the org.apache.hadoop.mapreduce.Job class.
Job.addCacheFile()
Unfortunately, there aren't as of yet many comprehensive tutorial-style examples of this.
http://hadoop.apache.org/docs/stable/api/org/apache/hadoop/mapreduce/Job.html#addCacheFile%28java.net.URI%29