Is it possible to tell HDFS where to store particular files?
Use case
I\'ve just loaded batch #1 of files into HDFS and w
Technically, you can, but I wouldn't.
If you want full control over where the data goes, you can extend BlockPlacementPolicy
(see how does hdfs choose a datanode to store). This won't be easy to do and I don't recommend it.
You can probably take steps to minimize the amount of traffic between your two sets of nodes with some clever setup to use rack-awareness to your advantage.