Copying files across clusters with the Hadoop distcp command
Hadoop provides the distcp command for copying data between different Hadoop clusters.

Usage format:

    hadoop distcp -pbc hdfs://namenode1/test hdfs://namenode2/test

A distcp copy runs as a MapReduce job that has only map tasks and no reduce phase.

    usage: distcp OPTIONS [source_path...] <target_path>
    OPTIONS
     -append            Reuse existing data in target files and append new data to them if possible
     -async             Should distcp execution be blocking
     -atomic            Commit all changes or none
     -bandwidth <arg>   Specify bandwidth per map in MB
     -delete            Delete from target, files missing in source
     -diff <arg>        Use snapshot diff report to identify the difference between source and target
     -f <arg>           List of files that need to be
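As a hedged illustration of the command format above, the sketch below assembles a distcp command line in a small shell function and prints it instead of running it. The function name build_distcp_cmd and the bandwidth value 10 are hypothetical and chosen only for this example; the NameNode addresses and paths are the placeholders from the command above, and the only options used are -p and -bandwidth.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of Hadoop): builds a distcp command line
# as a string so it can be reviewed before anything is copied.

build_distcp_cmd() {
  local src="$1" dst="$2" bandwidth_mb="$3"
  # -pbc       preserve block size (b) and checksum type (c) on the target
  # -bandwidth per-map bandwidth limit in MB, as described in the usage text
  printf 'hadoop distcp -pbc -bandwidth %s %s %s\n' "$bandwidth_mb" "$src" "$dst"
}

# Placeholder clusters and paths taken from the example; 10 is an assumed value.
cmd="$(build_distcp_cmd hdfs://namenode1/test hdfs://namenode2/test 10)"
echo "$cmd"

# Run it only after checking the printed command:
# eval "$cmd"
```

Printing the command first is a deliberate choice here: a cross-cluster copy can move a lot of data, so it helps to confirm the source, target, and options before starting the job.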