This is what I saw on Web UI recently
Configured Capacity : 232.5 GB
DFS Used : 112.44 GB
Non DFS Used : 119.46 GB
DFS Remaining : 61
The correct simplified definition is: "Any data that is not written by HDFS in the same filesystem(s) as the dfs.data.dirs
. In other words, if you use hdfs dfs
commands to copy data, it ends up under dfs.data.dirs
but then it is considered "DFS usage", and if you use regular cp
command to copy files into dfs.data.dirs
, then it will become "non-DFS usage".