Hadoop dfs replicate

情歌与酒 2021-01-02 23:57

Sorry guys, just a simple question, but I cannot find this exact question on Google. What does dfs.replication mean? If I made one file named filmdata.txt in HDFS,

4 answers
  •  长情又很酷
    2021-01-03 00:43

    HDFS stores data on commodity machines rather than high-end servers with lots of RAM, so there is always a chance of losing a data node (d1, d2, d3) or a block (b1, b2, b3). To guard against this, HDFS splits each file into blocks (typically 64 MB or 128 MB) and keeps three replicas of each block by default, with each replica stored on a separate data node (d1, d2, d3). Now suppose block b1 gets corrupted on data node d1. A copy of b1 is still available on data nodes d2 and d3, so the client can request b1 from d2 instead, and if d2 fails as well, the client can request it from d3. The number of copies kept for each block is what dfs.replication sets.
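The failover described above can be sketched as a toy simulation. This is not how HDFS is implemented: real block placement is rack-aware and handled by the NameNode. The function names, the round-robin placement policy, the extra data node d4, and the 300 MB file size are all assumptions made up for illustration.

```python
BLOCK_SIZE = 128 * 1024 * 1024  # assumed block size of 128 MB
REPLICATION = 3                 # mirrors the default dfs.replication value


def split_into_blocks(file_size, block_size=BLOCK_SIZE):
    """Return block ids b1, b2, ... needed to hold file_size bytes."""
    count = -(-file_size // block_size)  # ceiling division
    return [f"b{i + 1}" for i in range(count)]


def place_replicas(blocks, datanodes, replication=REPLICATION):
    """Assign each block to `replication` distinct data nodes.

    Uses a simple round-robin policy (hypothetical, not the HDFS policy).
    """
    if replication > len(datanodes):
        raise ValueError("need at least as many data nodes as replicas")
    placement = {}
    for i, block in enumerate(blocks):
        placement[block] = [
            datanodes[(i + k) % len(datanodes)] for k in range(replication)
        ]
    return placement


def read_block(block, placement, failed):
    """Return the first data node that still holds a healthy copy of block."""
    for node in placement[block]:
        if node not in failed:
            return node
    raise IOError(f"all replicas of {block} are lost")


# A hypothetical 300 MB file needs three 128 MB blocks.
blocks = split_into_blocks(300 * 1024 * 1024)
placement = place_replicas(blocks, ["d1", "d2", "d3", "d4"])

print(placement["b1"])                                  # nodes holding b1
print(read_block("b1", placement, failed={"d1"}))       # falls back to d2
print(read_block("b1", placement, failed={"d1", "d2"})) # falls back to d3
```

With a replication factor of 3, a block only becomes unreadable once all three data nodes holding it have failed, which is the point of the setting.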

    Hope you got some clarity.
