Hadoop MapReduce (Yarn) using hosts with different power/specifications

我是研究僧i 提交于 2019-12-11 12:09:03

问题


I currently have high power (cpu/ram) hosts in the cluster and we are considering to add some good storage but low power hosts.

My concern is that it will reduce the jobs performance. Map/Reducers from the new (less powerful) hosts will run slower and the more powerful ones will just have to wait for the result.

Is there a way to configure this in Yarn ? Maybe to set a priority for the hosts or to assign mapper/reducers according to the number of cores on each machines.

Thanks, Horatiu


回答1:


In YARN you can provide a different hardware resources configuration to each of the cluster workers. The YARN will then decide how many tasks (containers) may be allocated at each worker to balance the workload.

In other words you can set different number of cores and memory in yarn-site.xml for each worker separately.

These two configuraton options are relevant for you:

  • yarn.nodemanager.resource.cpu-vcores to set number of CPU cores per current worker node
  • yarn.nodemanager.resource.memory-mb to set the amount of physical memory (in MB) that can be used at the current worker node


来源:https://stackoverflow.com/questions/30840962/hadoop-mapreduce-yarn-using-hosts-with-different-power-specifications

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!