Dask Memory Management with Default Scheduler

Submitted by 巧了我就是萌 on 2019-12-24 11:35:08

Question


I have been trying to manage Dask's memory usage on a single local machine. For some reason, the scheduler set up by Client() and LocalCluster() always seems to break for me, whereas Dask works well when I do not specify a scheduler at all, so the default scheduler suits my purposes best. However, I can find almost no documentation on this default scheduler, let alone on how to set a RAM limit for it. All of the information I find is for the specialized distributed client, which does not seem to work for me. Does anyone know where the default scheduler is documented, or even which scheduler Dask uses by default? I cannot find that information on their site or in their documentation. Failing that, has anyone found a way to limit RAM usage without using Client()?

Cheers


Answer 1:


To control the memory limit in my local Dask cluster, I use the following setup:

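from dask.distributed import Client, LocalCluster

ncores = 2
# memory_limit applies per worker, so this allows 2 x 8e9 bytes in total
cluster = LocalCluster(n_workers=ncores, threads_per_worker=1, memory_limit=8e9)
client = Client(cluster)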
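
Because memory_limit in LocalCluster is applied to each worker rather than to the cluster as a whole, the setup above can use up to n_workers times that value. If the goal is a single overall RAM budget, one option is to divide the budget across the workers first. The sketch below illustrates that idea; the helper names per_worker_limit and make_local_client are hypothetical and not part of Dask, and the even split is an assumption about how the budget should be shared.

```python
def per_worker_limit(total_bytes, n_workers):
    """Split a total RAM budget evenly across workers (whole bytes)."""
    if n_workers < 1:
        raise ValueError("n_workers must be at least 1")
    return int(total_bytes) // n_workers


def make_local_client(total_bytes=8e9, n_workers=2):
    """Start a LocalCluster whose workers together stay near total_bytes.

    Hypothetical convenience wrapper, not a Dask API.
    """
    # Imported here so per_worker_limit can be used without dask installed.
    from dask.distributed import Client, LocalCluster

    cluster = LocalCluster(
        n_workers=n_workers,
        threads_per_worker=1,
        # memory_limit is a per-worker limit, hence the division above.
        memory_limit=per_worker_limit(total_bytes, n_workers),
    )
    return Client(cluster)
```

With these defaults, make_local_client() would give each of the two workers a limit of 4e9 bytes instead of 8e9 bytes each.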


Source: https://stackoverflow.com/questions/52998272/dask-memory-management-with-default-scheduler
