Python multiprocessing run time per process increases with number of processes


Question


I have a pool of workers that each perform an identical task, and I send each one a distinct clone of the same data object. I then measure the run time separately for each process, inside the worker function.

With one process, run time is 4 seconds. With 3 processes, the run time for each process goes up to 6 seconds.

With more complex tasks, this increase is even more pronounced.

There are no other CPU-hogging processes running on my system, and the workers don't use shared memory (as far as I can tell). The run times are measured inside the worker function, so I assume forking overhead shouldn't matter.

Why does this happen?

from time import time

def worker_fn(data):
    # Time only the work done inside this process, excluding pool/fork overhead.
    t1 = time()
    data.process()
    print time() - t1
    return data.results


def main(n, num_procs=3):
    from multiprocessing import Pool
    from cPickle import dumps, loads

    pool = Pool(processes=num_procs)
    data = MyClass()
    # Give every worker its own independent copy of the same data object.
    data_pickle = dumps(data)
    list_data = [loads(data_pickle) for i in range(n)]
    results = pool.map(worker_fn, list_data)
    return results
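
One way to narrow this down (a minimal sketch; instrumented_worker is a hypothetical variant of worker_fn, assuming the same MyClass interface as above) is to record CPU time alongside wall-clock time inside each worker. If wall-clock time grows with the number of processes while CPU time stays roughly constant, the workers are not doing more work; they are waiting on shared resources such as physical cores, memory bandwidth, or BLAS threads.

import os
from time import time

def instrumented_worker(data):
    # Report wall-clock time and this process's own CPU time side by side.
    wall_start = time()
    cpu_start = sum(os.times()[:2])   # user + system CPU time of this process
    data.process()
    wall = time() - wall_start
    cpu = sum(os.times()[:2]) - cpu_start
    print "wall: %.2fs  cpu: %.2fs" % (wall, cpu)
    return data.results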

Edit: Although I can't post the entire code for MyClass(), I can tell you that it involves a lot of numpy matrix operations. It seems that numpy's use of OpenBLAS may somehow be to blame.
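
If OpenBLAS is indeed involved, each worker process may be spawning its own set of BLAS threads, oversubscribing the available cores. A common way to test this (a sketch, not specific to MyClass) is to pin the BLAS libraries to a single thread per process and re-run the measurement. The environment variables must be set before numpy is imported anywhere in the program:

import os

# Limit BLAS-internal threading to one thread per worker process.
os.environ["OPENBLAS_NUM_THREADS"] = "1"
os.environ["OMP_NUM_THREADS"] = "1"
os.environ["MKL_NUM_THREADS"] = "1"

import numpy as np

If the per-process run time stops growing with the pool size after this change, the slowdown was contention between the pool's processes and the threads OpenBLAS starts inside each of them.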

Source: https://stackoverflow.com/questions/28789377/python-multiprocessing-run-time-per-process-increases-with-number-of-processes
