Different summation results with Parallel.ForEach

前端 未结 4 748
梦谈多话
梦谈多话 2020-12-15 18:31

I have a foreach loop that I am parallelizing and I noticed something odd. The code looks like

double sum = 0.0;

Parallel.ForEach(myCollection         


        
4条回答
  •  無奈伤痛
    2020-12-15 19:15

    Like the others answers mentioned, updating the sum variable from multiple threads (which is what Parallel.ForEach does) is not a thread-safe operation. The trivial fix of acquiring a lock before doing the update will fix that problem.

    double sum = 0.0;
    Parallel.ForEach(myCollection, arg => 
    { 
      lock (myCollection)
      {
        sum += ComplicatedFunction(arg);
      }
    });
    

    However, that introduces yet another problem. Since the lock is acquired on each iteration then that means the execution of each iteration will be effectively serialized. In other words, it would have been better to just use a plain old foreach loop.

    Now, the trick in getting this right is to partition the problem in separate and independent chucks. Fortunately that is super easy to do when all you want to do is sum the result of the iterations because the sum operation is commutative and associative and because the intermediate results of the iterations are independent.

    So here is how you do it.

    double sum = 0.0;
    Parallel.ForEach(myCollection,
        () => // Initializer
        {
            return 0D;
        },
        (item, state, subtotal) => // Loop body
        {
            return subtotal += ComplicatedFunction(item);
        },
        (subtotal) => // Accumulator
        {
            lock (myCollection)
            {
              sum += subtotal;
            }
        });
    

提交回复
热议问题