Question
I have a #pragma omp parallel for loop inside a class method. Each thread has read-only access to a few method-local variables, a few private class data members, and one of the method's parameters. All of them are declared in a shared clause.
My questions:
- Performance-wise, it should not make any difference whether I declare these variables shared or firstprivate. Right?
- Is the same true if I'm not careful to keep the variables from sharing the same cache line?
- If one of the shared variables is a pointer and inside the thread I read a value through it, is there an aliasing problem like in ordinary loops?
Tomorrow I will try to profile my code. In the meantime thanks for your advice!
Answer 1:
Well, they're not the same thing. With shared, they are shared between all the threads. With firstprivate, each thread gets its own copy. If you're only reading the variable, then it's better to leave it as shared so as to avoid copying it. (In C++, firstprivate will implicitly invoke the copy constructor.)
Correct, multiple threads reading and writing to values that sit on the same cache line is called false sharing. The cache line will bounce back and forth between the cores that are accessing it, which can result in significant slowdown if it happens often enough.
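To make the copying point about shared versus firstprivate concrete, here is a minimal sketch that counts copy-constructor calls under each clause. The Params type, its copies counter, and the run_shared / run_firstprivate functions are hypothetical names made up for this illustration; they are not from the question's code. It assumes the file is compiled with OpenMP enabled (for example -fopenmp with GCC); without that flag the pragmas are ignored and no copies are made in either function.

```cpp
#include <atomic>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical read-only parameter object that counts its copy constructions.
struct Params {
    static std::atomic<int> copies;
    std::vector<double> weights;

    explicit Params(std::vector<double> w) : weights(std::move(w)) {}
    Params(const Params& other) : weights(other.weights) { ++copies; }
};
std::atomic<int> Params::copies{0};

struct Result {
    double total;  // sum computed by the loop
    int copies;    // copy-constructor calls observed during the loop
};

Result run_shared(int n) {
    Params p(std::vector<double>{1.0, 2.0, 3.0});
    Params::copies = 0;
    double total = 0.0;
    // All threads read the single object p; nothing is copied.
    #pragma omp parallel for shared(p) reduction(+ : total)
    for (int i = 0; i < n; ++i) {
        total += p.weights[static_cast<std::size_t>(i) % p.weights.size()];
    }
    return Result{total, Params::copies.load()};
}

Result run_firstprivate(int n) {
    Params p(std::vector<double>{1.0, 2.0, 3.0});
    Params::copies = 0;
    double total = 0.0;
    // Each thread copy-constructs its own private p before running its iterations.
    #pragma omp parallel for firstprivate(p) reduction(+ : total)
    for (int i = 0; i < n; ++i) {
        total += p.weights[static_cast<std::size_t>(i) % p.weights.size()];
    }
    return Result{total, Params::copies.load()};
}
```

Both functions return the same total. With OpenMP enabled, run_firstprivate should report roughly one copy per thread, while run_shared reports none. For a small scalar the copy is negligible; for an object that owns heap memory, as in this sketch, it is not.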
If you're just reading data through the shared pointer, then there shouldn't be a problem. But if you're also writing to it, then you need to make sure you don't have a race condition.
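As a rough sketch of the read-only case, the loop below dereferences a shared pointer in every iteration and accumulates into a reduction variable, so no thread ever writes to shared memory. The function name scaled_sum and its parameters are assumptions made for the example, not part of the original code.

```cpp
// Sums values[0..n) multiplied by *scale.
// Both pointers are shared, and every thread only reads through them.
double scaled_sum(const double* values, int n, const double* scale) {
    double total = 0.0;
    // reduction gives each thread a private partial sum and combines them at
    // the end, so the only write target in the loop body is thread-local.
    #pragma omp parallel for shared(values, scale) reduction(+ : total)
    for (int i = 0; i < n; ++i) {
        total += values[i] * (*scale);
    }
    return total;
}
```

If the loop body instead wrote through scale, or accumulated into total without the reduction clause, the threads would race and the writes would need a reduction, an atomic, or a critical section.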
Source: https://stackoverflow.com/questions/7865555/openmp-shared-vs-firstprivate-performancewise