I am new to programming in general, so please keep that in mind when you answer my question.
I have a program that takes a large 3D array (1 billion elements) and sums up
If you can divide the array so that no two threads read from or write to the same positions, the threads won't have to wait on each other, and that should increase your speed.
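Here is a rough sketch of that idea, not a drop-in solution. I am assuming C++ with `std::thread`, and that the 3D array is stored as one flat, contiguous block of 32-bit integers; the name `parallel_sum` and the element type are made up for illustration. Each thread gets its own contiguous slice to read and its own slot to write its result into, so no two threads ever touch the same position.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <numeric>
#include <thread>
#include <vector>

// Hypothetical helper: sums `data` using `num_threads` threads.
// Assumes the 3D array has been flattened into one contiguous vector.
std::int64_t parallel_sum(const std::vector<std::int32_t>& data,
                          std::size_t num_threads) {
    assert(num_threads > 0);

    // One result slot per thread; thread t writes only partial[t].
    std::vector<std::int64_t> partial(num_threads, 0);
    std::vector<std::thread> workers;

    // Slice length, rounded up so every element is covered.
    const std::size_t chunk = (data.size() + num_threads - 1) / num_threads;

    for (std::size_t t = 0; t < num_threads; ++t) {
        // Half-open range [begin, end); ranges of different threads never overlap.
        const std::size_t begin = std::min(t * chunk, data.size());
        const std::size_t end = std::min(begin + chunk, data.size());

        workers.emplace_back([&data, &partial, t, begin, end] {
            // Accumulate in a local variable, then write the shared slot once.
            std::int64_t local = 0;
            for (std::size_t i = begin; i < end; ++i) {
                local += data[i];
            }
            partial[t] = local;
        });
    }

    // Wait for every thread before reading the partial results.
    for (auto& w : workers) {
        w.join();
    }

    // Combine the per-thread results on the calling thread.
    return std::accumulate(partial.begin(), partial.end(), std::int64_t{0});
}
```

You would call it as something like `parallel_sum(values, 4)`. Because each thread only reads its own slice and only writes its own slot, no locks are needed; the single combining step at the end happens after all threads have finished.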