How to pin threads to cores with predetermined memory pool objects? (80-core Nehalem architecture, 2 TB RAM)
Question: I've run into a minor HPC problem after running some tests on an 80-core (160 HT) Nehalem architecture with 2 TB of DRAM: a server with more than 2 sockets starts to stall a lot (delay) as each thread starts to request information about objects on the "wrong" socket, i.e. requests go from a thread that is working on some objects on one socket to pull information that actually resides in the DRAM attached to another socket. The cores appear 100% utilized, even though I know that they are waiting for the