hpc

How to pin threads to cores with predetermined memory pool objects? (80 core Nehalem architecture 2Tb RAM)

两盒软妹~` 提交于 2019-12-03 09:04:47
问题 I've run into a minor HPC problem after running some tests on a 80core (160HT) nehalem architecture with 2Tb DRAM: A server with more than 2 sockets starts to stall a lot (delay) as each thread starts to request information about objects on the "wrong" socket, i.e. requests goes from a thread that is working on some objects on the one socket to pull information that is actually in the DRAM on the other socket. The cores appear 100% utilized, even though I know that they are waiting for the

How to find from where a job is submitted in SLURM?

梦想与她 提交于 2019-12-03 07:28:15
问题 I submitted several jobs via SLURM to our school's HPC cluster. Because the shell scripts all have the same name, so the job names appear exactly the same. It looks like [myUserName@rclogin06 ~]$ sacct -u myUserName JobID JobName Partition Account AllocCPUS State ExitCode ------------ ---------- ---------- ---------- ---------- ---------- -------- 12577766 run.sh general ourQueue_+ 4 RUNNING 0:0 12659777 run.sh general ourQueue_+ 8 RUNNING 0:0 12675983 run.sh general ourQueue_+ 16 RUNNING 0:0

Intel MKL vs. AMD Math Core Library

霸气de小男生 提交于 2019-12-03 04:52:33
问题 Does anybody have experience programming for both the Intel Math Kernel Library and the AMD Math Core Library? I'm building a personal computer for high performance statistical computations and am debating on the components to buy. An appeal of the AMD Math Core library is that it is free, but I am in academia so the MKL is not that expensive. But I'd be interested in hearing thoughts on: Which provides a better API? Which provides better performance, on average, per dollar, including

Difference between nVidia Quadro and Geforce cards? [closed]

半城伤御伤魂 提交于 2019-12-03 00:45:25
问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 4 years ago . I'm not a 3D or HPC guy, but I've been tasked with doing some research into those fields for a possible HPC application. Reading benchmarks, comparisons and specs between nVidia Quadro and Geforce cards, it seems that for similar generation cards: Quadro is 2x-3x the price of Geforce hardware wise, the

How to pin threads to cores with predetermined memory pool objects? (80 core Nehalem architecture 2Tb RAM)

泄露秘密 提交于 2019-12-02 23:11:15
I've run into a minor HPC problem after running some tests on a 80core (160HT) nehalem architecture with 2Tb DRAM: A server with more than 2 sockets starts to stall a lot (delay) as each thread starts to request information about objects on the "wrong" socket, i.e. requests goes from a thread that is working on some objects on the one socket to pull information that is actually in the DRAM on the other socket. The cores appear 100% utilized, even though I know that they are waiting for the remote socket to return the request. As most of the code runs asynchronously it is a lot easier to

How to find from where a job is submitted in SLURM?

跟風遠走 提交于 2019-12-02 22:20:24
I submitted several jobs via SLURM to our school's HPC cluster. Because the shell scripts all have the same name, so the job names appear exactly the same. It looks like [myUserName@rclogin06 ~]$ sacct -u myUserName JobID JobName Partition Account AllocCPUS State ExitCode ------------ ---------- ---------- ---------- ---------- ---------- -------- 12577766 run.sh general ourQueue_+ 4 RUNNING 0:0 12659777 run.sh general ourQueue_+ 8 RUNNING 0:0 12675983 run.sh general ourQueue_+ 16 RUNNING 0:0 How can I know from which directory a job is submitted so that I can differentiate the jobs? You can

Setting up a high performance computing cluster on servers that run different OSs

♀尐吖头ヾ 提交于 2019-12-02 19:14:38
问题 In my lab, we have several servers used for the simulation programs, but they worked independently. Now I want to combine them to become a cluster using MPICH to make them communicate. But there exists a problem, which is that these servers have different OSs. Some of them are Redhat, and some of them are Ubuntu. And on the homepage of MPICH, I saw that download sites of these two different operating systems are different, so will it be possible to set up a cluster with different operating

Difference between nVidia Quadro and Geforce cards? [closed]

╄→гoц情女王★ 提交于 2019-12-02 16:08:58
I'm not a 3D or HPC guy, but I've been tasked with doing some research into those fields for a possible HPC application. Reading benchmarks, comparisons and specs between nVidia Quadro and Geforce cards, it seems that for similar generation cards: Quadro is 2x-3x the price of Geforce hardware wise, the differences are not that great in benchmarks (3ds Max, Maya and some others) Quadro cards are much better performing than Geforce ones Does anyone know what are the exact and precise technical differences that can cause such better performance? My speculation (and what can be generally read on

Setting up a high performance computing cluster on servers that run different OSs

随声附和 提交于 2019-12-02 09:46:46
In my lab, we have several servers used for the simulation programs, but they worked independently. Now I want to combine them to become a cluster using MPICH to make them communicate. But there exists a problem, which is that these servers have different OSs. Some of them are Redhat, and some of them are Ubuntu. And on the homepage of MPICH, I saw that download sites of these two different operating systems are different, so will it be possible to set up a cluster with different operating system? And how to do it? The reason why I don't want to reinstall these servers is that there are too

mpirun - not enough slots available

守給你的承諾、 提交于 2019-11-30 17:09:37
Usually when I use mpirun, I can "overload" it, using more processors than there acctually are on my computer. For example, on my four-core mac, I can run mpirun -np 29 python -c "print 'hey'" no problem. I'm on another machine now, which is throwing the following error: $ mpirun -np 25 python -c "print 'hey'" -------------------------------------------------------------------------- There are not enough slots available in the system to satisfy the 25 slots that were requested by the application: python Either request fewer slots for your application, or make more slots available for use. ----