multicore

Matlabpool number of threads vs core

天大地大妈咪最大 提交于 2019-11-29 07:07:47
I have a laptop running Ubuntu on Intel(R) Core(TM) i5-2410M CPU @ 2.30GHz . According to Intel website for the above processor (located here ), this processor has two cores and can run 4 threads at a time in parallel (because although it has 2 physical cores it has 4 logical cores). When I start matlabpool it starts with local configuration and says it has connected to 2 labs. I suppose this means that it can run 2 threads in parallel. Does it not know that the CPU can actually run 4 threads in parallel? In my experience, the local configuration of matlabpool uses, by default, the number of

What is a Warm-Up Cache?

筅森魡賤 提交于 2019-11-29 05:57:31
问题 I am working with some multicore simulators such as GEMS or M5. In all of them there is an option to "Warm up the cache". What does that term mean? 回答1: The warm up is just the period of loading a set of data so that the cache gets populated with valid data. If you're doing performance testing against a system that usually has a high frequency of cache hits, without the warm up you'll get false numbers because what would normally be a cache hit in your usage scenario is not and will drag your

Assembly instructions to find how many threads are enabled in a multi-core system

早过忘川 提交于 2019-11-29 04:47:38
I'm working on a bare-bones system in which I need to determine sometime after boot how many cores and threads are enabled, so that I can send them SIPI events. I also want each thread to know which thread it is. For instance, in a single-core configuration with HT enabled, we have (for instance, Intel Atom): thread 0 --> core 0 thread 0 thread 1 --> core 0 thread 1 While in a dual-core configuration with no HT we have (for instance, Core 2 Duo): thread 0 --> core 0 thread 0 thread 1 --> core 1 thread 0 What's the best way to determine this? Edit: I found how each thread can find which thread

Scalable memory allocator experiences

别等时光非礼了梦想. 提交于 2019-11-29 04:41:02
问题 I am currently evaluating a few of scalable memory allocators, namely nedmalloc and ptmalloc (both built on top of dlmalloc), as a replacement for default malloc / new because of significant contention seen in multithreaded environment. Their published performance seems to be good, however I would like to check what are experiences of other people who have really used them. Were your performance goals satisfied? Did you experience any unexpected or hard to solve issues (like heap corruption)?

Are you concerned about multicore? [closed]

谁说我不能喝 提交于 2019-11-29 03:41:18
This is undeniable: multicore computers are here to stay. So is this: efficient multicore programming is pretty difficult. It's not just a case of understanding pthreads. This is arguable: the 'developer on the street' need concern him/herself with these developments. To what extent are you concerned about having to expand your skillset for multicore? Is the software you are writing a candidate for parallelisation, and if so are you doing anything to educate yourself (if you didn't already know the techniques)? Or do you believe that the operating system will take care of most of it, the

Are cache-line-ping-pong and false sharing the same?

冷暖自知 提交于 2019-11-29 02:52:14
问题 For my bachelor thesis I have to evaluate common problems on multicore systems. In some books I have read about false sharing and in other books about cache-line-ping-pong. The specific problems sound very familiar, so are these the same problems but given other names? Can someone give me names of books which discuss these topics in detail? (I already have literature from Darry Glove, Tanenbaum,...) 回答1: Summary: False sharing and cache-line ping-ponging are related but not the same thing.

Why does this Java code not utilize all CPU cores?

人盡茶涼 提交于 2019-11-29 01:32:28
The attached simple Java code should load all available cpu core when starting it with the right parameters. So for instance, you start it with java VMTest 8 int 0 and it will start 8 threads that do nothing else than looping and adding 2 to an integer. Something that runs in registers and not even allocates new memory. The problem we are facing now is, that we do not get a 24 core machine loaded (AMD 2 sockets with 12 cores each), when running this simple program (with 24 threads of course). Similar things happen with 2 programs each 12 threads or smaller machines. So our suspicion is that

numpy on multicore hardware

我是研究僧i 提交于 2019-11-29 01:26:00
What's the state of the art with regards to getting numpy to use mutliple cores (on Intel hardware) for things like inner and outer vector products, vector-matrix multiplications etc? I am happy to rebuild numpy if necessary, but at this point I am looking at ways to speed things up without changing my code. For reference, my show_config() is as follows, and I've never observed numpy to use more than one core: atlas_threads_info: libraries = ['lapack', 'ptf77blas', 'ptcblas', 'atlas'] library_dirs = ['/usr/local/atlas-3.9.16/lib'] language = f77 include_dirs = ['/usr/local/atlas-3.9.16/include

Multi-core usage, threads, thread-pools

筅森魡賤 提交于 2019-11-29 01:00:04
问题 I have some questions about multi-threaded programming and multi-core usage. In particular I'm wondering how the operating system and/or framework (this is .NET) deals with cores that are heavily used. Here's my questions regarding threads: When a new thread is spawned, what is the algorithm for assigning the thread to a particular core? Round-robin type of algorithm Random The currently least used core If not the currently least used core, would this type of code that determined this dwarf

Using Hardware Performance Counters in Linux

好久不见. 提交于 2019-11-29 00:36:19
问题 I want to use the Hardware Performance Counters that come with the Intel and AMD x86_64 multicore processors to calculate the number of retired stores by a program. I want each thread to calculate its retired stores separately. Can it be done? And if so, how in C/C++? 回答1: You can use Perfctr or PAPI if you want to count hardware events on some part of the program internally (without starting any 3rd party tool). Perfctr quickstart: http://www.ale.csce.kyushu-u.ac.jp/~satoshi/how_to_use