profiling

Profiling thread sync lock contention

一笑奈何 submitted on 2019-12-19 08:06:12
Question: In the past I've used performance profiling tools such as nprof, Equatec profiler and Yourkit profiler to identify and remove or reduce performance bottlenecks in code that mostly runs in one thread (serialized execution). Nowadays I write a lot of multi-threaded code, which can be slowed down by lock contention. What tools and tricks can be used to identify where lock contention is occurring, and by how much?
Answer 1: This is available soon, currently shipping in Visual Studio 2010 beta 2. Here's a
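One tool-independent trick for the question above is to wrap a lock so that it records how often callers had to wait and for how long. The following is a minimal sketch in Python rather than the asker's .NET environment; `TimedLock`, `worker` and `run_demo` are hypothetical names introduced only for illustration, and this is not how any of the profilers mentioned work internally.

```python
import threading
import time


class TimedLock:
    """Lock wrapper that records how long callers wait to acquire it."""

    def __init__(self, name):
        self.name = name
        self._lock = threading.Lock()
        # The counters are only updated while _lock is held,
        # so they need no extra synchronization.
        self.acquisitions = 0
        self.contended = 0
        self.total_wait = 0.0

    def __enter__(self):
        # A failed non-blocking attempt means another thread holds the lock.
        if self._lock.acquire(blocking=False):
            waited, was_contended = 0.0, False
        else:
            start = time.perf_counter()
            self._lock.acquire()
            waited, was_contended = time.perf_counter() - start, True
        self.acquisitions += 1
        self.contended += int(was_contended)
        self.total_wait += waited
        return self

    def __exit__(self, exc_type, exc, tb):
        self._lock.release()
        return False


def worker(lock, rounds, hold_seconds):
    # Simulate work done while holding the lock.
    for _ in range(rounds):
        with lock:
            time.sleep(hold_seconds)


def run_demo(threads=4, rounds=5, hold_seconds=0.01):
    lock = TimedLock("cache")
    pool = [
        threading.Thread(target=worker, args=(lock, rounds, hold_seconds))
        for _ in range(threads)
    ]
    for t in pool:
        t.start()
    for t in pool:
        t.join()
    return lock


if __name__ == "__main__":
    lock = run_demo()
    print(
        f"{lock.name}: {lock.contended}/{lock.acquisitions} acquisitions "
        f"contended, {lock.total_wait:.3f}s spent waiting"
    )
```

Giving each lock its own name and comparing `total_wait` across locks indicates where contention is concentrated. The wrapper adds overhead of its own, so the numbers are a rough guide rather than an exact measurement.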

Profiling thread sync lock contention

梦想与她 submitted on 2019-12-19 08:05:13
Question: In the past I've used performance profiling tools such as nprof, Equatec profiler and Yourkit profiler to identify and remove or reduce performance bottlenecks in code that mostly runs in one thread (serialized execution). Nowadays I write a lot of multi-threaded code, which can be slowed down by lock contention. What tools and tricks can be used to identify where lock contention is occurring, and by how much?
Answer 1: This is available soon, currently shipping in Visual Studio 2010 beta 2. Here's a

What are “Other” Issue Stall Reasons displayed by the Nsight profiler?

穿精又带淫゛_ submitted on 2019-12-19 07:29:09
Question: I have a kernel that performs poorly on CC 3.0 (Kepler) compared with CC 2.0 (Fermi). In the Nsight profiler, the Warp Issue Efficiency chart shows that 60% of the time there were no eligible warps, and the Issue Stall Reasons chart shows that 60% of these are due to "Other". I'm wondering what the "Other" issue stall reasons are and what I might do to reduce them. Environment: CUDA 5.0 / Nsight 3.0 RC / CC 3.0.
Answer 1: In Nsight Visual Studio Edition 3.0 CUDA Profiler the Issue Efficiency

Why can't dotMemory collect memory traffic and stack trace info when attached to a live process?

痴心易碎 submitted on 2019-12-19 06:14:49
Question: I have an ASP.NET app that needs to build a very large local cache before it can serve requests. The app is quite large and I don't want to profile all of it; I only want to profile the code path that serves the request. I started using dotMemory to track my app's memory usage. When I try to attach to a process, two checkboxes in the profiler options dialog, 1) collect every N-th object and 2) collect memory traffic, can't be checked. Why

Is it possible to speed up a quicksort with par in Haskell?

本秂侑毒 submitted on 2019-12-19 05:13:10
Question: I have this seemingly trivial parallel quicksort implementation. The code is as follows:

    import System.Random
    import Control.Parallel
    import Data.List

    quicksort :: Ord a => [a] -> [a]
    quicksort xs = pQuicksort 16 xs -- 16 is the number of sparks used to sort

    -- pQuicksort, parallelQuicksort
    -- As long as n > 0 evaluates the lower and upper part of the list in parallel,
    -- when we have recursed deep enough, n==0, this turns into a serial quicksort.
    pQuicksort :: Ord a => Int -> [a] -> [a]

Gprof: specific function time [duplicate]

断了今生、忘了曾经 submitted on 2019-12-19 04:15:33
Question: This question already has answers here: Function execution time (2 answers). Closed 5 years ago.
I want to find out the time spent in a particular function in my program. For that purpose, I am using gprof. I used the following command to get the time for the specific function, but the log file still shows the results for all the functions in the program. Please help me with this.

    gprof -F FunctionName Executable gmon.out>log

Answer 1: You are nearly repeating another
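When a profiler report covers more than the one function of interest, a common fallback is to time that function directly. The sketch below shows the idea in Python rather than the asker's compiled program, and it is not a gprof feature; `timed` and `checksum` are hypothetical names used only for illustration.

```python
import functools
import time


def timed(func):
    """Accumulate the call count and wall-clock time of one function."""

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        try:
            return func(*args, **kwargs)
        finally:
            # Runs even if func raises, so failed calls are counted too.
            wrapper.calls += 1
            wrapper.total_seconds += time.perf_counter() - start

    wrapper.calls = 0
    wrapper.total_seconds = 0.0
    return wrapper


@timed
def checksum(values):
    # Hypothetical function of interest.
    return sum(v * v for v in values) % 65521


def main():
    data = list(range(1000))
    for _ in range(50):
        checksum(data)
    return checksum.calls, checksum.total_seconds


if __name__ == "__main__":
    calls, seconds = main()
    print(f"checksum: {calls} calls, {seconds:.6f}s total")
```

The measured time is inclusive: it counts everything the function calls. For a recursive function the nested calls would be counted more than once, so this approach suits non-recursive entry points best.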

Gprof: specific function time [duplicate]

纵然是瞬间 submitted on 2019-12-19 04:15:22
Question: This question already has answers here: Function execution time (2 answers). Closed 5 years ago.
I want to find out the time spent in a particular function in my program. For that purpose, I am using gprof. I used the following command to get the time for the specific function, but the log file still shows the results for all the functions in the program. Please help me with this.

    gprof -F FunctionName Executable gmon.out>log

Answer 1: You are nearly repeating another

Using VADump to track memory usage - OpenProcess Failed c0000034

让人想犯罪 __ submitted on 2019-12-19 04:14:35
Question: After reading an article in the most recent issue of MSDN magazine, I wanted to try using VADump to break down the memory usage of some applications I've written. The only copy of the executable on my PC is at:

    c:\Program Files\Microsoft Visual Studio 8\Common7\Tools\Bin\winnt\vadump.exe

No matter what combination of arguments I give VADump, it returns the error:

    OpenProcess Failed c0000034

Googling for this turns up a few others who've had the same issue, but none of these posts had

If a Julia script is run from the command line, does it need to be re-compiled every time?

南笙酒味 submitted on 2019-12-19 03:14:22
Question: I've read through quite a lot of documentation and questions, but I'm still confused about this. The Profiling section of the documentation suggests first running the target function once in the REPL, so that it is already compiled before being profiled. However, what if the script is fairly complicated and is intended to be run from the command line, taking arguments? When the julia process finishes and I run the script a second time, is the compilation performed again? Posts like https:/

Profile Haskell without installing profiling libraries for all dependencies

别来无恙 submitted on 2018-12-18 14:29:29
Question: I wish to profile my program written in Haskell. On compilation, I am told that I do not have profiling libraries installed for certain dependencies (e.g., criterion), and cabal aborts. I have no interest in profiling parts of those dependencies; code called from main doesn't even use them. How can I profile my application without installing profiling libraries I don't need and without removing all those dependencies?
Answer 1: A good way to circumvent having to compile everything with profiling