GPU621 Team 1

No change in size, 21:08, 30 November 2016
VTune Tutorial 2: Locks and Wait Tutorial
The Thread Concurrency Histogram represents the Elapsed time and concurrency level for the specified number of running threads. Ideally, the highest bar of your chart should be within the OK or Ideal utilization range.
[[File:78.PNG]]
Note the Target Concurrency value. By default, this number is equal to the number of physical cores. Consider this number as your optimization goal.
For the sample code, the chart shows that analyze_locks is a multithreaded application that runs a maximum of 12 threads simultaneously on a machine with 12 cores, but it is not using the available cores effectively.
Hover over the second bar to understand how long the application ran serially. The tooltip shows that the application ran one thread for almost 56.343 611 seconds, which is classified as Poor concurrency.
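To make the histogram concrete, here is a minimal sketch of how elapsed time per concurrency level could be tallied from per-thread run intervals. It is not how VTune computes the chart internally; the function name `concurrency_histogram` and the interval data are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

def concurrency_histogram(intervals):
    """Map each concurrency level to the elapsed seconds spent at that level.

    `intervals` is a list of (start, end) times during which a thread was running.
    """
    events = []
    for start, end in intervals:
        events.append((start, 1))   # a thread starts running
        events.append((end, -1))    # a thread stops running
    events.sort()

    histogram = defaultdict(float)
    level = 0
    prev_time = events[0][0] if events else 0.0
    for time, delta in events:
        if time > prev_time:
            # Credit the time since the previous event to the level held during it.
            histogram[level] += time - prev_time
            prev_time = time
        level += delta
    return dict(histogram)

# Hypothetical data: one thread runs for 10 s, three others run only briefly.
intervals = [(0.0, 10.0), (2.0, 4.0), (2.0, 4.0), (3.0, 4.0)]
hist = concurrency_histogram(intervals)
for threads in sorted(hist):
    print(f"{threads} thread(s): {hist[threads]:.1f} s")
```

In this made-up data set, most of the elapsed time falls in the one-thread bar, which is the same pattern the tooltip reports for analyze_locks: the program is multithreaded but spends most of its run effectively serial.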
The CPU Usage Histogram represents the Elapsed time and usage level for the logical CPUs. Ideally, the highest bar of your chart should be within the OK or Ideal utilization range.