
GPU621 Team 1

740 bytes added, 20:04, 30 November 2016
VTune Tutorial 2: Locks and Wait Tutorial
[[File:7.PNG]]
 
Note the Target Concurrency value. By default, this number is equal to the number of physical cores. Consider this number as your optimization goal.
 
For the sample code, the chart shows that analyze_locks is a multithreaded application that runs a maximum of 12 threads simultaneously on a machine with 12 cores, but it does not use the available cores effectively.
 
Hover over the second bar to see how long the application ran serially. The tooltip shows that the application ran on a single thread for about 5.343 seconds, which is classified as Poor concurrency.
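To illustrate how a multithreaded program can still spend most of its time running serially, here is a minimal C++ sketch. It is not the analyze_locks sample itself; the function names <code>sum_coarse_lock</code> and <code>sum_local_then_merge</code> are hypothetical and exist only for this illustration. The first function takes a shared mutex on every iteration, so threads mostly wait on each other. The second does its work on a thread-local value and locks only once per thread.

```cpp
#include <cassert>
#include <cstdint>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical example: every increment happens under one shared mutex,
// so only one thread makes progress at a time (poor concurrency).
std::uint64_t sum_coarse_lock(unsigned num_threads, std::uint64_t iterations) {
    assert(num_threads > 0);
    std::mutex m;
    std::uint64_t total = 0;
    std::vector<std::thread> workers;
    for (unsigned t = 0; t < num_threads; ++t) {
        workers.emplace_back([&] {
            for (std::uint64_t i = 0; i < iterations; ++i) {
                std::lock_guard<std::mutex> guard(m);  // contended on every iteration
                ++total;
            }
        });
    }
    for (auto& w : workers) w.join();
    return total;
}

// Hypothetical alternative: each thread accumulates locally and takes the
// lock once to merge its result, which shortens the serialized region.
std::uint64_t sum_local_then_merge(unsigned num_threads, std::uint64_t iterations) {
    assert(num_threads > 0);
    std::mutex m;
    std::uint64_t total = 0;
    std::vector<std::thread> workers;
    for (unsigned t = 0; t < num_threads; ++t) {
        workers.emplace_back([&] {
            std::uint64_t local = 0;
            for (std::uint64_t i = 0; i < iterations; ++i) {
                ++local;  // no shared state touched here
            }
            std::lock_guard<std::mutex> guard(m);  // one short critical section per thread
            total += local;
        });
    }
    for (auto& w : workers) w.join();
    return total;
}
```

Both functions return the same total (<code>num_threads * iterations</code>). In a Locks and Waits analysis, the first version would be expected to show far more wait time on the mutex than the second, although the exact numbers depend on the machine and the compiler settings.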
 
The CPU Usage Histogram represents the Elapsed time and usage level for the logical CPUs. Ideally, the highest bar of your chart should be within the Ok or Ideal utilization range.
== VTune Tutorial 3: Disk Input and Output Analysis ==