Changes

Jump to: navigation, search

GPU621/VTuners

771 bytes added, 21:42, 5 December 2022
Parallelism
[[File:Vtune Roadmap.png|500px]]
 
 
'''Total Thread Count''': This section indicates the number of threads used when running the application. The term Thread Oversubscription indicates time spent in the code with the number of simultaneously working threads more than the number of available logical cores on the system.
 
Wait Time with poor CPU Utilization The value is the accumulated wait time of each thread where APIs blocks or cause synchronization. Therefore, this value can be higher than the application's Elapsed Time.
 
'''Top waiting objects'''': the Top Waiting Object section provides a table listing object names that took most time waiting in the application. Reasons for waiting could be function calls or synchronization. The higher wait time the more reductions of parallelism.
 
[[File:Effective-gpu.png|500px]]
117
edits

Navigation menu