Changes

GPU621/VTuners

771 bytes added, 21:42, 5 December 2022

→‎Parallelism

[[File:Vtune Roadmap.png|500px]]

'''Total Thread Count''': This section indicates the number of threads used when running the application. The term Thread Oversubscription indicates time spent in the code with the number of simultaneously working threads more than the number of available logical cores on the system.

Wait Time with poor CPU Utilization The value is the accumulated wait time of each thread where APIs blocks or cause synchronization. Therefore, this value can be higher than the application's Elapsed Time.

'''Top waiting objects'''': the Top Waiting Object section provides a table listing object names that took most time waiting in the application. Reasons for waiting could be function calls or synchronization. The higher wait time the more reductions of parallelism.

[[File:Effective-gpu.png|500px]]

Nko4

117

edits

Changes

GPU621/VTuners

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

get involved with CDOT

courses

course projects

links

Tools