GPU621/VTuners

20 bytes added, 20:37, 5 December 2022
Parallelism
By evaluating compute-intensive or throughput-oriented high-performance computing (HPC) applications for CPU efficiency, vectorization, and memory allocation, the parallelism feature lets users check how efficiently their threaded code runs and identify the threading issues that affect performance. The terms explained below are the most common statistics; in an advanced version, algorithm-specific analysis may be available (see Method for OpenMP Code Analysis and Schedule Overhead in Intel® oneAPI Threading Building Blocks Applications).
{| class="wikitable"
! Term !! Description
|-
| Parallelism Pattern || OpenMP, OpenMP-MPI, TBB
|}
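
As an illustration of the kind of threaded code such an analysis examines, the following is a minimal sketch of an OpenMP loop with uneven per-iteration cost. The names <code>work</code> and <code>imbalanced_sum</code> are hypothetical and do not come from the original page. With <code>schedule(static)</code>, iterations are split into contiguous, roughly equal-sized chunks, so the thread that receives the last chunk does the most work while the others finish early and wait; this idle time is the sort of thread issue a parallelism analysis would be expected to expose.

```cpp
#include <cassert>

// Hypothetical workload: the cost of iteration i grows with i,
// so later iterations are more expensive than earlier ones.
double work(int i) {
    double acc = 0.0;
    for (int k = 0; k < i; ++k) {
        acc += 1.0 / (k + 1.0);  // partial harmonic sum
    }
    return acc;
}

// Sums work(i) for i in [0, n).
// Assumption: built with OpenMP enabled (e.g. -fopenmp on GCC/Clang).
// Without OpenMP the pragma is ignored and the loop runs serially.
double imbalanced_sum(int n) {
    assert(n >= 0);
    double sum = 0.0;
    // Static scheduling hands each thread one contiguous block of
    // iterations, which is uneven here because cost rises with i.
    #pragma omp parallel for reduction(+:sum) schedule(static)
    for (int i = 0; i < n; ++i) {
        sum += work(i);
    }
    return sum;
}
```

Calling <code>imbalanced_sum</code> with a large <code>n</code> from a small driver program and profiling the result, then repeating with <code>schedule(dynamic)</code>, is one way to compare how the scheduling choice affects thread utilization.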
 
== Platform and I/O ==