Open main menu

CDOT Wiki β

Changes

GPU621/VTuners

805 bytes added, 21:36, 5 December 2022
no edit summary
== Parallelism ==
 
By evaluating compute-intense or throughput high-performance computing (HPC) applications for CPU efficiency, vectorization, and memory allocation, the parallelism feature enables users to check how efficient their threaded code is and can identify the thread issues that affect performance. The terms explained below are the most common statistics, in an advanced version, algorithm-specific analysis may be available, (see Method for OpenMP Code Analysis and Schedule Overhead in Intel® oneAPI Threading Building Blocks Applications)
 
! title !! something
|-
| Main Analysis Features || Threading, HPC Performance Characterization
|-
| Suggested Intel Compiler Version || Intel Composer XE 2013 Update 2 or higher (for CPU utilization analysis)
|-
| Parallelism Pattern || OpenMP, OpenMP-MPI, TBB
|}
 
== Platform and I/O ==
117
edits