Changes

Jump to: navigation, search

GPU621/GPU Targeters

1 byte removed, 19:22, 3 December 2020
Means of parallelisation on GPUs
CUDA version 9.2, using multiple P100 server GPUs, you can realize up to 50x performance improvements over CPUs.
 
OpenMP API specification for parallel programming provides an application programming interface (API) that supports multi-platform shared memory multiprocessing programming in C, C++, and Fortran, on most platforms. It consists of a set of compiler directives, library routines, and environment variables that influence run-time behavior.
 
 
The HIPify tool automates much of the conversion work by performing a source-to-source transformation from Cuda to HIP. HIP code can run on AMD hardware (through the HCC compiler) or Nvidia hardware (through the NVCC compiler) with no performance loss compared with the original Cuda code.
more [More information: https://www.olcf.ornl.gov/wp-content/uploads/2019/09/AMD_GPU_HIP_training_20190906.pdf]
51
edits

Navigation menu