Changes

Jump to: navigation, search

BetaT

12 bytes added, 15:09, 12 April 2017
GETTING 100% OCCUPANCY PER MULTIPROCESSOR
The CUDA Toolkit includes a spreadsheet that accepts as parameters the compute capability, the number of threads per block, the number of registers per thread and the shared memory per block. This spreadsheet evaluates these parameters against the resource limitations of the specified compute capability. This spreadsheet is named CUDA_Occupancy_Calculator.xls and stored under the ../tools/ sub-directory of the installed Toolkit.'''
[Source--> https://scs.senecac.on.ca/~gpu610/pages/content/resou.html]
With the existing CALCULATE Kernel the CUDA Occupancy Calculator is providing the following statistics as shown below...
 
 
[[File:OriginalCalculator.png]]
 
The current CALCULATE Kernel is only utilizing 50% of the MultiProcessor as showb above. If the threads are switched from 32 to 512 we will achieve 100% occupancy as shown below.
 
[[File:100Calculator.png]]
212
edits

Navigation menu