Open main menu
CDOT Wiki
β
Search
Changes
← Older edit
UnknownX
5 bytes added
,
04:50, 13 April 2017
→
Assignment 2 - V1 Parallelization
CPU code:
The most expensive part in the program.
Main code on .cu:
1. Allocate memory on device.
2. run kunal. ntpb = 1024.
3. copy the key data out.
Kernel:
before:
for (int y = 0; y < N; ++y)
Zzha1
51
edits