Open main menu

CDOT Wiki β

Changes

BarraCUDA Boiz

298 bytes added, 20:24, 13 April 2017
Assignment 3
=== Assignment 3 ===
 
For assignment 3, we optimized the kernels by allocating the correct amounts of grids and block for each kernel. Previously, we allocated 32 threads by 32 blocks for every kernel call even when it did not require it. After adjustments, we found significant improvements for many of the kernels.
Runtime of program:
31
edits