CDOT Wiki β

Algo holics

No change in size, 02:45, 8 April 2019
Assignment 3
Here is a comparison between the naive and optimized kernels:
[[File:Examplekernel2.jpg]]
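Evidently, the new version gives some performance boost. However, each call to atomicAdd performs a read-modify-write on its target address in global memory: the old value is read, added to the passed value, and written back as one indivisible operation. Calls from different threads to the same address are therefore serialized, which keeps the speedup below what might otherwise be expected.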
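To illustrate the atomicAdd cost described above, here is a minimal sketch of one common way to reduce contention. It is not the kernel from this assignment: the integer sum, the names sumPerThreadAtomic, sumPerBlockAtomic and sumOnDevice, and the block size of 256 are all assumptions made for the example. The first kernel issues one atomicAdd per thread on a single address. The second reduces each block in shared memory first, so only one atomicAdd per block reaches global memory.

```cuda
#include <vector>
#include <cuda_runtime.h>

constexpr int kThreads = 256;  // assumed threads per block; must be a power of two

// Contended version: every thread issues an atomicAdd on the same global address.
__global__ void sumPerThreadAtomic(const int* in, int n, int* total) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) atomicAdd(total, in[i]);
}

// Reduced-contention version: tree reduction in shared memory,
// followed by a single atomicAdd per block.
__global__ void sumPerBlockAtomic(const int* in, int n, int* total) {
    __shared__ int partial[kThreads];
    int t = threadIdx.x;
    int i = blockIdx.x * blockDim.x + t;
    partial[t] = (i < n) ? in[i] : 0;  // pad out-of-range threads with 0
    __syncthreads();
    for (int stride = blockDim.x / 2; stride > 0; stride >>= 1) {
        if (t < stride) partial[t] += partial[t + stride];
        __syncthreads();
    }
    if (t == 0) atomicAdd(total, partial[0]);
}

// Hypothetical host helper: runs one of the two kernels and returns the sum.
// Assumes a non-empty input; CUDA error checking is omitted for brevity.
int sumOnDevice(const std::vector<int>& host, bool perBlock) {
    int n = static_cast<int>(host.size());
    int* dIn = nullptr;
    int* dTotal = nullptr;
    cudaMalloc(&dIn, n * sizeof(int));
    cudaMalloc(&dTotal, sizeof(int));
    cudaMemcpy(dIn, host.data(), n * sizeof(int), cudaMemcpyHostToDevice);
    cudaMemset(dTotal, 0, sizeof(int));
    int blocks = (n + kThreads - 1) / kThreads;
    if (perBlock) {
        sumPerBlockAtomic<<<blocks, kThreads>>>(dIn, n, dTotal);
    } else {
        sumPerThreadAtomic<<<blocks, kThreads>>>(dIn, n, dTotal);
    }
    int result = 0;
    // This blocking copy also waits for the kernel to finish.
    cudaMemcpy(&result, dTotal, sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(dIn);
    cudaFree(dTotal);
    return result;
}
```

Both kernels should return the same sum. Timing them on the same input (for example with cudaEvent timers) would show how much of the run time is due to atomic contention on a given GPU.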