Changes

Algo holics

No change in size, 02:45, 8 April 2019

→‎Assignment 3

Here is a comparison between the naive and optimized kernel

[[File:~~Example~~kernel2.jpg]]

Evidently, there is some performance boost for the new version. However, each call to atomicAdd by a thread locks the global memory until the old value is read and added to the passed value. This deters faster execution as might be expected.

Ssdhillon20

57

edits

Changes

Algo holics

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

get involved with CDOT

courses

course projects

links

Tools