→Assignment 2
http://i.imgur.com/9AtFQ48.png
We saw the GPU smash through 1000 iterations in 22 milliseconds. That's over 10,000 times faster than the CPU! Clearly, image processing begs to be worked on by parallel processors. The massive throughput of the 1024 CUDA cores, which can operate on thousands of pixels at the same time, beat the CPU without breaking a sweat. Here is the Nsight performance analysis:
http://i.imgur.com/H9P0pWX.png
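To make the "thousands of pixels at the same time" idea concrete, here is a minimal sketch of the one-thread-per-pixel pattern with event-based timing over 1000 iterations. It is not the assignment's actual kernel: the `invertPixels` operation, the 1024x1024 image size, and the 16x16 block shape are all assumptions chosen for illustration, and the time it reports will differ from the 22 ms measured above.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical per-pixel operation (assumption): invert an 8-bit grayscale image.
// Each thread handles exactly one pixel.
__global__ void invertPixels(unsigned char* img, int width, int height)
{
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    int y = blockIdx.y * blockDim.y + threadIdx.y;
    if (x < width && y < height) {          // guard threads past the image edge
        int i = y * width + x;
        img[i] = 255 - img[i];
    }
}

int main()
{
    // Image size and iteration count are assumed values for this sketch.
    const int width = 1024, height = 1024, iterations = 1000;
    const size_t bytes = static_cast<size_t>(width) * height;

    std::vector<unsigned char> host(bytes, 10);
    unsigned char* dev = nullptr;
    if (cudaMalloc(&dev, bytes) != cudaSuccess) {
        fprintf(stderr, "cudaMalloc failed\n");
        return 1;
    }
    cudaMemcpy(dev, host.data(), bytes, cudaMemcpyHostToDevice);

    // Enough 16x16 blocks to cover every pixel.
    dim3 block(16, 16);
    dim3 grid((width + block.x - 1) / block.x,
              (height + block.y - 1) / block.y);

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    // Time all iterations on the GPU, excluding the host/device copies.
    cudaEventRecord(start);
    for (int i = 0; i < iterations; ++i)
        invertPixels<<<grid, block>>>(dev, width, height);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);
    printf("%d iterations took %.3f ms\n", iterations, ms);

    // An even number of inversions restores the original pixel value.
    cudaMemcpy(host.data(), dev, bytes, cudaMemcpyDeviceToHost);
    printf("pixel 0 = %d\n", host[0]);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(dev);
    return 0;
}
```

Timing with CUDA events measures only the kernel work on the device, which is roughly what a profiler timeline would show for the kernel launches; including the memory copies in the timed region would give a larger figure.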