Changes

Jump to: navigation, search

AAA Adrina Arsa Andriy

639 bytes added, 14:26, 4 December 2014
Assignment 3
=== Assignment 3 ===
For or assignment 3 we did a few things to speed up the program, and we were able to observe an approximate speed up of around 50%.
 
[[Image:Hang man graph.png|thumb|800px|center]]
To observe this speed up we removed thread divergence from the kernels, and we removed some unnecessary memory copies.
[[Image:Nvidia Occupancy Calculator on Code.jpg|thumb|800px|center]]
In the previous version we dynamically found the number of threads per block, we could not dynamically use the information in this version due to the fact that shared memory was used. On the school lab computers the NBPT was 1024.
 
'''Real World Application'''
To make the application more "real world" friendly, we were able to make test data load form a large dictionary file. This makes it so you can search for real words instead of gibberish.
 
'''What Would We Do Different?'''
We would have spent more time on our A1s. When we picked out A1 programs we tried to find programs that were cool, and had unique uses. We profiled the programs without taking an in depth look at the code base, and when it came to picking a topic for A2, we were stuck with only one program, since the other two were much too complex.

Navigation menu