'''Assignment 2'''
The loop shown below is the hot spot of the previous program.
[[ImageFile:filename|200|400| gpu-pic2.jpg]]
In this assignment we restructured the program to make it easier to parallelize. We replaced the hot-spot for loop of the previous program with a kernel that executes the same task on the device.
[[ImageFile:filename|thumb|widthpx| gpu-pic3.jpg]]
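The general shape of this loop-to-kernel conversion can be sketched as follows. This is only an illustration, assuming CUDA: the loop body (<code>out[i] = a * x[i] + y[i]</code>) and the names <code>saxpy_host</code>, <code>saxpy_kernel</code> and <code>saxpy_device</code> are hypothetical stand-ins, not the code from our program. The point is that the loop counter disappears and each device thread computes its own index instead.

```cuda
#include <cstddef>
#include <cuda_runtime.h>

// Hypothetical stand-in for a hot-spot loop: one iteration per element.
void saxpy_host(int n, float a, const float* x, const float* y, float* out) {
    for (int i = 0; i < n; ++i)
        out[i] = a * x[i] + y[i];
}

// Kernel version: no loop, each thread handles the element at its own index.
__global__ void saxpy_kernel(int n, float a, const float* x, const float* y, float* out) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)  // guard: the grid may contain more threads than elements
        out[i] = a * x[i] + y[i];
}

// Host wrapper: copy inputs to the device, launch the kernel, copy the result back.
// Returns false if any CUDA call fails.
bool saxpy_device(int n, float a, const float* x, const float* y, float* out) {
    if (n <= 0) return true;  // nothing to do
    float *d_x = nullptr, *d_y = nullptr, *d_out = nullptr;
    const std::size_t bytes = static_cast<std::size_t>(n) * sizeof(float);
    bool ok = cudaMalloc(&d_x, bytes) == cudaSuccess
           && cudaMalloc(&d_y, bytes) == cudaSuccess
           && cudaMalloc(&d_out, bytes) == cudaSuccess
           && cudaMemcpy(d_x, x, bytes, cudaMemcpyHostToDevice) == cudaSuccess
           && cudaMemcpy(d_y, y, bytes, cudaMemcpyHostToDevice) == cudaSuccess;
    if (ok) {
        const int threads = 256;                         // threads per block (assumed value)
        const int blocks = (n + threads - 1) / threads;  // round up to cover all n elements
        saxpy_kernel<<<blocks, threads>>>(n, a, d_x, d_y, d_out);
        ok = cudaGetLastError() == cudaSuccess
          && cudaMemcpy(out, d_out, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    }
    // cudaFree accepts null pointers, so cleanup is safe after a partial failure.
    cudaFree(d_x);
    cudaFree(d_y);
    cudaFree(d_out);
    return ok;
}
```

Calling <code>saxpy_host</code> and <code>saxpy_device</code> on the same input and comparing the two output arrays is a simple way to check that the kernel still computes what the original loop did.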
'''Program execution'''
The following table and chart compare the CPU runtime with the GPU runtime.
[[ImageFile:filename|thumb|widthpx| gpu-pic4.jpg]]
[[ImageFile:filename|thumb|widthpx| gpu-pic5.jpg]]
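For reference, one common way to collect such runtimes is sketched below. It assumes CUDA and is not necessarily how the numbers above were measured: the CPU side is timed with <code>std::chrono</code>, the kernel with CUDA events. The kernel <code>touch</code> and the helpers <code>time_cpu_ms</code> and <code>time_gpu_ms</code> are hypothetical names used only for this illustration.

```cuda
#include <chrono>
#include <cuda_runtime.h>

// Placeholder kernel (hypothetical); substitute the kernel being measured.
__global__ void touch(float* data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] += 1.0f;
}

// Time any CPU callable in milliseconds using a monotonic wall clock.
template <typename F>
double time_cpu_ms(F&& work) {
    auto start = std::chrono::steady_clock::now();
    work();
    auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(stop - start).count();
}

// Time one launch of the placeholder kernel in milliseconds using CUDA events.
// d_data must point to at least n floats in device memory.
// Returns a negative value if a CUDA call fails.
float time_gpu_ms(float* d_data, int n) {
    cudaEvent_t start, stop;
    if (cudaEventCreate(&start) != cudaSuccess) return -1.0f;
    if (cudaEventCreate(&stop) != cudaSuccess) {
        cudaEventDestroy(start);
        return -1.0f;
    }
    const int threads = 256;                         // threads per block (assumed value)
    const int blocks = (n + threads - 1) / threads;  // round up to cover all n elements
    cudaEventRecord(start);
    touch<<<blocks, threads>>>(d_data, n);
    cudaEventRecord(stop);
    float ms = -1.0f;
    // Wait for the stop event so the kernel has finished before reading the timer.
    if (cudaEventSynchronize(stop) == cudaSuccess && cudaGetLastError() == cudaSuccess)
        cudaEventElapsedTime(&ms, start, stop);
    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    return ms;
}
```

Note that this sketch times only the kernel launch. Host-to-device and device-to-host copies are excluded, so a comparison that is meant to reflect total runtime would need to place the copies between the two events as well.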
'''Conclusion'''