1
edit
Changes
Top Solo
,→Profiling Results
=== Profiling Results ===
The entirety of the GPU code executes in 0.15 - 0.16 seconds. I profiled the application using nvvp and NSight on Visual Studio 2010.=== Issues Encountered === Makefile does not work on the Windows platform. Tried adding the -lcurand linker flag but to no avail. The code does compile cleanly without error on Visual Studio 2010 IDE. Coding issues: Stackoverflow Questions I made:[http://stackoverflow.com/questions/15245723/using-arrayij-in-a-cuda-kernel-memcpy-call Using 2D Arrays] [http://stackoverflow.com/questions/15238009/concurrently-initializing-many-arrays-with-random-numbers-using-curand-and-cuda Memory Issues] With the help of the open-source community I was able to solve my problems and gain a better understanding of CUDA and CURAND.
== Assignment 3 ==