Changes

Jump to: navigation, search

Kernal Blas

8 bytes added, 09:10, 4 April 2018
Assignment 2
[[File:Prof.PNG]] <br>
Profiling the code shows that '''memcpycudaMalloc''' takes up most of the time spent. Even when <br>
there are 10 iterations, the time remains at 300 milliseconds. <br>
As the iteration passes 25 million, we have a bit of memory leak which results in inaccurate results. <br><br>
In order to optimize the code, we must find a way reduce the time memcpy cudaMalloc takes.<br>
=== Assignment 3 ===
96
edits

Navigation menu