Open main menu

CDOT Wiki β

Changes

Kernal Blas

7 bytes added, 09:04, 4 April 2018
Assignment 3
----
After realizing the cudaMemcpy was took and cudaMalloc takes quite a bit of time, we focused our efforts on optimizing it.It was difficult to find a solution because the initial copy always takes a bit of timeto set up.<br>
We tried using cudaMallocHost to see if we can allocate memory instead of using malloc. <br>
cadaMallocHost cudaMallocHost will allocate pinned memory which is stored in RAM and can be accessed by the GPU's DMA directly.
We changed one part of our code
<br/>
Here is we can see where an error occurs and onward where , we suspect that a memory leak causes the problem resulting in an error in pi calculation
'''Optimized time run results
96
edits