96
edits
Changes
→Assignment 3
----
After realizing the cudaMemcpy and cudaMalloc takes quite a bit of time, we focused our efforts on optimizing it.
It was difficult to find a solution because the initial copy always takes a bit of time to set up.<br>
We tried using cudaMallocHost to see if we can allocate memory instead of using malloc. <br>
We changed one part of our code
<br/>
Here we can see is where an error occurs, and onward where we suspect that a memory leak causes the problem resulting in an error in pi calculation
'''Optimized time run results