==Assignment3==
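While working on the project, we discovered that the cudaMemcpy() transfers from host to device (HtoD) and from device to host (DtoH) take a long time. To reduce this cost, we allocated the host buffers with cudaHostAlloc():
"cudaHostAlloc() will allocate pinned memory, which is always stored in RAM and can be accessed directly by the GPU's DMA (direct memory access)."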
[[File:Resultone.png|500px|thumb|left|pinned memory vs pageable memory]]
In our test, transfers using pinned memory were 2 times faster than transfers using pageable memory.
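A comparison of this kind can be set up roughly as follows. This is a minimal sketch, not the project code: the helper name timeHtoD, the CHECK macro, and the 64 MiB buffer size are illustrative assumptions. It times a single host-to-device cudaMemcpy() with CUDA events, once from a malloc() buffer (pageable) and once from a cudaHostAlloc() buffer (pinned).

```cuda
#include <cstdio>
#include <cstdlib>
#include <cstring>
#include <cuda_runtime.h>

// Abort with a message if a CUDA runtime call fails (illustrative helper).
#define CHECK(call)                                                 \
    do {                                                            \
        cudaError_t err_ = (call);                                  \
        if (err_ != cudaSuccess) {                                  \
            std::fprintf(stderr, "CUDA error: %s (line %d)\n",      \
                         cudaGetErrorString(err_), __LINE__);       \
            std::exit(EXIT_FAILURE);                                \
        }                                                           \
    } while (0)

// Time one host-to-device copy of `bytes` bytes, in milliseconds.
// pinned == true  -> host buffer from cudaHostAlloc() (page-locked)
// pinned == false -> host buffer from malloc() (pageable)
float timeHtoD(bool pinned, size_t bytes) {
    void* h = nullptr;
    void* d = nullptr;

    if (pinned) {
        CHECK(cudaHostAlloc(&h, bytes, cudaHostAllocDefault));
    } else {
        h = std::malloc(bytes);
        if (h == nullptr) {
            std::fprintf(stderr, "malloc failed\n");
            std::exit(EXIT_FAILURE);
        }
    }
    std::memset(h, 0, bytes);          // touch the buffer before timing
    CHECK(cudaMalloc(&d, bytes));

    cudaEvent_t start, stop;
    CHECK(cudaEventCreate(&start));
    CHECK(cudaEventCreate(&stop));

    CHECK(cudaEventRecord(start));
    CHECK(cudaMemcpy(d, h, bytes, cudaMemcpyHostToDevice));
    CHECK(cudaEventRecord(stop));
    CHECK(cudaEventSynchronize(stop)); // wait until the copy has finished

    float ms = 0.0f;
    CHECK(cudaEventElapsedTime(&ms, start, stop));

    CHECK(cudaEventDestroy(start));
    CHECK(cudaEventDestroy(stop));
    CHECK(cudaFree(d));
    if (pinned) {
        CHECK(cudaFreeHost(h));        // pinned memory needs cudaFreeHost()
    } else {
        std::free(h);
    }
    return ms;
}

int main() {
    const size_t bytes = 64u << 20;    // 64 MiB, an arbitrary test size
    CHECK(cudaFree(0));                // create the CUDA context before timing

    float pageableMs = timeHtoD(false, bytes);
    float pinnedMs   = timeHtoD(true, bytes);

    std::printf("pageable: %.3f ms\n", pageableMs);
    std::printf("pinned:   %.3f ms\n", pinnedMs);
    return 0;
}
```

The measured ratio depends on the GPU, the bus, and the transfer size, so the numbers from this sketch will not necessarily match the figure above. Pinned memory also reduces the amount of RAM the operating system can page out, so it is usually reserved for buffers that are actually transferred.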