==Assignment2==
Let's see how this algorithm sorts its input, along with an explanation of several terms.
[[File:BitonicSort1.png]]
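To make the compare-and-swap pattern concrete, here is a minimal CPU sketch of an iterative bitonic sorting network in C++. It is an illustration only, not the code measured on this page: the function name <code>bitonicSort</code> is hypothetical, and the sketch assumes the input length is a power of two. On a GPU, the innermost loop over <code>i</code> is the part that would typically be spread across threads, since every comparison within one pass is independent.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Sorts `data` in ascending order using an iterative bitonic sorting network.
// Assumption: data.size() is zero or a power of two.
void bitonicSort(std::vector<int>& data) {
    const std::size_t n = data.size();
    assert((n & (n - 1)) == 0 && "size must be zero or a power of two");

    // k is the length of the sorted runs produced by the current stage.
    for (std::size_t k = 2; k <= n; k <<= 1) {
        // j is the distance between the two elements compared in this pass.
        for (std::size_t j = k >> 1; j > 0; j >>= 1) {
            // Every iteration of this loop is independent of the others.
            for (std::size_t i = 0; i < n; ++i) {
                const std::size_t partner = i ^ j;
                if (partner <= i) {
                    continue;  // each pair is handled once, by its lower index
                }
                // Blocks of length k alternate between ascending and descending,
                // which keeps the sequence bitonic for the next stage.
                const bool ascending = (i & k) == 0;
                const bool outOfOrder = ascending ? (data[i] > data[partner])
                                                  : (data[i] < data[partner]);
                if (outOfOrder) {
                    std::swap(data[i], data[partner]);
                }
            }
        }
    }
}
```

Calling <code>bitonicSort</code> on a vector holding 3, 7, 4, 8, 6, 2, 1, 5 leaves it in ascending order. Inputs whose length is not a power of two would need padding first, which this sketch does not handle.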
Below is a comparison of bitonic sort on the GPU, quick sort on the CPU, and bitonic sort on the CPU.
[[File:BitonicGraph.jpg|500px|thumb|left|sorting algorithms comparison result]]
==Assignment3==
The host-to-device (HtoD) cudaMemcpy takes a long time.
To reduce this cost, use pinned memory instead of pageable memory by allocating the host buffer with cudaHostAlloc().
"It will allocate pinned memory which is always stored in RAM and can be accessed by GPU's DMA(direct memory access) directly"
[[File:Resultone.png|500px|thumb|left|pinned memory vs pageable memory]]