K2
Below is a comparison of bitonic sort on the GPU, quicksort on the CPU, and bitonic sort on the CPU.
[[File:BitonicGraph.jpg|500px|thumb|left|Sorting algorithm comparison results for 2^24 elements]]

==Assignment3==
The host-to-device cudaMemcpy (HtoD) takes a long time. To speed it up, allocate the host buffer as pinned memory with cudaHostAlloc() instead of using pageable memory.

"It will allocate pinned memory which is always stored in RAM and can be accessed by GPU's DMA (direct memory access) directly"
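
To illustrate the difference, here is a minimal sketch that times one host-to-device copy from a pageable buffer (malloc) and one from a pinned buffer (cudaHostAlloc). It is not the code used for the assignment: the buffer size of 2^24 ints, the CHECK macro, and the timeCopy helper are assumptions made for this example, and the measured times will depend on the hardware.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper macro: abort with a message if a CUDA call fails.
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err = (call);                                     \
        if (err != cudaSuccess) {                                     \
            fprintf(stderr, "CUDA error: %s (line %d)\n",             \
                    cudaGetErrorString(err), __LINE__);               \
            exit(EXIT_FAILURE);                                       \
        }                                                             \
    } while (0)

// Hypothetical helper: time a single host-to-device copy in milliseconds.
static float timeCopy(int* d_dst, const int* h_src, size_t bytes) {
    cudaEvent_t start, stop;
    CHECK(cudaEventCreate(&start));
    CHECK(cudaEventCreate(&stop));

    CHECK(cudaEventRecord(start));
    CHECK(cudaMemcpy(d_dst, h_src, bytes, cudaMemcpyHostToDevice));
    CHECK(cudaEventRecord(stop));
    CHECK(cudaEventSynchronize(stop));

    float ms = 0.0f;
    CHECK(cudaEventElapsedTime(&ms, start, stop));
    CHECK(cudaEventDestroy(start));
    CHECK(cudaEventDestroy(stop));
    return ms;
}

int main() {
    const size_t n = 1 << 24;              // assumed element count (2^24)
    const size_t bytes = n * sizeof(int);

    int* d_data = nullptr;
    CHECK(cudaMalloc((void**)&d_data, bytes));

    // Pageable host memory: the driver must stage it before the DMA transfer.
    int* h_pageable = (int*)malloc(bytes);
    if (h_pageable == nullptr) {
        fprintf(stderr, "malloc failed\n");
        return EXIT_FAILURE;
    }

    // Pinned (page-locked) host memory: the GPU's DMA engine can read it directly.
    int* h_pinned = nullptr;
    CHECK(cudaHostAlloc((void**)&h_pinned, bytes, cudaHostAllocDefault));

    // Fill both buffers with the same pseudo-random input.
    for (size_t i = 0; i < n; ++i) {
        h_pageable[i] = rand();
        h_pinned[i] = h_pageable[i];
    }

    printf("pageable HtoD: %.3f ms\n", timeCopy(d_data, h_pageable, bytes));
    printf("pinned   HtoD: %.3f ms\n", timeCopy(d_data, h_pinned, bytes));

    free(h_pageable);
    CHECK(cudaFreeHost(h_pinned));         // pinned memory needs cudaFreeHost, not free
    CHECK(cudaFree(d_data));
    return 0;
}
```

Pinned memory cannot be paged out, so allocating very large pinned buffers reduces the memory available to the rest of the system. It is usually best to pin only the buffers that are actually transferred.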