CDOT Wiki β

Studyapplocator

44 bytes added, 14:33, 22 April 2018
Presentation
There are other ways to optimize the code by making better use of the available GPU resources, such as using more of the available memory bandwidth, using more cores depending on the compute capability, and improving memcpy efficiency. For simplicity, we decided to focus on reducing memory access times, because the nvvp profiles we collected showed that this was where the kernel spent most of its time.
 
= Presentation =
[[File:Presentation.pdf]]