= Assignment 3 =
Under ProgressIn this assignment we decided to enhance memory access to a vital data point which decreased the run time of the render kernel by half. This effect is shown in the graph below: [[File:optimizedExcel.jpg]] We can see the difference of run times in the kernel from the Nvidia Visual Profiler===Image Resolution at 512===[[File:512Optimized.jpg]]===Image Resolution at 1024===[[File:1024Optimized.jpg]] ===Image Resolution at 2048===[[File:2048Optimized.jpg]] ===Image Resolution at 512===[[File:4096Optimized.jpg]] Although there are more ways to optimize the code by better using available GPU resources, like using more available bandwidth, using more cores depending on compute capability, having better memcpy efficiency. For simplicity we decided to reduce memory access times as it was the main area where the kernel was spending most of its time as indicated by the nvvp profiles we collected.