53
edits
Changes
→Assignment 3
[[File:512Optimized.jpg]]
===Optimized Image Resolution Results at 1024===
[[File:1024Optimized.jpg]]
===Optimized Image Resolution Results at 2048===
[[File:2048Optimized.jpg]]
===Optimized Image Resolution Results at 4096===
[[File:4096Optimized.jpg]]
Although there are more ways to optimize the code by better using available GPU resources, like using more available bandwidth, using more cores depending on compute capability, having better memcpy efficiency. For simplicity we decided to reduce memory access times as it was the main area where the kernel was spending most of its time as indicated by the nvvp profiles we collected.