Open main menu

CDOT Wiki β

Changes

Studyapplocator

122 bytes added, 14:33, 22 April 2018
Presentation
for (unsigned x = 0; x < IMG_RES; ++x) {
int k = x + y * IMG_RES;
float xx xxPoints = (2 * ((x + 0.5) * iwidth) - 1) * viewangle * aspectratio; float yy yyPoints = (1 - 2 * ((y + 0.5) * iheight)) * viewangle; Vec3f raydirrayDirection, rayorigrayOrigin; raydirrayDirection.init(xxxxPoints, yyyyPoints, -1); raydirrayDirection.normalize(); rayorigrayOrigin.init(0);  // Begin tracing // trace(rayorigrayOrigin, raydirrayDirection, 0, pixel, sphere, k);
}
}
Although there are more ways to optimize the code by better using available GPU resources, like using more available bandwidth, using more cores depending on compute capability, having better memcpy efficiency. For simplicity we decided to reduce memory access times as it was the main area where the kernel was spending most of its time as indicated by the nvvp profiles we collected.
 
= Presentation =
[[File:Presentation.pdf]]
53
edits