1
edit
Changes
Top Solo
,→Potential Parallelization Candidates
Upon analyzing this function I discovered two possible areas where I could optimize the code using threads sent to the GPU. The first is a for loop which sets the attributes for 100 triangles in serial. This task can be done in parallel using 100 threads on the GPU.
<presource lang="cpp">
for (int k = 0; k<100; ++k) {
</presource>
=== Amdahls Law Calculations ===