100
edits
Changes
→Assignment 3
[[File:DPS915 Team7 Coalesced.PNG]]
In the kernel, the two calls to '''sin''' and '''cos''' were replaced by a single call to '''__sincosf''' which calculated both the sine and the cosine at the same time. This resulted in timing improvements as shown below. The timing for the kernel has gone down from 206 ms to 179 ms.
[[File:DPS915 Team7 Optimized Trig Functions.PNG]]