49
edits
Changes
→What was done
=== What was done ===
At first the power function used was switched out with __pow in the kernel as the traditional pow function is more heavy of a function.Afterwards the kernel was upgrade upgraded to implement grid and strides. Other implementations were made to transfer all the data necessary for calculations all at once and perform all calculations on the device side , but due to time constraints and the complication of the project we were unable to fully implement these changes. The code is however left (commented) in the included project download.
=== Optimized Kernel ===