Open main menu

CDOT Wiki β

Changes

A-Team

351 bytes added, 00:50, 8 April 2019
Final Profile
===Dynamic Parallelism===
Dynamic Parallelism in CUDA allows for the support of kernels to create and synchronize new nested kernels. Additionally, for our use case it also allows us to spend more time on the device to process information quickly without constant cudaMemcpy() or cudaMalloc() calls.
{| class="wikitable mw-collapsible mw-collapsed"
===Final Iteration===
{| class="wikitable mw-collapsible mw-collapsed"
! Train_kernelGPU code
|-
|
</syntaxhighlight>
|}
===Final Profile===
This final profile is only of 20 iterations as we had errors occur beyond 20 iterations, likely due to naive coding and bad coding practice.
[[File:nnfinalprofile.jpg]]
===Compiling===
113
edits