1
edit
Changes
→Chadd's Findings
==== Chadd's Findings ====
Profiling the Diffusion Equation I noticed that the majority of the time is spent in the Evolve Time Step Evolvetimestep function.
Using my home computer with a GTX 560 ti Nvidia graphics card I ran a matrix 9000x9000 10 times. I have the runtime results in the chart below.
[[Image:Runtime.png|400px]]
I use 32 threads per block size in my paralellization of the nested for loop found in the Evolvetimestep function.