CDOT Wiki β

Changes

GPUSquad

208 bytes removed, 11:22, 7 April 2018
Idea 2 - LZW Compression
</source>
The hotspot is clearly the double for-loop over m and n in the Jacobi iteration code of the dojacobi() function. I believe these matrix calculations could be parallelized for improved performance. Note that the outer for-loop enclosing the double loop runs a constant number of times, iters, so it does not grow with the problem size. The total work is O(iters * m * n), or O(iters * n^2) for a square grid, which is still O(n^2), not O(n^3).
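To illustrate why the double loop looks like a good candidate for parallelization, here is a minimal sketch of the loop structure. The actual body of dojacobi() is not reproduced here, so the 4-neighbour averaging stencil and the names jacobi_sweep, run_jacobi and Grid are assumptions made for illustration only. The point is that each cell of the new grid is computed purely from the old grid, so the (i, j) iterations within one sweep do not depend on each other.

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical stand-in for the grid used by dojacobi(); not the original type.
using Grid = std::vector<std::vector<double>>;

// One Jacobi sweep over the interior cells of an m x n grid: O(m * n) work.
// Assumed stencil: average of the four neighbours. Every new_grid[i][j]
// reads only old_grid, so the iterations are independent of one another.
void jacobi_sweep(const Grid& old_grid, Grid& new_grid) {
    const std::size_t m = old_grid.size();
    if (m < 3) return;  // no interior cells to update
    const std::size_t n = old_grid[0].size();
    for (std::size_t i = 1; i + 1 < m; ++i) {
        for (std::size_t j = 1; j + 1 < n; ++j) {
            new_grid[i][j] = 0.25 * (old_grid[i - 1][j] + old_grid[i + 1][j] +
                                     old_grid[i][j - 1] + old_grid[i][j + 1]);
        }
    }
}

// The outer loop runs a fixed number of times (iters), so the total cost is
// O(iters * m * n). Sweeps must run in order; only the inner double loop
// is a candidate for parallel execution.
Grid run_jacobi(Grid grid, int iters) {
    Grid next = grid;  // copy so the boundary values carry over unchanged
    for (int k = 0; k < iters; ++k) {
        jacobi_sweep(grid, next);
        std::swap(grid, next);
    }
    return grid;
}
```

If the real loop has this shape, a GPU version could plausibly assign one thread per (i, j) cell for each sweep while keeping the iters loop on the host, since each sweep depends on the result of the previous one.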
==== Idea 2 - LZW Compression ====