70
edits
Changes
→Result
==== Result ====
[[File:optimized.png|center|frame|GPU highlights. para-ghost-pre-co2, which implements Ghost Cell + Prefetch + Coaleased memory + logic change , is slightly faster than simpler Prefetch+Coaleased memory that uses Global Memory. Both methods are superior than calling the conditional-less kernel 1000 times over PCIe.]]
[[File:all.png|center|frame|Using GPU significantly improved Calculation Time over the CPU counterparts.]]