GPU621/Analyzing False Sharing

33 bytes added, 16:33, 24 November 2022
After running it, I found that the runtime with and without byte alignment differs very little and is stable from run to run, at around 200ms.
Unexpectedly, when I execute increment1() and increment2() serially, with no threads at all, the run takes only 16ms.[[File:serial.jpg|400px]]<br />
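The comparison described above can be reproduced with a small harness. The following is only a sketch, not the code that was measured here: the names Packed, Padded, bump, run_serial_ms and run_threaded_ms, the iteration count, and the 64-byte cache line size are all assumptions made for illustration. It times two counter loops once serially and once on two threads, for counters that share a cache line and for counters aligned onto separate lines.

```cpp
#include <chrono>
#include <cstdint>
#include <thread>

// Hypothetical iteration count; the value used for the measurements above is not shown.
constexpr std::uint64_t kIterations = 10'000'000;

// Two counters laid out back to back, so they will usually share one cache line.
struct Packed {
    volatile std::uint64_t a = 0;
    volatile std::uint64_t b = 0;
};

// The same counters, each aligned to an assumed 64-byte cache line.
struct Padded {
    alignas(64) volatile std::uint64_t a = 0;
    alignas(64) volatile std::uint64_t b = 0;
};

// Stand-in for increment1()/increment2(): volatile forces a real load and
// store on every iteration, so the compiler cannot collapse the loop.
inline void bump(volatile std::uint64_t& counter) {
    for (std::uint64_t i = 0; i < kIterations; ++i) {
        counter = counter + 1;
    }
}

// Runs both loops one after the other on the calling thread.
template <typename Counters>
double run_serial_ms(Counters& c) {
    const auto start = std::chrono::steady_clock::now();
    bump(c.a);
    bump(c.b);
    const auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(stop - start).count();
}

// Runs each loop on its own thread; the timing includes thread creation and join.
template <typename Counters>
double run_threaded_ms(Counters& c) {
    const auto start = std::chrono::steady_clock::now();
    std::thread t1([&c] { bump(c.a); });
    std::thread t2([&c] { bump(c.b); });
    t1.join();
    t2.join();
    const auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(stop - start).count();
}
```

Calling run_serial_ms and run_threaded_ms on a fresh Packed and a fresh Padded object gives the four timings to compare. The absolute numbers depend on the compiler, the optimization level and the CPU, so they will not necessarily match the figures reported here. Note that the threaded timing also includes the cost of starting and joining the threads, which is one thing to rule out when the serial run comes out faster.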