GPU621/Analyzing False Sharing

No change in size, 14:01, 23 November 2022
In theory, the threaded block should run faster than the serial block on a multicore machine. The actual results, however, look like this:
[[File:exampleOutput1.jpg|400px]]<br />
Or
[[File:exampleOutput2.jpg|400px]]<br />
Or this
[[File:exampleOutput3.jpg|400px]]<br />
To our surprise, the serial block took much less time, no matter how many times we ran it. This seems to contradict what we know about parallel execution, but don't worry: the explanation is false sharing, which we have not covered yet.