Changes

GPU621/Analyzing False Sharing

10 bytes added, 12:26, 4 December 2022
Solutions Of False Sharing
When we ran the program again, we got the following results:
[[File:newBlockOutput(1).jpg|400px]]<br />
[[File:newBlockOutput(2).jpg|400px]]<br />
[[File:newBlockOutput(3).jpg|400px]]<br />
[[File:newBlockOutput(4).jpg|400px]]<br />
[[File:newBlockOutput(5).jpg|400px]]<br />
We can see that the Local variable block is already much faster than the Thread block and takes almost the same time as the Serial block. Overall, however, the Serial block is still the least time-consuming, because the Local variable block still incurs extra overhead for thread creation and scheduling.
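The idea behind a local-variable approach can be sketched as follows. This is only a minimal illustration, not the code that produced the timings above: the function name `sumWithLocalAccumulators` and the use of `std::thread` are assumptions made for the example. Each thread accumulates into a variable on its own stack and writes to the shared array exactly once at the end, so neighbouring array slots that sit on the same cache line are not repeatedly invalidated.

```cpp
#include <algorithm>
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical helper (not from the original program): sums `data`
// using `numThreads` threads, each working on its own contiguous chunk.
long long sumWithLocalAccumulators(const std::vector<int>& data,
                                   unsigned numThreads) {
    if (numThreads == 0) numThreads = 1;

    // One result slot per thread. Adjacent slots may share a cache line,
    // which is why each thread touches its slot only once.
    std::vector<long long> partial(numThreads, 0);
    std::vector<std::thread> workers;

    // Ceiling division so every element is covered.
    const std::size_t chunk = (data.size() + numThreads - 1) / numThreads;

    for (unsigned t = 0; t < numThreads; ++t) {
        workers.emplace_back([&, t] {
            const std::size_t begin = std::min(data.size(), t * chunk);
            const std::size_t end = std::min(data.size(), begin + chunk);

            long long local = 0;  // thread-private accumulator on the stack
            for (std::size_t i = begin; i < end; ++i) {
                local += data[i];
            }
            partial[t] = local;   // single write to shared memory
        });
    }

    for (auto& w : workers) w.join();

    // Combine the per-thread results serially.
    long long total = 0;
    for (long long p : partial) total += p;
    return total;
}
```

Calling `sumWithLocalAccumulators(values, 4)` on a `std::vector<int>` returns the same sum as a plain serial loop. For small inputs the serial loop is likely to remain faster, since the threads still have to be created, scheduled, and joined.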