Changes

Jump to: navigation, search

GPU621/Analyzing False Sharing

2 bytes added, 13:35, 4 December 2022
Use local variables for each thread
We could try increasing sizeOfNumbers to 1000000 as well, which would allow the program to process more data, thus compensating for the extra overhead of thread creation and scheduling.
[[File:newBlockOutput1000000(1new).jpg|400px]]<br />
Now we can already see the advantage of multi-threading. Even when the vector numbers reach the size of 1000000, the Thread block even runs faster than the Serial block.
118
edits

Navigation menu