49
edits
Changes
→Padding
To address this issue, add some padding.
For example, if you have a <code>4 x 19</code> array of floats, and your system has access to a 128-bit vector registers, then you should add 1 column to make the array <code>4 x 20</code> so that the number of columns is evenly divisible by the number of floats that can be loaded onto into a 128-bit vector register, which is 4 floats.
[[File:Memory-alignment-padding.png]]