Open main menu

CDOT Wiki β

Changes

GPU621/Intel Advisor

241 bytes added, 15:27, 23 November 2018
no edit summary
__m128i prod = _mm_unpacklo_epi64(prod01, prod23); // (ab3,ab2,ab1,ab0)
</source>
 
Code sample was taken from this StackOverflow thread: [https://stackoverflow.com/questions/17264399/fastest-way-to-multiply-two-vectors-of-32bit-integers-in-c-with-sse Fastest way to multiply two vectors of 32bit integers in C++, with SSE]
Here is a link to an interactive guide to Intel Intrinsics: [https://software.intel.com/sites/landingpage/IntrinsicsGuide/#techs=SSE,SSE2,SSE3,SSSE3,SSE4_1,SSE4_2 Intel Intrinsics SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2]
49
edits