60
edits
Changes
no edit summary
the image above shows the timings for each function
matmul_0 - represents serial version
matmul_1 - represents serial version with reverse logic
matmul_2 - uses cilk_for
matmul_3 - uses cilk_for and reducer hyperboject
matmul_4 - uses cilk_for, reducer and vectorization
===Locks & Waits===
[[File:Lock1.png]]
[[File:Lock3.png]]
==references==
https://software.intel.com/en-us/vtune-amplifier-help-locks-and-waits-analysis
https://software.intel.com/en-us/vtune-amplifier-help-hpc-performance-characterization-analysis https://software.intel.com/en-us/vtune-amplifier-help-general-exploration-analysisvtuneampxe_hotspots_win_c
https://software.intel.com/en-us/vtune-amplifier-help-memory-access-analysisvtuneampxe_locks_win_c