Changes

Jump to: navigation, search

Team Lion F2017

87 bytes added, 12:53, 5 January 2018
no edit summary
the image above shows the timings for each function
 
matmul_0 - represents serial version
 
matmul_1 - represents serial version with reverse logic
 
matmul_2 - uses cilk_for
 
matmul_3 - uses cilk_for and reducer hyperboject
matmul_4 - uses cilk_for, reducer and vectorization
 
 
===Advanced hotspot analysis===
matmul_4 - uses cilk_for, reducer and vectorization
===Locks & Waits===
===HPC Performance Characterization===* Best for locating causes of low concurrency, such as heavily used locks and large critical sections.* Locks are when threads are waiting too long on synchronization objects.* Uses user-mode sampling and tracing collection to identify processes. ==Microarchitecture==* This analysis shows time spent waiting on synchronizations.
===General Exploration===
[[File:Lock1.png]]
===Memory Access===[[File:Lock2.png]]
[[File:Lock3.png]]
==references==
https://software.intel.com/en-us/vtune-amplifier-help-locks-and-waits-analysis
https://software.intel.com/en-us/vtune-amplifier-help-hpc-performance-characterization-analysis https://software.intel.com/en-us/vtune-amplifier-help-general-exploration-analysisvtuneampxe_hotspots_win_c
https://software.intel.com/en-us/vtune-amplifier-help-memory-access-analysisvtuneampxe_locks_win_c
60
edits

Navigation menu