Open main menu

CDOT Wiki β

Changes

GPU621/Intel Advisor Assistant

38 bytes added, 15:23, 8 December 2021
Important Features of Intel Advisor
== Important Features of Intel Advisor ==
Below are few of the main features of the Intel Advisor:
 =Optimize Vectorization for Better Performance= 
Vectorization is one of the most important features of the Intel Advisor, it is an operation of Single instruction Multiple Data(SIMD) instructions on multiple data objects in parallel in single core of a CPU. This can very effectively increase performance by reducing loop overhead and making better use of math unit.
 
It does the following:
 
• Find loops that will benefit from better vectorization.
 
• Identify where it is safe to force compiler vectorization.
 
• Pinpoint memory-access issues that may cause slowdowns.
 
• Get actionable user code-centric guidance to improve vectorization efficiency.
 
//Screenshot if possible
=Model, Tune, and Test Multiple Threading Designs= 
The Threading Advisor tool is applied to the model and tests the performance of various multi-threading designs such as OpenMP, Threading Building Blocks (TBB), and Microsoft Task Parallel Library (TPL) without limiting the development of the project. The tool achieves this by helping you with prototyping, testing the project's scalability for larger systems, and optimizing faster. It will also help identify issues before implementing parallelization.
Flow Graph Analyzer – FGA is a
=Optimize for Memory and Compute= 
The analysis gives a realistic visual representation of application performance against hardware-imposed conditions, such as memory bandwidth and computer capacity.
 
With automated roofline analysis, you can:
 
• See performance headroom against hardware limitations
 
• Get insights into a practical optimization roadmap
 
• Visualize optimization progress
 
=Roofline Chart Data= 
The Roofline chart plots an application's achieved performance and arithmetic intensity against the machine's maximum achievable performance.
 
• Arithmetic intensity (x-axis) - measured in number of floating-point operations (FLOPs) or integer operations (INTOPs) per byte, based on the loop algorithm, and transferred between CPU/VPU and memory
 
• Performance (y-axis) - measured in billions of floating-point operations per second (GFLOPS) or billions of integer operations per second (GINTOPS)
 
=Efficiently Offload Your Code to GPUs= 
The Offload Advisor is an extended version of the Intel® Advisor, a code modernization, and performance estimation tool that supports OpenCL and SYCL/Data-Parallel C++ languages on CPU and GPU.
 
Benefits of using Offload Advisor:
 
• It will Estimate data-transfer costs and get guidance on how to optimize data transfer.
 
• Identify offload opportunities where it pays off the most.
=Create, Visualize, and Analyze Task and Dependency Computation Graphs= 
Flow Graph Analyzer (FGA) is a rapid visual prototyping environment for applications that can be expressed as flow graphs using Intel Threading Building Blocks (Intel TBB) applications.
 
• It helps construct, validate and model application design and performance before coding in Intel TBB.
 
• It provides insights into the parallel algorithm efficiency
 
• Apart from TBB, it helps visualize DPC++ asynchronous task graphs, provides insights to task scheduling inefficiencies and analyzes open MP task dependencies for better performance
 
== Example of How to Use Intel Advisor ==
67
edits