Use this tool to:
Evaluate profiling statistics and load balancing
Learn about communication patterns, parameters, and performance data
Identify communication hotspots
Decrease time to solution and increase application efficiency
Intel® Trace Analyzer and Collector is installed as part of Intel® Parallel Studio XE Cluster Edition.
Set up the environment by sourcing the psxevars.[c]sh script available at <install_dir>/parallel_studio_xe_2018.x.xxx/bin.
By default, <install_dir> is:
This sets the required environment variables for compilers, Intel® MPI Library, and Intel® Trace Analyzer and Collector, and you are ready to trace your applications.
For system requirements, see the product Release Notes.
Run your MPI application using the mpirun command with the -trace option to generate a trace file. For example:
$ mpirun -trace -n 4 ./myApp
A trace file .stf is generated, for example myApp.stf. For more details on tracing, see Intel® Trace Collector Help.
Open the generated .stf file in Intel Trace Analyzer:
$ traceanalyzer ./myApp.stf
Start analyzing your application using various charts and timelines. As a starting point, use the Summary Page to get the following information:
For detailed description of the available charts and diagrams, see Intel® Trace Analyzer Help.
Hover over a GUI element for short description, click for more details.
The Trace Map represents a miniature view of the MPI functions activity over time. It also enables you to zoom into relevant subsets of a trace file on all open charts.
The Event Timeline is the most common chart, which provides a graphical display of individual process activities over time. It enables you to get detailed impression of the program structure, as well as retrieve detailed event information.
Horizontal bars represent processes with functions called in them:
— MPI functions
— serial code
— OpenMP* regions
— messages between processes
— collective operations
Time scale for the currently open timelines. Shows the currently selected time interval.
The Function Profile provides an overview of all functions used in the application. It consists of four tabs:
The Message Profile shows statistics about point-to-point operations.
It categorizes messages by groupings in a matrix, and shows values of the selected attribute in each cell. By default, the Message Profile shows how long messages took to send for each pair of sender/receiver processes. Cell colors represent the message send time, relative to other process pairs, red being the longest.
The Collective Operations Profile is similar to the Message Profile, but it shows statistics about collective operations.
Like in the Message Profile, you can configure the columns, rows and attributes to show, and compare the attribute values for each pair. By default, this profile shows how long each operation took to complete for each process. Cell colors represent the operation execution time, relative to other processes/operations, red being the longest.
Use the toolbar to get easy access to the most frequently used Intel® Trace Analyzer functionality.
Interface and Displays
MPI Correctness Checking
Identify performance issues in your application using the Performance Assistant, which provides the following information:
Ideal Environment Simulation
|Online Training||The online training site is an excellent resource for learning the Intel® Trace Analyzer and Collector capabilities through various tutorials, videos, webinars, and more.|
|Release Notes||This page contains the most up-to-date information about the product, including: what's new, features, system requirements and known limitations.|
|Online Documentation||This page provides links to all available Intel® Trace Analyzer and Collector documents.|
| Intel's compilers may or may not optimize to the same degree for non-Intel
microprocessors for optimizations that are not unique to Intel microprocessors.
These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other
optimizations. Intel does not guarantee the availability, functionality, or
effectiveness of any optimization on microprocessors not manufactured by Intel.
Microprocessor-dependent optimizations in this product are intended for use
with Intel microprocessors. Certain optimizations not specific to Intel
microarchitecture are reserved for Intel microprocessors. Please refer to the
applicable product User and Reference Guides for more information regarding the
specific instruction sets covered by this notice.
Notice revision #20110804
Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.
*Other names and brands may be claimed as the property of others.
© Intel Corporation.