skill-tree:pe:2:2:b

Differences

This shows you the differences between two versions of the page.

skill-tree:pe:2:2:b [2020/07/14 00:47] luciana
skill-tree:pe:2:2:b [2021/09/13 11:57] (current) – external edit 127.0.0.1
Line 1:
 # PE2.2-B Profiling
 # Background
-Profiling is explained for the CPU level, where it can be supported by hardware performance counters and by sampling techniques.
-Sampling is used to see, by examining the program counter, what routines and source code lines of a program are responsible for which portions of the total runtime.
-Automatically adding trace code to a parallel program by so-called instrumentation to record its execution in a strict chronology is explained and the difference to profiling is emphasized.
-Similar techniques are explained for profiling the network level (e.g. based on InfiniBand counters and I/O server states).
+
+Profiling is explained for the CPU level, where it can be supported by hardware performance counters and by sampling techniques.
+
+Sampling is used to see, by examining the program counter, what routines and source code lines of a program are responsible for which portions of the total runtime.
+
+Automatically adding trace code to a parallel program by so-called instrumentation to record its execution in a strict chronology is explained and the difference to profiling is emphasized.
+
+Similar techniques are explained for profiling the network level (e.g. based on InfiniBand counters and I/O server states).
 
 # Aim
 # Outcomes
-  *  Detect performance issues and bottlenecks caused, for example, by inefficient programming, memory accesses, I/O operations, cache-misses, page-faults, and parallelization overheads
-  *  Use environment variables like $I_MPI_STATS to control the built-in performance analysis functionality in MPI
+  * Detect performance issues and bottlenecks caused, for example, by inefficient programming, memory accesses, I/O operations, cache-misses, page-faults, and parallelization overheads.
+  * Use environment variables like $I_MPI_STATS to control the built-in performance analysis functionality in MPI.
 
 # Subskills
+  * [[skill-tree:pe:2:2:2:b]]
   * [[skill-tree:pe:2:2:1:b]]
+  * [[skill-tree:pe:2:2:i]]
+  * [[skill-tree:pe:2:2:e]]
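
The sampling idea from the Background section can be illustrated with a minimal sketch. It is not part of the skill description and rests on several assumptions: instead of reading a hardware program counter, it periodically inspects the currently executing Python frame through `sys._current_frames()` (a CPython implementation detail), and the names `sample_thread`, `busy_work`, and `profile` are hypothetical. The share of samples that land in a function approximates that function's share of the total runtime.

```python
import collections
import sys
import threading
import time


def sample_thread(thread_id, counts, stop, interval=0.005):
    """Periodically record which function the target thread is executing."""
    while not stop.is_set():
        # CPython-specific: maps thread ids to their current stack frame.
        frame = sys._current_frames().get(thread_id)
        if frame is not None:
            counts[frame.f_code.co_name] += 1
        time.sleep(interval)


def busy_work(duration):
    """Hypothetical hot spot: spin for roughly `duration` seconds."""
    end = time.perf_counter() + duration
    total = 0
    while time.perf_counter() < end:
        total += 1
    return total


def profile(func, *args):
    """Run func(*args) while a background thread samples the calling thread."""
    counts = collections.Counter()
    stop = threading.Event()
    sampler = threading.Thread(
        target=sample_thread, args=(threading.get_ident(), counts, stop)
    )
    sampler.start()
    try:
        func(*args)
    finally:
        stop.set()
        sampler.join()
    return counts


if __name__ == "__main__":
    samples = profile(busy_work, 0.3)
    total = sum(samples.values())
    for name, hits in samples.most_common():
        print(f"{name}: {hits}/{total} samples")
```

Because only periodic snapshots are taken, the result is a statistical summary with no ordering of events. That is the contrast with instrumentation-based tracing, which records each event in chronological order.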
skill-tree/pe/2/2/b.1594680435.txt.gz · Last modified: 2020/07/14 00:47 by luciana