skill-tree:use:2:b
                Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| skill-tree:use:2:b [2020/06/25 20:17] – kai_h | skill-tree:use:2:b [2025/04/16 18:30] (current) – external edit 127.0.0.1 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| - | # USE2-B Running of Parallel Programs | + | # USE2 Running of Parallel Programs | 
| - | # Background | + | |
| - | Parallel computers are operated differently than a normal PC, all users must share the system. Therefore, various operative procedures are in place. Users must understand these concepts and procedures to be able to use the available resources of a system to run a parallel application. Moreover, individual solutions can often be found in a specific system. | + | |
| - | # Aim | + | Parallel computers are operated differently than a normal PC, all users must share the system. | 
| - | * To enable practitioners to comprehend | + | Therefore, various operative | 
| - | * To use a workload manager like SLURM or TORQUE to allocate HPC resources (e.g. CPUs) and to submit a batch job | + | Users must understand these concepts | 
| - | * To use the system to run and monitor the execution of parallel | + | Moreover, individual solutions can often be found in a specific | 
| + | ## Learning Outcomes | ||
| - | # Outcomes | + | * Use a workload manager like SLURM or TORQUE to allocate HPC resources (e.g. CPUs) and to submit a batch job. | 
| - |  | + | * Write robust | 
| - | * use the command line interface | + | |
| - | * write robust job scripts, e.g. to simplify job submissions by the help of automated job chaining | + | ## Subskills | 
| - | * select the appropriate software environment | + | |
| - | * use a workload manager like SLURM or TORQUE to allocate HPC resources (e.g. CPUs) and to submit a batch job | + | * [[skill-tree: | 
| - | * Job submission and cancellation (SLURM) | + | * [[skill-tree: | 
| - | * sbatch | + | |
| - | * salloc | + | |
| - | * srun | + | |
| - | * Monitoring | + | |
| - | * sinfo | + | |
| - | * squeue | + | |
| - | * sstat | + | |
| - | * scontrol | + | |
| - | * Retrieving accounting information (SLURM) | + | |
| - | * sacct | + | |
| - | * sacctmgr | + | |
| - | * consider cost aspects | + | |
| - | * measure system performance as a basis for benchmarking a parallel program | + | |
| - | * benchmark a parallel program | + | |
| - | * tune a parallel program from the outside via runtime options | + | |
| - | * apply the workflow for tuning | + | |
| - | # Subskills | ||
| - | * [[skill-tree: | ||
| - | * [[skill-tree: | ||
| - | * [[skill-tree: | ||
| - | * [[skill-tree: | ||
| - | * [[skill-tree: | ||
| - | * [[skill-tree: | ||
| - | * [[skill-tree: | ||
skill-tree/use/2/b.1593109024.txt.gz · Last modified: 2020/06/25 20:17 by kai_h
                
                