skill-tree:k:4:b

Differences

This shows you the differences between two versions of the page.

skill-tree:k:4:b [2020/07/14 00:38] lucianaskill-tree:k:4:b [2020/07/19 11:30] – external edit 127.0.0.1
Line 1: Line 1:
 # K4-B Job Scheduling # K4-B Job Scheduling
 # Background # Background
-Parallel computers are operated differently than a normal PC, all users must share the system. Therefore, various operative procedures are in place. Users must understand these concepts and procedures to be able to use the available resources of a system to run a parallel application.+Parallel computers are operated differently than a normal PC, all users must share the system. 
 +Therefore, various operative procedures are in place. Users must understand these concepts and procedures to be able to use the available resources of a system to run a parallel application.
 A workload manager/job scheduler controls how available hardware resources are distributed among the user requests (jobs). A workload manager/job scheduler controls how available hardware resources are distributed among the user requests (jobs).
  
-Users of compute centers typically compete for the expensive HPC resources of cluster systems. +Users of computing centers typically compete for the expensive HPC resources of cluster systems. 
-HPC resources can be distinguished as  +HPC resources can be distinguished as 
-  * shared resources (e.g. a parallel file system that is often shared across all cluster nodes and therefore shared between all users), +  * Shared resources (e.g. a parallel file system that is often shared across all cluster nodes and therefore shared between all users), 
-  * not-shared resources (e.g. cluster nodes dedicated to a particular parallel program of an individual user).+  * Not-shared resources (e.g. cluster nodes dedicated to a particular parallel program of an individual user).
  
 The configuration of the cluster system matters as well: a cluster node can also be a resource that is shared between several users. The configuration of the cluster system matters as well: a cluster node can also be a resource that is shared between several users.
  
-A major aspect of job scheduling is to manage these resources in a way that users are treated fairly. +A major aspect of job scheduling is to manage these resources in a way that users are treated fairly.
 Accounting for users or user groups can additionally support this. Accounting for users or user groups can additionally support this.
  
 # Aim # Aim
   * To enable practitioners to comprehend and describe the basic architecture and concepts of resource allocation for an HPC system   * To enable practitioners to comprehend and describe the basic architecture and concepts of resource allocation for an HPC system
-  * To provide knowlegde about how workload managers control the unattended background execution of programs or jobs, respectively, by the help of job queues +  * To provide knowledge about how workload managers control the unattended background execution of programs or jobs, respectively, by the help of job queues 
-  * To provide knowlegde about typical scheduling principles (e.g. first come first served, shortest job first) to achieve objectives like minimizing the averaged elapsed program runtimes, and maximizing the utilization of the available HPC resources+  * To provide knowledge about typical scheduling principles (e.g. first come first served, shortest job first) to achieve objectives like minimizing the averaged elapsed program runtimes, and maximizing the utilization of the available HPC resources
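
The two scheduling principles named in the aims above can be compared with a small calculation. The following is a minimal sketch, not how any particular workload manager is implemented: it assumes a single resource that runs one job at a time, that all jobs are already queued at time zero, and that runtimes are known in advance. The function names and the job runtimes are hypothetical and chosen for illustration only.

```python
def mean_wait(runtimes):
    """Mean time jobs spend waiting when run back to back in the given order."""
    waited = 0.0  # sum of waiting times over all jobs
    clock = 0.0   # time at which the next job can start
    for runtime in runtimes:
        waited += clock   # this job waited until every earlier job finished
        clock += runtime
    return waited / len(runtimes)


def fcfs(runtimes):
    """First come, first served: keep the submission order."""
    return list(runtimes)


def sjf(runtimes):
    """Shortest job first: run the shortest queued job next."""
    return sorted(runtimes)


# Hypothetical runtimes in hours, listed in order of submission.
jobs = [8, 1, 3]

print("FCFS mean wait:", mean_wait(fcfs(jobs)))
print("SJF mean wait:", mean_wait(sjf(jobs)))
```

Under these assumptions, shortest job first gives a lower mean waiting time than first come, first served, because short jobs no longer queue behind the long one. The sketch ignores fairness, priorities, and jobs that arrive later, all of which a real scheduler has to handle.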
  
 # Outcomes # Outcomes
Line 49: Line 50:
   * [[skill-tree:k:4:1:b]]   * [[skill-tree:k:4:1:b]]
   * [[skill-tree:k:4:2:b]]   * [[skill-tree:k:4:2:b]]
 +  * [[skill-tree:k:4:i]]
 +  * [[skill-tree:k:4:e]]
skill-tree/k/4/b.txt · Last modified: 2020/07/19 19:44 by lucy