skill-tree:k:4:b
Differences
This shows you the differences between two versions of the page.
skill-tree:k:4:b [2020/07/14 00:38] – luciana | skill-tree:k:4:b [2020/07/19 19:44] (current) – lucy | ||
---|---|---|---|
Line 1: | Line 1: | ||
# K4-B Job Scheduling | # K4-B Job Scheduling | ||
# Background | # Background | ||
- | Parallel computers are operated differently than a normal PC, all users must share the system. Therefore, various operative procedures are in place. Users must understand these concepts and procedures to be able to use the available resources of a system to run a parallel application. | + | Parallel computers are operated differently from a normal PC because all users must share the system. |
+ | Therefore, various operating procedures are in place. Users must understand these concepts and procedures to use the available resources of a system to run a parallel application. | ||
A workload manager/job scheduler controls how available hardware resources are distributed among the user requests (jobs). | A workload manager/job scheduler controls how available hardware resources are distributed among the user requests (jobs). | ||
- | Users of compute | + | Users of computing |
- | HPC resources can be distinguished as | + | HPC resources can be distinguished as |
- | * shared | + | * Shared |
- | * not-shared resources (e.g. cluster nodes dedicated to a particular parallel program of an individual user). | + | * Not-shared resources (e.g. cluster nodes dedicated to a particular parallel program of an individual user). |
The configuration of the cluster system matters as well: a cluster node can also be a resource that is shared between several users. | The configuration of the cluster system matters as well: a cluster node can also be a resource that is shared between several users. | ||
- | A major aspect of job scheduling is to manage these resources in a way that users are treated fairly. | + | A major aspect of job scheduling is to manage these resources in a way that users are treated fairly. |
Accounting for users or user groups can additionally support this. | Accounting for users or user groups can additionally support this. | ||
# Aim | # Aim | ||
- | * To enable practitioners to comprehend and describe the basic architecture and concepts of resource allocation for an HPC system | + | * To enable practitioners to comprehend and describe the basic architecture and concepts of resource allocation for an HPC system. |
- | * To provide | + | * To provide |
- | * To provide | + | * To provide |
# Outcomes | # Outcomes | ||
- | * Comprehend the differences between **Batch Systems** and **Time Sharing Systems** | + | * Comprehend the differences between **Batch Systems** and **Time-Sharing Systems**. |
- | * Explain the concepts and procedures for resource allocation and job execution in an HPC environment | + | * Explain the concepts and procedures for resource allocation and job execution in an HPC environment. |
- | * Run interactive jobs and batch jobs | + | * Run interactive jobs and batch jobs. |
- | * Comprehend and describe the expected behavior of job scripts | + | * Comprehend and describe the expected behavior of job scripts. |
- | * Change provided job scripts and embed them into shell scripts to run a variety of parallel applications | + | * Change provided job scripts and embed them into shell scripts to run a variety of parallel applications. |
- | * Analyze the output generated from a job scheduler and describe the cause of typically generated errors | + | * Analyze the output generated from a job scheduler and describe the cause of typically generated errors. |
- | * Comprehend accounting principles (billing for the jobs) | + | * Comprehend accounting principles (billing for the jobs). |
- | * Comprehend the set of terms for performance criteria like | + | * Comprehend the set of terms for performance criteria like: |
- | * Resource Utilization | + | * Resource Utilization. |
- | * Throughput | + | * Throughput. |
- | * Waiting Time | + | * Waiting Time. |
- | * Execution Time | + | * Execution Time. |
- | * Turnaround Time | + | * Turnaround Time. |
- | * Comprehend scheduling strategies that increase productivity | + | * Comprehend scheduling strategies that increase productivity. |
- | * Comprehend that typical goals of job scheduling are | + | * Comprehend that typical goals of job scheduling are: |
- | * Maximization of resource utilization | + | * Maximization of resource utilization. |
- | * Maximization of throughput | + | * Maximization of throughput. |
- | * Minimization of waiting time | + | * Minimization of waiting time. |
- | * Minimization of turnaround time | + | * Minimization of turnaround time. |
- | * Comprehend that there is a variety of scheduling algorithms from rather simple to more complex like | + | * Comprehend that there is a variety of scheduling algorithms, ranging from rather simple to more complex, such as: |
- | * First-Come-First-Served (FCFS) | + | * First-Come-First-Served (FCFS). |
- | * Shortest-Job-First (SJF) | + | * Shortest-Job-First (SJF). |
- | * Priority-based | + | * Priority-based. |
- | * Fair-Share | + | * Fair-Share. |
- | * Backfilling | + | * Backfilling. |
# Subskills | # Subskills | ||
* [[skill-tree: | * [[skill-tree: | ||
* [[skill-tree: | * [[skill-tree: | ||
+ | * [[skill-tree: | ||
+ | * [[skill-tree: |
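The performance criteria and scheduling algorithms named under Outcomes can be illustrated with a small simulation. The following is a minimal sketch, not part of the original skill description: it assumes a single resource that runs one job at a time without preemption, and the names `Job`, `simulate`, and `mean` as well as the three example jobs are hypothetical. It compares First-Come-First-Served (FCFS) with Shortest-Job-First (SJF) in terms of waiting time, execution time, and turnaround time.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    """Hypothetical job description (all times in arbitrary units)."""
    name: str
    submit: int   # time at which the job enters the queue
    runtime: int  # execution time once the job has started


def simulate(jobs, policy):
    """Run jobs one at a time on a single resource (assumption: no
    preemption, no parallelism) and return per-job metrics."""
    pending = sorted(jobs, key=lambda j: j.submit)
    clock = 0
    results = {}
    while pending:
        # Jobs that have already been submitted and are waiting in the queue.
        ready = [j for j in pending if j.submit <= clock]
        if not ready:
            # The resource idles until the next job is submitted.
            clock = pending[0].submit
            continue
        if policy == "fcfs":
            job = min(ready, key=lambda j: j.submit)   # earliest submission
        elif policy == "sjf":
            job = min(ready, key=lambda j: j.runtime)  # shortest runtime
        else:
            raise ValueError(f"unknown policy: {policy}")
        pending.remove(job)
        start = clock
        end = start + job.runtime
        results[job.name] = {
            "waiting": start - job.submit,     # time spent in the queue
            "execution": job.runtime,          # time spent running
            "turnaround": end - job.submit,    # waiting + execution
        }
        clock = end
    return results


def mean(results, key):
    """Average one metric over all jobs."""
    return sum(r[key] for r in results.values()) / len(results)


if __name__ == "__main__":
    # Hypothetical workload: one long job followed by two shorter ones.
    jobs = [Job("A", 0, 8), Job("B", 1, 4), Job("C", 2, 1)]
    for policy in ("fcfs", "sjf"):
        res = simulate(jobs, policy)
        print(policy, "mean waiting:", round(mean(res, "waiting"), 2),
              "mean turnaround:", round(mean(res, "turnaround"), 2))
```

For this example workload, SJF lowers the mean waiting time from about 5.67 to about 4.67 time units and the mean turnaround time from 10.0 to 9.0, because the short job C no longer waits behind job B. Real workload managers combine such policies with priorities, fair-share accounting, and backfilling, which this sketch does not model.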
skill-tree/k/4/b.txt · Last modified: 2020/07/19 19:44 by lucy