AI3.1 Large Language Models (LLMs)
This skill covers the structure, training, and deployment of large language models (LLMs) in HPC environments. It includes tokenization, transformer architectures, training dynamics, and considerations for inference scalability and memory usage.
Requirements
- External: Familiarity with basic deep learning concepts and NLP tasks
- Internal: None
Learning Outcomes
- Describe the transformer architecture and how it underpins most LLMs (see the attention sketch after this list).
- Explain tokenization strategies and their impact on model efficiency (see the tokenization sketch below).
- Identify memory and compute bottlenecks in LLM training and inference (see the memory-estimate sketch below).
- Compare distributed training strategies used for scaling LLMs across HPC resources (see the parallelism sketch below).
- Evaluate the performance and limitations of LLMs on different HPC configurations.
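
The attention sketch referenced in the first outcome: scaled dot-product attention is the core operation of the transformer. The NumPy snippet below is a minimal single-head sketch; the sequence length, head dimension, and random inputs are illustrative and not taken from any particular model.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                # (seq, seq) similarity scores
    scores -= scores.max(axis=-1, keepdims=True)   # subtract max for numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True) # row-wise softmax
    return weights @ V                             # weighted sum of value vectors

# Illustrative shapes: sequence of 8 tokens, head dimension 64.
rng = np.random.default_rng(0)
seq_len, d_k = 8, 64
Q = rng.standard_normal((seq_len, d_k))
K = rng.standard_normal((seq_len, d_k))
V = rng.standard_normal((seq_len, d_k))
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (8, 64)
```

Note that the score matrix has shape (seq_len, seq_len), which is why attention time and memory grow quadratically with sequence length; this is one of the bottlenecks named in the third outcome.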
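The tokenization sketch for the second outcome: subword tokenizers such as BPE trade vocabulary size against sequence length, and sequence length directly drives attention cost. The greedy longest-match tokenizer and the tiny vocabulary below are hypothetical and for illustration only; production tokenizers (e.g., BPE or SentencePiece) apply merge rules learned from a corpus.

```python
# Hypothetical toy vocabulary; real tokenizers learn subwords from data.
VOCAB = {"trans", "form", "er", "token", "ization", "s"} | set("abcdefghijklmnopqrstuvwxyz")

def greedy_tokenize(word, vocab):
    """Greedy longest-match subword tokenization (illustrative only)."""
    tokens, i = [], 0
    while i < len(word):
        # Take the longest substring starting at i that is in the vocabulary.
        for j in range(len(word), i, -1):
            if word[i:j] in vocab:
                tokens.append(word[i:j])
                i = j
                break
        else:
            tokens.append(word[i])  # fall back to a single character
            i += 1
    return tokens

print(greedy_tokenize("transformer", VOCAB))    # ['trans', 'form', 'er']
print(greedy_tokenize("tokenizations", VOCAB))  # ['token', 'ization', 's']
```

A larger vocabulary yields fewer tokens per input (shorter sequences, cheaper attention) but a larger embedding matrix; this trade-off is what "impact on model efficiency" refers to.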
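The memory-estimate sketch for the third outcome: a back-of-the-envelope calculation already shows why multi-billion-parameter models cannot be trained on a single GPU. The accounting below assumes mixed-precision Adam training, roughly 16 bytes per parameter (fp16 weights and gradients plus fp32 master weights and two fp32 optimizer moments, as in the ZeRO paper); the 7B parameter count is illustrative.

```python
def training_memory_gb(n_params, bytes_per_param=16):
    """Rough model-state memory for mixed-precision Adam training.

    16 bytes/param = 2 (fp16 weights) + 2 (fp16 gradients)
                   + 4 (fp32 master weights) + 4 + 4 (Adam moments).
    Excludes activations, which often dominate at long sequence lengths.
    """
    return n_params * bytes_per_param / 1e9

def inference_memory_gb(n_params, bytes_per_param=2):
    """Weights only, fp16; excludes the KV cache."""
    return n_params * bytes_per_param / 1e9

n = 7e9  # illustrative 7B-parameter model
print(f"training model states: ~{training_memory_gb(n):.0f} GB")   # ~112 GB
print(f"inference weights:     ~{inference_memory_gb(n):.0f} GB")  # ~14 GB
```

Roughly 112 GB of model states alone exceeds the memory of a single current GPU, which is why the distributed strategies in the next outcome exist.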
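The parallelism sketch for the fourth outcome: the main strategies differ in what they shard. Data parallelism replicates the model and splits batches; tensor parallelism splits individual weight matrices; pipeline parallelism splits layers across devices; ZeRO-style sharding partitions model states across data-parallel ranks. The calculator below is a simplified sketch of ideal per-GPU model-state memory under each strategy; real frameworks (e.g., Megatron-LM, DeepSpeed) combine strategies and incur communication and activation costs not modeled here.

```python
def per_gpu_model_state_gb(n_params, strategy, n_gpus, train_bytes=16):
    """Idealized per-GPU model-state memory in GB.

    Ignores activations, communication buffers, and load imbalance;
    assumes model states shrink proportionally when weights are split.
    """
    total = n_params * train_bytes / 1e9
    if strategy == "data_parallel":   # full replica on every GPU
        return total
    if strategy in ("zero3", "tensor_parallel", "pipeline_parallel"):
        return total / n_gpus         # states sharded or split evenly (ideal)
    raise ValueError(f"unknown strategy: {strategy}")

n, gpus = 7e9, 8  # illustrative: 7B parameters across 8 GPUs
for s in ("data_parallel", "zero3", "tensor_parallel", "pipeline_parallel"):
    print(f"{s:17s}: ~{per_gpu_model_state_gb(n, s, gpus):.0f} GB/GPU")
```

Although the sharded strategies look identical in this simplified model, they differ sharply in communication pattern: tensor parallelism requires fast intra-node links for per-layer all-reduces, while pipeline parallelism tolerates slower inter-node links at the cost of pipeline bubbles.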
Caution: All text is AI generated
