Opleiding: Building Large Language Models

The course Building Large Language Models from SpiralTrain teaches you how to design, train, fine-tune, and deploy transformer-based LLMs using PyTorch and modern tooling.

LLM Intro

The course starts by explaining what LLMs are, where they’re used, and the lifecycle of building vs. using them. We introduce the Transformer/GPT architecture, how models learn from large datasets, and when to use classic QA versus RAG.

Working with Text Data

You’ll move from raw text to model-ready tensors: tokenization (e.g., BPE), token→ID mapping, special/context tokens, and sliding-window sampling. We cover embeddings and positional encodings while handling unknown words and basic sentence structure.

Attentions Mechanism

This module demystifies self-attention for long-sequence modeling: queries, keys, values, and causal masking to hide future tokens. We add positional encoding, multi-head attention, and stacked layers to capture dependencies across different parts of the input.

Pytorch Deep Learning

This module explains PyTorch fundamentals—tensors, core operations, and training loops—with the tooling to measure model quality. We cover feature scaling/normalization (including categorical features), activation and loss functions, and backpropagation.

Neural Networks

Next the course proceeds with building MLPs and CNNs in PyTorch while choosing appropriate activations and losses and implementing backprop. We touch NLP-specific preprocessing and walk through end-to-end binary and multi-class classification.

GPT from scratch

Next you will implement a minimal GPT with layer normalization, residual connections, and attention + feed-forward (GELU) blocks.

Pretraining

Then pretrain the LLM on unlabeled text with next-token prediction, tracking training vs. validation losses. You will explore decoding strategies (e.g., temperature, top-k), control randomness for reproducibility, and save/load PyTorch weights.

Tuning for Classification

Then the course covers preparing datasets and dataloaders, initializing from pretrained weights, and add a classification head with softmax. Train and evaluate with loss/accuracy, culminating in an LLM-based spam-classification example.

Fine-Tuning

Finally you will practice supervised instruction tuning: format datasets, batch efficiently, and fine-tune a pretrained LLM. Also evaluate outputs, export responses/checkpoints, and apply parameter-efficient methods such as LoRA.

Audience Course Building Large Language Models

The course Building Large Language Models is intended for engineers who want to design transformer-based LLMs.

Prerequisites Course Building Large Language Models

Participants should be comfortable with Python. Prior exposure to PyTorch or a similar Deep Learning framework is helpful.

Realization Training Building Large Language Models

The training blends concise theory with guided, hands-on labs. Through code-alongs you’ll build a mini-GPT, prepare datasets, run pretraining and fine-tuning and deploy models.

Building Large Language Models Certificate

After completion, participants receive a certificate of participation for the course Building Large Language Models.

Modules

Module 1 : LLM Intro

  • What is an LLM?
  • Applications of LLMs
  • Stages of Building LLMs
  • Stages of Using LLMs
  • Transformer Architecture
  • Utilizing Large Datasets
  • GPT Architecture Internals
  • Learn Language Patterns
  • Retrieval Augmented Generation
  • Question and Answer Systems
  • QA versus RAG
  • Building an LLM

Module 2 : Working with Text Data

  • Word Embeddings
  • Decoders and Encoders
  • Decoder Only Transformer
  • Tokenizing text
  • Convert Tokens into IDs
  • Special Context Tokens
  • Understand Sentence Structure
  • Byte Pair Encoding
  • Unknown Words
  • Sampling with Sliding Window
  • Creating Token Embeddings
  • Encoding Word Positions

Module 3 : Attentions Mechanism

  • Modeling Long Sequences
  • Capturing Data Dependencies
  • Attention Mechanisms
  • Attending Different Input Parts
  • Using Self-Attention
  • Trainable Weights
  • Hiding Future Words
  • Positional Encoding
  • Causal Attention
  • Masking Weights with Dropout
  • Multihead Attention
  • Stacking Attentions Layers

Module 4 : Pytorch Deep Learning

  • Deep Learning Intro
  • Overview of PyTorch
  • PyTorch Tensors
  • Tensor Operations
  • Model Evaluation Metrics
  • Feature Scaling
  • Feature Normalization
  • Categorical Features
  • Activation Functions
  • Loss Functions
  • Backpropagation

Module 5 : Neural Networks

  • Neural Networks Intro
  • Building NN with PyTorch
  • Multiple Layers of Arrays
  • Convolutional Neural Networks
  • Activation Functions
  • Loss Functions
  • Backpropagation
  • Natural Language Processing
  • Stopword Removal
  • Binary Classification
  • Multi-class Classification

Module 6 : GPT from scratch

  • Coding an LLM Architecture
  • Layer Normalization
  • Normalizing Activations
  • Feed Forward Network
  • GELU Activations
  • Adding Shortcut Connections
  • Connecting Attention
  • Weight Tying
  • Linear Layers in Transformer Block
  • Coding the GPT Model
  • Generating Text

Module 7 : Pretraining

  • Pretraining on Unlabeled Data
  • Calculating Text Generation Loss
  • Training Losses
  • Validation Set Losses
  • Training an LLM
  • Decoding Strategies
  • Control Randomness
  • Temperature Scaling
  • Saving Model Weights in PyTorch
  • Loading Pretrained Weights

Module 8 : Tuning for Classification

  • Categories of Fine-Tuning
  • Preparing the Dataset
  • Creating Data Loaders
  • Top-k Sampling
  • Soft-Max Function
  • Initializing with Pretrained Weights
  • Adding Classification Head
  • Classification Loss and Accuracy
  • Fine-tuning on Supervised Data
  • Using LLM as Spam Classifier

Module 9 : Fine-Tuning

  • Instruction Fine-tuning
  • Supervised Instruction
  • Preparing a Dataset
  • Organizing Training Batches
  • Creating Data Loaders
  • Loading a pretrained LLM
  • Fine-tuning the LLM
  • Extracting and Saving Responses
  • Evaluating Fine-tuned LLM
  • Fine Tuning with LoRA
Meer...
€3.200
ex. BTW
Aangeboden door
SpiralTrain
Onderwerp
Niveau
Duur
4 dagen
Looptijd
24 dagen
Taal
en
Type product
cursus
Lesvorm
Klassikaal
Aantal deelnemers
Max: 12
Tijdstip
Overdag
Tijden en locaties
Amsterdam
ma 8 jun. 2026
Eindhoven
ma 8 jun. 2026
Houten
ma 8 jun. 2026
Rotterdam
ma 8 jun. 2026
Utrecht
ma 8 jun. 2026
Zwolle
ma 8 jun. 2026
Amsterdam
ma 10 aug. 2026
Eindhoven
ma 10 aug. 2026
Houten
ma 10 aug. 2026
Rotterdam
ma 10 aug. 2026
Utrecht
ma 10 aug. 2026
Zwolle
ma 10 aug. 2026
Amsterdam
ma 12 okt. 2026
Eindhoven
ma 12 okt. 2026
Houten
ma 12 okt. 2026
Rotterdam
ma 12 okt. 2026
Utrecht
ma 12 okt. 2026
Zwolle
ma 12 okt. 2026
Amsterdam
ma 14 dec. 2026
Eindhoven
ma 14 dec. 2026
Houten
ma 14 dec. 2026
Rotterdam
ma 14 dec. 2026
Utrecht
ma 14 dec. 2026
Zwolle
ma 14 dec. 2026
Amsterdam
ma 8 feb. 2027
Eindhoven
ma 8 feb. 2027
Houten
ma 8 feb. 2027
Rotterdam
ma 8 feb. 2027
Utrecht
ma 8 feb. 2027
Zwolle
ma 8 feb. 2027
Amsterdam
ma 12 apr. 2027
Eindhoven
ma 12 apr. 2027
Houten
ma 12 apr. 2027
Rotterdam
ma 12 apr. 2027
Utrecht
ma 12 apr. 2027
Zwolle
ma 12 apr. 2027
Amsterdam
ma 14 jun. 2027
Eindhoven
ma 14 jun. 2027
Houten
ma 14 jun. 2027
Rotterdam
ma 14 jun. 2027
Utrecht
ma 14 jun. 2027
Zwolle
ma 14 jun. 2027
Amsterdam
ma 9 aug. 2027
Eindhoven
ma 9 aug. 2027
Houten
ma 9 aug. 2027
Rotterdam
ma 9 aug. 2027
Utrecht
ma 9 aug. 2027
Zwolle
ma 9 aug. 2027
Amsterdam
ma 11 okt. 2027
Eindhoven
ma 11 okt. 2027
Houten
ma 11 okt. 2027
Rotterdam
ma 11 okt. 2027
Utrecht
ma 11 okt. 2027
Zwolle
ma 11 okt. 2027
Amsterdam
ma 13 dec. 2027
Eindhoven
ma 13 dec. 2027
Houten
ma 13 dec. 2027
Rotterdam
ma 13 dec. 2027
Utrecht
ma 13 dec. 2027
Zwolle
ma 13 dec. 2027
Amsterdam
ma 14 feb. 2028
Eindhoven
ma 14 feb. 2028
Houten
ma 14 feb. 2028
Rotterdam
ma 14 feb. 2028
Utrecht
ma 14 feb. 2028
Zwolle
ma 14 feb. 2028
Amsterdam
ma 10 apr. 2028
Eindhoven
ma 10 apr. 2028
Houten
ma 10 apr. 2028
Rotterdam
ma 10 apr. 2028
Utrecht
ma 10 apr. 2028
Zwolle
ma 10 apr. 2028
Amsterdam
ma 12 jun. 2028
Eindhoven
ma 12 jun. 2028
Houten
ma 12 jun. 2028
Rotterdam
ma 12 jun. 2028
Utrecht
ma 12 jun. 2028
Zwolle
ma 12 jun. 2028
Amsterdam
ma 14 aug. 2028
Eindhoven
ma 14 aug. 2028
Houten
ma 14 aug. 2028
Rotterdam
ma 14 aug. 2028
Utrecht
ma 14 aug. 2028
Zwolle
ma 14 aug. 2028
Amsterdam
ma 9 okt. 2028
Eindhoven
ma 9 okt. 2028
Houten
ma 9 okt. 2028
Rotterdam
ma 9 okt. 2028
Utrecht
ma 9 okt. 2028
Zwolle
ma 9 okt. 2028
Keurmerken aanbieder
NRTO
UWV scholingsvoucher