Training

Pretraining

Pretraining is the act of training a model from scratch: the weights are randomly initialized, and the training starts without any prior knowledge.
Trains on massive datasets
Very computationally expensive
Performed by self-supervised learning
- It is a type of training in which the objective is automatically computed from the inputs of the model

https://huggingface.co/learn/llm-course/chapter11/1
https://unsloth.ai/docs/get-started/fine-tuning-llms-guide
Phases
- Instruction Tuning
  - Adapting pre-trained language models to follow human instructions and engage in conversations
  - Technique: Supervised Fine Tuning (SFT)
- Preference Alignment
  - Polishes the model’s behavior, safety, and helpfulness using Chosen vs. Rejected preference data
  - Technique: Reinforcement Learning from Human Feedback (RLHF)
Optimizations
- Low Rank Adaptation (LoRA)
- Quantized LoRA (QLoRA)
The term “fine tuning” is confusing: https://www.reddit.com/r/MachineLearning/comments/1ewezs4/d_have_people_stopped_saying_fine_tuning_in_place/?rdt=52830

Use Local CPU/GPU
Free Hosted Compute
- Google Colab
  - Sessions can expire
- Kaggle Notebooks
Rented GPU Providers
- People Rent GPUs, Data privacy depends on provider
- Vast.ai
- Runpod.io
- Jarvislabs.ai
Managed ML Platform
- Azure ML
- AWS Sage Maker
- Google Vertex AI

aka Transfer learning (or type of?)
Training done after model has been pretrained
In Computer Vision, this has been successfully applied already
- For image classification, knowledge gained while learning to recognize cars could be applied when trying to recognize trucks.
We initialize weights from pretrained model and perform training on smaller dataset
The final weights layer is modified based on the use case
Performed by supervised learning

https://huggingface.co/learn/llm-course/chapter3/5
Learning curves are visual representations of your model’s performance metrics over time during training
Training and Validation are represented

Healthy Learning
Overfitting
- It occurs when the model learns too much from the training data and is unable to generalize to different data (represented by the validation set).
Underfitting
- It occurs when the model is too simple to capture the underlying patterns in the data.