Low Rank Adaptation (LoRA)

LoRA Configuration

  • r: rank
  • lora_alpha
  • lora_dropout
  • bias
  • target_modules

Parameter-Efficient Fine-Tuning (PEFT)

  • lets you adapt large models by training a small number of additional parameters while keeping the base model frozen
  • Most used method: LoRA
  • These methods can be used on any training stage be it SFT or RLHF