-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Open
Labels
feature requestNew feature or requestNew feature or request
Description
Our finetuning focus has been a bit lacking lately. Let's change that, starting with LoRA!
- Run a finetuning baseline with a Pythia model that saturates VRAM, measure its TFLOPs/VRAM usage in wandb and link it here. Choose a small finetuning dataset for this that requires little compute to converge without being trivial.
- Add prototype LoRA support to gpt-neox
- Compare to baseline in step 1 and ensure TFLOP/VRAM changes make sense
- Compare to baseline in step 1 and ensure loss is maintained
As discussed in Discord @mkerin
Metadata
Metadata
Assignees
Labels
feature requestNew feature or requestNew feature or request