Open
Labels
good first issue (Good for newcomers)
Description
Follow the GPT-2 reference .py file and initialize the weights in C from scratch in the exact same way.
Allow init from scratch instead of init from checkpoint when building the GPT-2.
Add a command-line flag to select which of the two initialization paths to use.
It is OK to change only the mainline development file, train_gpt2.cu.