
Results with pretrain_data_fraction args #1066

@claravania

Description

I'm trying to train MNLI in a low-data regime (~750 examples) on RoBERTa, using the pretrain_data_fraction argument. I'm using the default RoBERTa hyperparameter setup, but set the number of epochs higher to let the model train longer. However, I got quite high performance, 89.02 macro_avg, only slightly lower than with the full data (90.2 macro_avg). Could there possibly be a bug?

Attaching the log file, config file, and my command.

Command:
python main.py --config_file jiant/config/nli-roberta_conf.txt -o "run_name=run1, random_seed=123456"

nli-roberta_conf.txt
log.log

(I changed .conf to _conf.txt since GitHub doesn't allow uploads in that format.)
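For context on what a data-fraction argument typically does, here is a minimal sketch of deterministic, seed-controlled subsampling of a training set. This is a hypothetical illustration (the function name `subsample` and its behavior are assumptions, not jiant's actual implementation); it can help sanity-check how many examples a given fraction should yield:

```python
import random

def subsample(examples, fraction, seed=123456):
    """Deterministically keep a fraction of the training examples.

    Hypothetical sketch of what a *_data_fraction-style argument
    usually does; NOT jiant's actual code.
    """
    rng = random.Random(seed)          # fixed seed -> reproducible subset
    shuffled = examples[:]             # copy so the input is untouched
    rng.shuffle(shuffled)
    n_keep = max(1, round(len(shuffled) * fraction))
    return shuffled[:n_keep]

# MNLI has ~392,702 training examples, so keeping ~750 of them
# corresponds to a fraction of roughly 750 / 392702 ≈ 0.0019.
train = list(range(392702))
subset = subsample(train, 750 / 392702)
print(len(subset))
```

With a fixed seed the same subset is selected on every run, which matters when comparing low-data results across random_seed settings like the one in the command above.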


Labels: jiant-v1-legacy (Relevant to versions <= v1.3.2)
