Fix label datatype in TF Trainer #9616

jplu · 2021-01-15T10:02:58Z

What does this PR do?

This PR fixes the case where labels can be either a dict or a tf.Tensor when doing gradient accumulation.

sgugger

This looks okay to me, but it looks increasingly clearer that we should have tests of the TFTrainer otherwise we are doing more harm than good by merging those kinds of PRs.

LysandreJik

Ok, LGTM!

LysandreJik · 2021-01-15T13:46:14Z

I agree with Sylvain that while this is not tested, it's hard to recommend using it.

Fix label datatype

d19b63c

jplu requested review from LysandreJik and sgugger January 15, 2021 10:03

jplu mentioned this pull request Jan 15, 2021

Gradient accumulation for TFTrainer #9585

Merged

5 tasks

Apply style

74e4f34

sgugger reviewed Jan 15, 2021

View reviewed changes

LysandreJik approved these changes Jan 15, 2021

View reviewed changes

jplu merged commit 12f0d7e into huggingface:master Jan 20, 2021

jplu deleted the fix-trainer branch January 20, 2021 11:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix label datatype in TF Trainer #9616

Fix label datatype in TF Trainer #9616

Uh oh!

jplu commented Jan 15, 2021

Uh oh!

sgugger left a comment

Uh oh!

LysandreJik left a comment

Uh oh!

LysandreJik commented Jan 15, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix label datatype in TF Trainer #9616

Fix label datatype in TF Trainer #9616

Uh oh!

Conversation

jplu commented Jan 15, 2021

What does this PR do?

Uh oh!

sgugger left a comment

Choose a reason for hiding this comment

Uh oh!

LysandreJik left a comment

Choose a reason for hiding this comment

Uh oh!

LysandreJik commented Jan 15, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants