@guoshengCS guoshengCS commented Nov 15, 2020

PR types

Bug fixes

PR changes

APIs

Describe

Fix the scaled_params append error and the no_grad setting in AdamW.
Replace the .numpy() followed by set_value round trip with assign to speed up the parameter update (BERT-base: 0.8 step/s -> 3.43 step/s).
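
For context, AdamW applies weight decay by scaling each parameter directly rather than adding a term to the gradient. The sketch below is a minimal pure-Python illustration of that in-place scaling step, not the code changed in this PR: `scale_parameters`, the dict-of-lists parameter layout, and the `apply_decay` filter are hypothetical names chosen for illustration (the filter loosely mirrors AdamW's `apply_decay_param_fun` argument).

```python
def scale_parameters(params, lr, coeff, apply_decay=None):
    """Decoupled weight decay sketch: p <- p * (1 - lr * coeff), in place.

    params: dict mapping a parameter name to a list of floats (hypothetical layout).
    apply_decay: optional predicate on the name; parameters it rejects are skipped.
    Returns the names of the parameters that were actually scaled.
    """
    factor = 1.0 - lr * coeff
    scaled_names = []
    for name, values in params.items():
        if apply_decay is not None and not apply_decay(name):
            continue  # excluded from weight decay, so it must not be recorded
        for i in range(len(values)):
            values[i] *= factor  # update in place, no copy out and back
        scaled_names.append(name)  # record only parameters that were scaled
    return scaled_names


params = {"linear.weight": [1.0, -2.0], "linear.bias": [0.5]}
scaled = scale_parameters(
    params,
    lr=0.1,
    coeff=0.5,
    apply_decay=lambda name: not name.endswith(".bias"),
)
```

Updating the values in place, instead of copying them out and writing them back, is the same idea as replacing the `.numpy()` plus `set_value` round trip with `assign`.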

Fix no_grad setting in AdamW.
test=develop
@wawltor wawltor left a comment

LGTM

@guoshengCS guoshengCS merged commit a3bc3bc into PaddlePaddle:develop Nov 16, 2020