Skip to content

Conversation

@zhiqiu
Copy link
Contributor

@zhiqiu zhiqiu commented Nov 16, 2021

PR types

Others

PR changes

OPs

Describe

By default, beta_pow of adam optimizer is initialized on CPU Place.
When training on CUDA Place, the original implementation will copy it to CUDA Place when skip_update=1.

This PR fixes that.

@paddle-bot-old
Copy link

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@zhiqiu zhiqiu requested review from Aurelius84 and phlrain November 16, 2021 08:49
@zhiqiu zhiqiu force-pushed the dev/refine_adam_skip_update branch from fb56835 to eedf679 Compare November 16, 2021 10:20
Copy link
Contributor

@Aurelius84 Aurelius84 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@zhiqiu zhiqiu merged commit 5e4b419 into PaddlePaddle:develop Nov 17, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants