[AMP] add state_dict and load_state_dict and unittest for class GradScaler #34300

zhangbo9674 · 2021-07-21T07:25:49Z

PR types

New features

PR changes

APIs

Describe

add state_dict and load_state_dict and unittest for class GradScaler
Chinese documentation link: http://10.136.157.23:8090/documentation/docs/zh/api/paddle/amp/GradScaler_cn.html?reviewVersion=jenkins-doc-review-2-191
English documentation link: http://10.136.157.23:8090/documentation/docs/zh/api/paddle/amp/GradScaler_cn.html?reviewVersion=jenkins-doc-review-2-191
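
For readers unfamiliar with the pattern, here is a minimal pure-Python sketch of what a state_dict / load_state_dict pair on a dynamic loss scaler might look like. ToyGradScaler, its attribute names (good_steps, bad_steps), and its update rule are hypothetical stand-ins written for illustration. The constructor argument names are chosen to resemble GradScaler's, but this is not the code added in this PR.

```python
# Illustrative sketch only: ToyGradScaler is a hypothetical stand-in,
# not paddle.amp.GradScaler and not the implementation in this PR.


class ToyGradScaler:
    def __init__(self, init_loss_scaling=2.0 ** 15, incr_ratio=2.0,
                 decr_ratio=0.5, incr_every_n_steps=1000,
                 decr_every_n_nan_or_inf=2):
        self.scale = float(init_loss_scaling)
        self.incr_ratio = incr_ratio
        self.decr_ratio = decr_ratio
        self.incr_every_n_steps = incr_every_n_steps
        self.decr_every_n_nan_or_inf = decr_every_n_nan_or_inf
        self.good_steps = 0  # consecutive steps with finite gradients
        self.bad_steps = 0   # consecutive steps with NaN/Inf gradients

    def update(self, found_inf):
        """Adjust the loss scale after one training step."""
        if found_inf:
            self.good_steps = 0
            self.bad_steps += 1
            if self.bad_steps == self.decr_every_n_nan_or_inf:
                self.scale *= self.decr_ratio
                self.bad_steps = 0
        else:
            self.bad_steps = 0
            self.good_steps += 1
            if self.good_steps == self.incr_every_n_steps:
                self.scale *= self.incr_ratio
                self.good_steps = 0

    def state_dict(self):
        """Return a copy of everything needed to resume training."""
        return dict(vars(self))

    def load_state_dict(self, state):
        """Restore a state previously produced by state_dict()."""
        if not state:
            raise ValueError("state_dict is empty")
        for key, value in state.items():
            setattr(self, key, value)


scaler = ToyGradScaler(init_loss_scaling=1024, incr_every_n_steps=2)
for found_inf in [False, False, True]:
    scaler.update(found_inf)

# A fresh scaler picks up exactly where the first one left off.
restored = ToyGradScaler()
restored.load_state_dict(scaler.state_dict())
print(restored.scale, restored.bad_steps)  # prints: 2048.0 1
```

The point of saving the counters along with the scale is that a resumed run then raises or lowers the scale on the same step as an uninterrupted run would.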

paddle-bot-old · 2021-07-21T07:26:15Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

TCChenlong

LGTM

zhiqiu

LGTM

lanxianghit · 2021-08-02T07:02:00Z

python/paddle/amp/grad_scaler.py


+                # required: gpu,xpu
                import paddle
+                paddle.set_device('gpu')


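The two newly added lines (the comment and the code) do not appear in either the Chinese or the English documentation preview. Were they added after the preview was generated? Also, do the other code examples need similar code added?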

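Thanks. The new comment and code were added after the preview was generated; I will update the preview. paddle.set_device('gpu') does not need to be added, because the GradScaler initialization that follows already warns about the device, so the line added here has been removed.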

lanxianghit · 2021-08-02T07:03:25Z

python/paddle/amp/grad_scaler.py


        Args:
-            new_init_loss_scaling(int):  The new_init_loss_scaling used to update initial loss scaling factor.
+            new_init_loss_scaling(float):  The new_init_loss_scaling used to update initial loss scaling factor.


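Please confirm that the type change is fully compatible with the previous behavior, for example whether any parameter type checking is affected.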

Thanks. The data type was float all along; I mistakenly committed int when I first added the comment, and this change corrects it to float.

lanxianghit

LGTM

zhiqiu · 2021-08-10T03:59:43Z

python/paddle/fluid/tests/unittests/test_imperative_auto_mixed_precision.py

+        print('save_load:', out_use_state_dict[0], out_no_state_dict[0])
+        self.assertTrue(
+            np.allclose(
+                out_use_state_dict[0], out_no_state_dict[0], atol=1.e-2))


Shouldn't these be exactly equal?

Thanks, they are equal after setting the flag FLAGS_cudnn_deterministic=True.
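
To make the distinction in this thread concrete, the sketch below contrasts a tolerance-based check with an exact one. The all_close helper is a pure-Python rendering of the rule np.allclose documents (|a - b| <= atol + rtol * |b|, with defaults rtol=1e-5 and atol=1e-8), and the loss values are made up for illustration; they are not outputs from this test.

```python
def all_close(a, b, rtol=1e-5, atol=1e-8):
    """Elementwise |a - b| <= atol + rtol * |b|, as np.allclose documents."""
    return all(abs(x - y) <= atol + rtol * abs(y) for x, y in zip(a, b))


# Hypothetical loss values from two runs that differ slightly in one step.
run_a = [0.6931, 0.5012, 0.4120]
run_b = [0.6931, 0.5049, 0.4120]

# With atol=1e-2, a difference of 0.0037 still passes the check.
print(all_close(run_a, run_b, atol=1e-2))  # prints: True

# An exact comparison catches the difference.
print(run_a == run_b)  # prints: False
```

A loose tolerance such as atol=1.e-2 can therefore hide a restore that is only approximately right, which is presumably why exact equality under deterministic kernels is the stronger test here.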

zhiqiu · 2021-08-10T04:07:58Z

python/paddle/fluid/tests/unittests/test_imperative_auto_mixed_precision.py

+                paddle.save(scaler.state_dict(), 'ResNet_model.pdparams')
+                dict_load = paddle.load('ResNet_model.pdparams')
+                scaler.load_state_dict(dict_load)


Check whether the state values are equal.

Thanks, the state values are equal.
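
One way such a check could look is sketched below, with pickle standing in for paddle.save / paddle.load so the snippet runs without Paddle. The state keys and values are hypothetical and are not taken from the actual GradScaler state.

```python
import os
import pickle
import tempfile

# Hypothetical scaler state; keys and values are illustrative only.
state = {
    "scale": 1024.0,
    "incr_count": 3,
    "decr_count": 0,
    "use_dynamic_loss_scaling": True,
}

# Save the state to disk and load it back (pickle is a stand-in here).
with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "scaler_state.pkl")
    with open(path, "wb") as f:
        pickle.dump(state, f)
    with open(path, "rb") as f:
        loaded = pickle.load(f)

# Compare every entry, reporting any key whose value changed.
mismatched = [key for key in state if loaded.get(key) != state[key]]
assert loaded.keys() == state.keys()
assert not mismatched, f"state changed after reload: {mismatched}"
```

If any of the values were arrays rather than plain scalars, the per-key comparison would need an array-aware equality check instead of `!=`.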

add state_dict and load_state_dict and unittest for class GradScaler

c615f3d

zhangbo9674 added 3 commits July 26, 2021 12:36

refine unittest for coverage of load_state_dict

1dce33a

refine comments of code-block

f917dff

refine some comments

eb96c24

TCChenlong previously approved these changes Jul 28, 2021

refine state_dict code and unittest

e2f855a

zhangbo9674 dismissed TCChenlong’s stale review via e2f855a August 2, 2021 02:48

zhangbo9674 added 2 commits August 2, 2021 03:33

add #require gpu, xpu for GradScaler get/set example code

b74be1b

add #require gpu, xpu for GradScaler get/set example code

81aa57b

zhiqiu previously approved these changes Aug 2, 2021

lanxianghit reviewed Aug 2, 2021

refine example code

5085547

zhangbo9674 dismissed zhiqiu’s stale review via 5085547 August 2, 2021 07:27

lanxianghit previously approved these changes Aug 2, 2021

refine unittest for state_dict

c039e7e

zhangbo9674 dismissed lanxianghit’s stale review via c039e7e August 6, 2021 02:43

zhangbo9674 added 3 commits August 6, 2021 02:56

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

5b699d3

… dev_gradscaler_state_dict

refine unittest for state_dict

97453ed

fix bug of DataLoader in TestGradScalerStateDict

f85b0e4

zhiqiu reviewed Aug 10, 2021

zhangbo9674 added 2 commits August 10, 2021 12:05

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

d171365

… dev_gradscaler_state_dict

add flag FLAGS_cudnn_deterministic

42561fd

lanxianghit approved these changes Aug 11, 2021

TCChenlong approved these changes Aug 11, 2021

zhiqiu merged commit 99f8f5c into PaddlePaddle:develop Aug 11, 2021

zhangbo9674 deleted the dev_gradscaler_state_dict branch September 14, 2022 02:23
