-
Notifications
You must be signed in to change notification settings - Fork 5.9k
[AMP] Support pure fp16 training mode for dygraph #35521
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 29 commits
Commits
Show all changes
31 commits
Select commit
Hold shift + click to select a range
98c720f
add pure fp16 major function in auto_cast & tracer
zhangbo9674 228c855
support master weight in dygraph for pure fp16
zhangbo9674 a7f00a1
check mix dtype of fp16&fp32 for check_finite_and_unscale op
zhangbo9674 1dce0cc
change pure fp16 funtion name
zhangbo9674 422ced8
refine some bug in auto_cast
zhangbo9674 1366dad
refine auto_cast interface logic
zhangbo9674 5cdc012
add param _casted_by_pure_fp16 for class Layer
zhangbo9674 9e5399b
support state_dict hook for save model by user appointed dtype in pur…
zhangbo9674 91af1e9
refine pure_fp16_decorator as decorator
zhangbo9674 6cb2108
merge paddle develop
zhangbo9674 00cfbef
add unittest
zhangbo9674 64e2af6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…
zhangbo9674 ac2342b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…
zhangbo9674 6df1354
add comment
zhangbo9674 528da76
add comment
zhangbo9674 ae6d0a4
support recompute
zhangbo9674 806018b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…
zhangbo9674 d1c277a
add comment for auto_cast and decorator
zhangbo9674 c2b472b
support to_static_state_dict for paddle.jit.save
zhangbo9674 f9f75a4
unlimite models num and optimizers num
zhangbo9674 fa9c9d9
add lookup_table in black_list
zhangbo9674 cafea36
fix momentum and layer state_dict
zhangbo9674 cd545e6
merget upstream develop
zhangbo9674 1502f55
fix bug in layer state_dict
zhangbo9674 fb9a785
fix bug in layer state_dict_helper
zhangbo9674 704e7f6
refine unittest
zhangbo9674 5b56d84
refine test_momentun_op
zhangbo9674 042a953
refine interface and some code
zhangbo9674 b6e4a99
refine amp_decorator interface
zhangbo9674 4524bab
refine pure fp16 interface
zhangbo9674 e1118cb
refine master weight interface
zhangbo9674 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
下次提交的时候,帮忙把
sparse_momentum也加上MasterParam吧,谢谢There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
目前框架中还没有找到使用sparse_momentum的优化器,以及动态图调用sparse_momentum的地方,所以pure fp16的pr中暂时先不加入了。