Add more retry times when upgrade image#6810
Closed
ZhaohuiS wants to merge 2 commits intosonic-net:masterfrom
Closed
Add more retry times when upgrade image#6810ZhaohuiS wants to merge 2 commits intosonic-net:masterfrom
ZhaohuiS wants to merge 2 commits intosonic-net:masterfrom
Conversation
Signed-off-by: Zhaohui Sun <zhaohuisun@microsoft.com>
Collaborator
|
If adding retry from 5 to 10 can work around this, it indicates that there is a chance upgrading can complete in 5 minutes if we retry more times. It looks like the more fundamental issue is that if any ansible module needs more than 5 minutes to complete, ansible will force terminate execution of this task before it is completed. This sounds like a new fundamental issue. So, I don't think simply increasing retry times is the correct way to fix this issue. |
Signed-off-by: Zhaohui Sun <zhaohuisun@microsoft.com>
Contributor
Author
|
The root reason is introduced by this PR sonic-net/sonic-buildimage#12109. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Signed-off-by: Zhaohui Sun zhaohuisun@microsoft.com
Description of PR
Summary:
Fixes # (issue)
Type of change
Back port request
Approach
What is the motivation for this PR?
upgrade image may fail because upgrade image takes more than 5m.
For example, upgrade mellanox testbed with master image may take more than 5m sometimes.
But not sure why default timeout is 5m.
How did you do it?
Refer to ansible document:
https://docs.ansible.com/ansible/latest/user_guide/playbooks_async.html#:~:text=If%20you%20want%20to%20run,longer%20than%20its%20async%20value
Use async and poll to run script asynchronously. Set timeout to 500s.
How did you verify/test it?
Run
ansible-playbook upgrade_sonic.ymlAny platform specific information?
Supported testbed topology if it's a new test case?
Documentation