-
Notifications
You must be signed in to change notification settings - Fork 461
Avoid reboots for ICSP changes #1922
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: beekhof The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
pkg/daemon/apply.go
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
s/Storage/Registy/g perhaps? Also, I think this must be abstracted out, we won't add if/else(s) to this
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm also in two minds about have flexible vs. simple this should be.
Definitely open to suggestions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
At one point there was basically a lookup table that mapped filenames to actions, with reboot used for anything not present.
How would you feel about re-introducing that?
pkg/daemon/apply.go
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
uhm, same as above - don't have a solution rn, will think about it!
69df51d to
1ead451
Compare
d094e44 to
0093ff6
Compare
0093ff6 to
b2bacda
Compare
kikisdeliveryservice
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
All of the user stories on this are baremetal, why not only roll this out to baremetal clusters (at least first)?
That makes a lot of sense to me 👍 |
|
I'd be happy to do that... any hints on how? |
|
/retest e2e-aws |
|
@beekhof: The
Use
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/test e2e-aws |
16b9bc0 to
9f02c47
Compare
9f02c47 to
3c58868
Compare
|
//nogocyclo |
Signed-off-by: Andrew Beekhof <[email protected]>
Signed-off-by: Andrew Beekhof <[email protected]>
Signed-off-by: Andrew Beekhof <[email protected]>
Signed-off-by: Andrew Beekhof <[email protected]>
Signed-off-by: Andrew Beekhof <[email protected]>
Signed-off-by: Andrew Beekhof <[email protected]>
Signed-off-by: Andrew Beekhof <[email protected]>
Signed-off-by: Andrew Beekhof <[email protected]>
Signed-off-by: Andrew Beekhof <[email protected]>
Signed-off-by: Andrew Beekhof <[email protected]>
08de2c6 to
54b43c3
Compare
A second go around the update was a result of this PR. |
@crawford this is a shift of almost 2y on this pattern of "always reboot", now we're going the route of "avoid reboots" which, given how important reboots are for upgrades, warrants more time to finalize and stabilize. In the enhancement we've clarified that this ICSP case is definitely a start in line with the plan of avoiding reboots and we can evolve from here but we haven't gauged and thought about other cases where maybe this needs further changes and maybe some break here and there (which will make us support another migration? or added logic to handle different cases for compatibility?). There are ~4 ways this can be gated:
@cgwalters @ashcrow I've replied above for those comments as well |
…reboots flow Signed-off-by: Andrew Beekhof <[email protected]>
@runcom fixed in the latest patch |
e59a449 to
a4f56a2
Compare
|
Is there a final decision about whether to gate, and if so, how? |
|
@beekhof: The following tests failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
|
@beekhof: The following tests failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
|
Issues go stale after 90d of inactivity. Mark the issue as fresh by commenting If this issue is safe to close now please do so with /lifecycle stale |
|
Stale issues rot after 30d of inactivity. Mark the issue as fresh by commenting If this issue is safe to close now please do so with /lifecycle rotten |
|
Rotten issues close after 30d of inactivity. Reopen the issue by commenting /close |
|
@openshift-bot: Closed this PR. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
- What I did
Cleaned up the PoC code for openshift/enhancements#159, replaces PR #1626
- How to verify it
e2e and unit tests included
- Description for the changelog
Provide an alternative to node reboots in the specific cases that we know it is safe to do so