[action] [PR:3788] [Mux] Clear bulkers when rolling back mux switchover #3855

Merged
mssonicbld merged 1 commit into sonic-net:202505 from mssonicbld:cherry/202505/3788
Sep 3, 2025
Conversation

@mssonicbld
Collaborator

What I did
When a switchover failure is detected in MuxOrch, clear relevant bulkers to provide a clean slate for the rollback process.

Why I did it
In certain failure scenarios, if an exception is thrown inside the bulker, the bulker may not be cleared and can still hold data in creating_entries or removing_entries. When the rollback process begins, these entries are programmed to the SAI a second time, which (a) is incorrect and (b) could trigger the same exception a second time.
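
The failure mode described above can be illustrated with a minimal, self-contained sketch. This is not the sonic-swss code: `ToyBulker`, `FakeSai`, and `simulate_rollback` are hypothetical names invented for illustration, and only the `creating_entries` / `removing_entries` member names are taken from the description. The sketch assumes a bulker whose flush can throw partway through, leaving queued entries behind, and compares a rollback with and without clearing the bulker first.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for the SAI: records every entry it is asked to program
// and can be told to fail exactly once.
struct FakeSai {
    std::vector<std::string> programmed;
    bool fail_next = false;

    void create(const std::string& entry) {
        if (fail_next) {
            fail_next = false;
            throw std::runtime_error("SAI failure on " + entry);
        }
        programmed.push_back(entry);
    }
};

// Toy bulker (an assumption, not the real bulker API): queues entries and
// programs them on flush. If flush throws, the queues are left populated.
struct ToyBulker {
    std::vector<std::string> creating_entries;
    std::vector<std::string> removing_entries;

    void create_entry(std::string entry) {
        creating_entries.push_back(std::move(entry));
    }

    void flush(FakeSai& sai) {
        for (const auto& entry : creating_entries) {
            sai.create(entry);  // may throw before clear() below is reached
        }
        clear();
    }

    void clear() {
        creating_entries.clear();
        removing_entries.clear();
    }
};

// Simulates a failed switchover followed by a rollback and returns everything
// the fake SAI ended up programming.
std::vector<std::string> simulate_rollback(bool clear_before_rollback) {
    FakeSai sai;
    ToyBulker bulker;

    // Forward switchover: the flush throws, so "active_route" stays queued.
    bulker.create_entry("active_route");
    sai.fail_next = true;
    try {
        bulker.flush(sai);
    } catch (const std::runtime_error&) {
        // Switchover failure detected; a rollback follows.
    }

    // The fix sketched here: give the rollback a clean slate.
    if (clear_before_rollback) {
        bulker.clear();
    }

    // Rollback: only "standby_route" should be programmed.
    bulker.create_entry("standby_route");
    bulker.flush(sai);
    return sai.programmed;
}
```

In this toy model, `simulate_rollback(false)` also programs the stale `active_route` entry during the rollback flush, while `simulate_rollback(true)` programs only `standby_route`.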

How I verified it
Ran the MuxRollbackTest.StandbyToActiveExceptionRollbackToStandby test.

Details if related

@mssonicbld
Collaborator Author

Original PR: #3788

@mssonicbld
Collaborator Author

/azp run

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@yejianquan yejianquan marked this pull request as draft September 2, 2025 01:05
@yejianquan yejianquan marked this pull request as ready for review September 3, 2025 02:38
@mssonicbld mssonicbld merged commit f5c6e02 into sonic-net:202505 Sep 3, 2025
15 checks passed