Skip to content

[supervisor][sfm]Fix the issue of swss.sh shows backtrace when shutdo…#342

Merged
gechiang merged 1 commit intoAzure:202205from
gechiang:202205
Apr 20, 2024
Merged

[supervisor][sfm]Fix the issue of swss.sh shows backtrace when shutdo…#342
gechiang merged 1 commit intoAzure:202205from
gechiang:202205

Conversation

@gechiang
Copy link
Copy Markdown
Contributor

…wn a SFM. (#18393)

On a Supervisor card of a VOQ chassis, when remove or shutdown a Fabric card, swss.sh shows Stacktrace for all related empty SFM slots in the syslog file. This PR fixes #18384

How I did it
In the asic_status.py, all empty SFM slots related swss.sh is in the waiting state to wait for the presence event of SFM -- SET operation. The subscriber event handler also includes the "DEL" operation when a SFM is shutdown/removal. When a SFM is shutdown, all empty slot's swss.sh also get the "DEL" event although it is not for them. In the "DEL" operation, the current implementation doesn't check if this "DEL" operation for them, and then they exit the wait state and proceed to docker-wait-any with wrong operation in the wrong slot. docker-wait0any raise the backtarce.

How to verify it
In a chassis which has some empty SMF slot, remove or shutdown a SFM. There should not be related stacktrace shown in syslog

Why I did it

Work item tracking
  • Microsoft ADO (number only):

How I did it

How to verify it

Which release branch to backport (provide reason below if selected)

  • 201811
  • 201911
  • 202006
  • 202012
  • 202106
  • 202111
  • 202205
  • 202211

Tested branch (Please provide the tested image version)

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

…wn a SFM. (#18393)

On a Supervisor card of a VOQ chassis, when remove or shutdown a Fabric card, swss.sh shows Stacktrace for all related empty SFM slots in the syslog file. This PR fixes #18384

How I did it
In the asic_status.py, all empty SFM slots related swss.sh is in the waiting state to wait for the presence event of SFM -- SET operation. The subscriber event handler also includes the "DEL" operation when a SFM is shutdown/removal. When a SFM is shutdown, all empty slot's swss.sh also get the "DEL" event although it is not for them. In the "DEL" operation, the current implementation doesn't check if this "DEL" operation for them, and then they exit the wait state and proceed to docker-wait-any with wrong operation in the wrong slot. docker-wait0any raise the backtarce.

How to verify it
In a chassis which has some empty SMF slot, remove or shutdown a SFM. There should not be related stacktrace shown in syslog

Signed-off-by: mlok <marty.lok@nokia.com>
@gechiang gechiang requested a review from arlakshm April 20, 2024 01:25
@gechiang gechiang requested a review from lguohan as a code owner April 20, 2024 01:25
@gechiang
Copy link
Copy Markdown
Contributor Author

Thi sis to manually cherry-pick (sonic-net/sonic-buildimage#18393) to msft repo 202205 branch.

@gechiang gechiang merged commit 66f93d7 into Azure:202205 Apr 20, 2024
prabhataravind pushed a commit that referenced this pull request Jul 7, 2025
…lly (#21564)

#### Why I did it
src/sonic-gnmi
```
* a023991 - (HEAD -> master, origin/master, origin/HEAD) GNOI Implementation of OS.Verify (#342) (33 hours ago) [Dawei Huang]
* a538f49 - Enable Pfcwd Queries (#332) (2 days ago) [Zain Budhwani]
```
#### How I did it
#### How to verify it
#### Description for the changelog
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants