[Mellanox] Stop SDK monitor thread before syncd shutdown #23681

Merged
kperumalbfn merged 1 commit into sonic-net:master from volodymyrsamotiy:stop_sdk_health_mon on Sep 22, 2025

@volodymyrsamotiy
Collaborator

Why I did it

To avoid an error message in syslog, related to the SDK monitor thread, during the shutdown flow.
The error message was observed only during the shutdown flow, so there was no functional impact, only an error log.
It was not always reproducible because it was caused by a race condition involving the monitor thread.

Work item tracking
  • Microsoft ADO (number only):

How I did it

During the shutdown flow, stop the SDK monitor thread before the actual SDK shutdown to avoid the race condition.
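
The ordering described above can be illustrated with a minimal sketch. This is not the code changed by this PR and does not use the real SDK API: `FakeSdk`, `HealthMonitor`, and `shutdown_in_order` are hypothetical names, and the sketch only assumes that the monitor is a thread that periodically polls the SDK. The point it shows is that signalling the thread and joining it before tearing down the SDK leaves no window in which a poll can hit an already closed SDK.

```python
import threading


class FakeSdk:
    """Hypothetical stand-in for the SDK; not the real SDK API."""

    def __init__(self):
        self.is_open = True
        self.errors = []

    def poll_health(self):
        # A poll against a closed SDK is what would surface as an error log.
        if not self.is_open:
            self.errors.append("health poll after SDK shutdown")

    def shutdown(self):
        self.is_open = False


class HealthMonitor:
    """Hypothetical monitor thread that polls the SDK until asked to stop."""

    def __init__(self, sdk, interval=0.01):
        self._sdk = sdk
        self._interval = interval
        self._stop_event = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)

    def start(self):
        self._thread.start()

    def _run(self):
        # Event.wait returns True once stop() sets the event, ending the loop.
        while not self._stop_event.wait(self._interval):
            self._sdk.poll_health()

    def stop(self):
        self._stop_event.set()
        # Return only after the thread has finished its last poll.
        self._thread.join()

    def is_alive(self):
        return self._thread.is_alive()


def shutdown_in_order(sdk, monitor):
    # Stop the monitor first so no poll can race with the SDK teardown.
    monitor.stop()
    sdk.shutdown()
```

If the two calls in `shutdown_in_order` were swapped, a poll could still run between `sdk.shutdown()` and `monitor.stop()`, which is the kind of intermittent shutdown-only error the description refers to.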

How to verify it

By running any type of reboot or config reload

Which release branch to backport (provide reason below if selected)

  • 202205
  • 202211
  • 202305
  • 202311
  • 202405
  • 202411
  • 202505

Tested branch (Please provide the tested image version)

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
@mssonicbld
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@bingwang-ms
Contributor

LGTM. The error log was also noticed in 202505 qual.
If there is no functional impact, I think we can skip the 202505 branch and merge it to master only. What do you think?

@keboliu
Collaborator

keboliu commented Sep 18, 2025

> LGTM. The error log was also noticed in 202505 qual. If there is no functional impact, I think we can skip the 202505 branch and merge it to master only. What do you think?

Hi @bingwang-ms, although the error log has no functional impact, it will be captured by the log analyzer and fail some tests, so it is still good to cherry-pick it to avoid log analyzer failures.

@bingwang-ms
Contributor

@yejianquan Can you please evaluate the cherry-pick request to 202505? Thanks

@kperumalbfn kperumalbfn merged commit 46eafcb into sonic-net:master Sep 22, 2025
20 checks passed
@mssonicbld
Collaborator

Cherry-pick PR to 202505: #24079

arfeigin pushed a commit to arfeigin/sonic-buildimage that referenced this pull request Oct 21, 2025
tirupatihemanth added a commit to tirupatihemanth/sonic-buildimage that referenced this pull request Nov 20, 2025
yuazhe pushed a commit to yuazhe/sonic-buildimage that referenced this pull request Nov 24, 2025
tirupatihemanth added a commit to tirupatihemanth/sonic-buildimage that referenced this pull request Nov 25, 2025
FengPan-Frank pushed a commit to FengPan-Frank/sonic-buildimage that referenced this pull request Dec 4, 2025
…3681)

[Mellanox] Stop SDK health monitor before syncd shutdown

Signed-off-by: Feng Pan <fenpan@microsoft.com>
volodymyrsamotiy added a commit to volodymyrsamotiy/sonic-buildimage that referenced this pull request Mar 4, 2026
…utdown (sonic-net#23681)

Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
volodymyrsamotiy added a commit to volodymyrsamotiy/sonic-buildimage that referenced this pull request Mar 24, 2026
…utdown (sonic-net#23681)

Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
volodymyrsamotiy added a commit to volodymyrsamotiy/sonic-buildimage that referenced this pull request Mar 24, 2026
volodymyrsamotiy added a commit to volodymyrsamotiy/sonic-buildimage that referenced this pull request Mar 25, 2026
…down (sonic-net#23681)"

This reverts commit 46eafcb.

Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
7 participants