Orchagent send heartbeat during warm-reboot to prevent Orchagent stuck alert.#2923
Conversation
|
I think this qualifies for a new UT, to check of heartbeats are still sent after Orchagent pause. |
Fixed, add heartbeat message check to warm reboot UT. |
|
Trigger build again |
This comment was marked as resolved.
This comment was marked as resolved.
|
/AzurePipelines run Azure.sonic-swss |
|
Azure Pipelines successfully started running 1 pipeline(s). |
|
@liuh-80 Can you please address the pipeline failures? |
Sure, I'm still working on this PR. |
|
The UT failed because supervisord not update process stdout to syslog immediately, it always delays few minutes, so test cause in this repo is very difficult to check the heartbeat signal. Spend too much time on write UT in this repo, will add UT in sonic-mgmt repo to test heartbeat in warmreboot, which will be much easy. |
05ae172 to
6e19948
Compare
|
@qiluo-msft , @vaibhavhd to signoff |
|
@liuh-80 This PR conflicts when I cherry-pick to 202305 branch. Can you please create a new PR for fixing it in 202305? |
|
@prsunny @qiluo-msft @vaibhavhd Can you please sign off this PR? |
@liuh-80 , i dont see the UT changes. Can you check? |
@prsunny , I try to add UT but found it's difficult to create a UT in this repo, because heartbeat message is send to systemd and sytemd have few minutes delay to write syslog. So, my plan is to create test case in sonic-mgmt repo, will create PR in sonic-mgmt later. Currently I manually verified change in this PR works. |
|
UT created in sonic-mgmt repo: sonic-net/sonic-mgmt#10676 |
|
@liuh-80 could you help to update the test result with 202305? |
*Orchagent send heartbeat during warm-reboot to prevent Orchagent stuck alert.
Description updated, tested on SONiC.202305-17081.401641-ec2aed854 |
|
@liuh-80 cherry pick conflict, could you file separate PR to 202305? |
*Orchagent send heartbeat during warm-reboot to prevent Orchagent stuck alert.
Orchangent send heartbeat during warm-reboot to prevent Orchagent stuck alert. Why I did it Orchangent will freese during warm-reboot, then supervisor-proc-exit-listener will generate false alert during warm reboot: sonic-net/sonic-buildimage#16686 Work item tracking Microsoft ADO: 25295846 How I did it Send heartbeat during warm-reboot freeze. How to verify it Pass all UT. Manually verify issue fixed by check syslog.
*Orchagent send heartbeat during warm-reboot to prevent Orchagent stuck alert.
Orchangent send heartbeat during warm-reboot to prevent Orchagent stuck alert.
Why I did it
Orchangent will freese during warm-reboot, then supervisor-proc-exit-listener will generate false alert during warm reboot:
sonic-net/sonic-buildimage#16686
Work item tracking
How I did it
Send heartbeat during warm-reboot freeze.
How to verify it
Pass all UT.
Manually verify issue fixed by check syslog.
Which release branch to backport (provide reason below if selected)
Tested branch (Please provide the tested image version)
Description for the changelog
Orchangent send heartbeat during warm-reboot to prevent Orchagent stuck alert.
Link to config_db schema for YANG module changes
A picture of a cute animal (not mandatory but encouraged)