Skip to content

[Mellanox] fix for watchdog device not found, adding dependency on hw-management#14182

Merged
liat-grozovik merged 2 commits intosonic-net:masterfrom
dbarashinvd:dbarashi_watchdog_fix_hw_management
Mar 15, 2023
Merged

[Mellanox] fix for watchdog device not found, adding dependency on hw-management#14182
liat-grozovik merged 2 commits intosonic-net:masterfrom
dbarashinvd:dbarashi_watchdog_fix_hw_management

Conversation

@dbarashinvd
Copy link
Copy Markdown
Contributor

@dbarashinvd dbarashinvd commented Mar 9, 2023

Why I did it

sometimes mellanox watchdog device isn't ready when watchdog-control service is up after first installation from ONIE
need to delay watchdog control service to go up after hw-mgmt which gets devices up and ready

How I did it

Delay mellanox watchdog-control service before hw-mgmt has started on Mellanox platform in order to avoid missing or not ready watchdog device.

How to verify it

verification test of ONIE installation of image in a loop
making sure watchdog service is always up (not failed) after first installation from ONIE

Which release branch to backport (provide reason below if selected)

  • 201811
  • 201911
  • 202006
  • 202012
  • 202106
  • 202111
  • 202205
  • 202211

Description for the changelog

Ensure to add label/tag for the feature raised. example - PR#2174 under sonic-utilities repo. where, Generic Config and Update feature has been labelled as GCU.

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

@dbarashinvd dbarashinvd requested a review from lguohan as a code owner March 9, 2023 12:03
@liat-grozovik liat-grozovik merged commit 06d6daf into sonic-net:master Mar 15, 2023
@liat-grozovik
Copy link
Copy Markdown
Collaborator

@prgeor FYI

mssonicbld pushed a commit to mssonicbld/sonic-buildimage that referenced this pull request Mar 19, 2023
…-management (sonic-net#14182)

- Why I did it
Sometimes Nvidia watchdog device isn't ready when watchdog-control service is up after first installation from ONIE
need to delay watchdog control service to go up after hw-mgmt which gets devices up and ready

- How I did it
Delay Nvidia watchdog-control service before hw-mgmt has started on Mellanox platform in order to avoid missing or not ready watchdog device.

- How to verify it
verification test of ONIE installation of image in a loop
making sure watchdog service is always up (not failed) after first installation from ONIE
@mssonicbld
Copy link
Copy Markdown
Collaborator

Cherry-pick PR to 202211: #14335

mssonicbld pushed a commit that referenced this pull request Mar 19, 2023
…-management (#14182)

- Why I did it
Sometimes Nvidia watchdog device isn't ready when watchdog-control service is up after first installation from ONIE
need to delay watchdog control service to go up after hw-mgmt which gets devices up and ready

- How I did it
Delay Nvidia watchdog-control service before hw-mgmt has started on Mellanox platform in order to avoid missing or not ready watchdog device.

- How to verify it
verification test of ONIE installation of image in a loop
making sure watchdog service is always up (not failed) after first installation from ONIE
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants