Skip to content

[Mellanox][Smartswitch] Add no_wait option for dpu reboot and add platform information parsing #20943

Merged
liat-grozovik merged 9 commits intosonic-net:masterfrom
gpunathilell:no_wait_pl
Dec 4, 2024
Merged

[Mellanox][Smartswitch] Add no_wait option for dpu reboot and add platform information parsing #20943
liat-grozovik merged 9 commits intosonic-net:masterfrom
gpunathilell:no_wait_pl

Conversation

@gpunathilell
Copy link
Contributor

Why I did it

Changes for dpuctlplat.py:

  • Added option to invoke systemctl rshim start/stop from the pmon container (Using dbus)
  • Added no_wait option for reboot (Since we do not need to wait for the dpu to be ready if NPU+DPU reboot is ongoing)
  • Added platform JSON parsing for rshim and pcie information
Work item tracking
  • Microsoft ADO (number only):

How I did it

Changed dpuctlplat.py to support systemctl commands from pmon container using the dbus-send command

How to verify it

Which release branch to backport (provide reason below if selected)

  • 201811
  • 201911
  • 202006
  • 202012
  • 202106
  • 202111
  • 202205
  • 202211
  • 202305

Tested branch (Please provide the tested image version)

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

finally:
os.close(fd)

def get_pci_dev_path(self):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@gpunathilell any specific reason why this PCI remove could not be done in common code? @vvolam fyi

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we discussed this at the meeting. We have an extra dependency on the rshim here. Before removing PCI we need to stop rshim. After rescaning the PCI we need to start the rshim. Common code can't cover this.

@liat-grozovik liat-grozovik merged commit 3cc4c11 into sonic-net:master Dec 4, 2024
VladimirKuk pushed a commit to Marvell-switching/sonic-buildimage that referenced this pull request Jan 21, 2025
…tform information parsing (sonic-net#20943)

- Why I did it
Changes for dpuctlplat.py:
1. Added option to invoke systemctl rshim start/stop from the pmon container (Using dbus)
2. Added no_wait option for reboot (Since we do not need to wait for the dpu to be ready if NPU+DPU reboot is ongoing)
3. Added platform JSON parsing for rshim and pcie information

- How I did it
Changed dpuctlplat.py to support systemctl commands from pmon container using the dbus-send command
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants