Skip to content

[platform][arista] Fix NVMe sensor chip address in sensors.conf for 7060X6-64PE-B#22639

Merged
StormLiangMS merged 2 commits intosonic-net:masterfrom
vvolam:update-sensors
May 27, 2025
Merged

[platform][arista] Fix NVMe sensor chip address in sensors.conf for 7060X6-64PE-B#22639
StormLiangMS merged 2 commits intosonic-net:masterfrom
vvolam:update-sensors

Conversation

@vvolam
Copy link
Contributor

@vvolam vvolam commented May 16, 2025

Why I did it

The sensors.conf file was referencing a non-existent NVMe PCI address (nvme-pci-0500) on the Arista-7060X6-64PE-B platform. This mismatch caused pmon#sensord to report repeated I/O errors while attempting to read sensor data for a non-existent device (nvme/#7). Updating the config to use the correct PCI address (nvme-pci-0400) resolves the issue.

Work item tracking
  • Microsoft ADO (number only): 32849896

How I did it

Modified sensors.conf to change the chip identifier from nvme-pci-0500 to nvme-pci-0400 to match the actual hardware PCI bus location.

How to verify it

  • Verified that the /dev/nvme* devices are present and functional
  • Confirmed correct PCI ID using lspci
$ show plat sum
Platform: x86_64-arista_7060x6_64pe_b
HwSKU: Arista-7060X6-64PE-B-C512S2
ASIC: broadcom
ASIC Count: 1
Serial Number: XXXXXXXX
Model Number: DCS-7060X6-64PE-B
Hardware Revision: 02.00
$ lspci -nn | grep -i nvme
04:00.0 Non-Volatile memory controller [0108]: Phison Electronics Corporation E18 PCIe4 NVMe Controller [1987:5018] (rev 01)
  • Edited sensors.conf and restarted pmon (systemctl restart pmon)
  • Monitored logs to ensure pmon#sensord no longer reports I/O errors for nvme/#7

Which release branch to backport (provide reason below if selected)

  • 201811
  • 201911
  • 202006
  • 202012
  • 202106
  • 202111
  • 202205
  • 202211
  • 202305
  • 202412

Tested branch (Please provide the tested image version)

  • [SONiC.20241211.16 ]

Description for the changelog

Fix sensors.conf NVMe chip config for Arista-7060X6-64PE-B to match actual PCI address and prevent pmon sensor read errors

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

@mssonicbld
Copy link
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link

Azure Pipelines successfully started running 1 pipeline(s).

@vvolam vvolam requested a review from r12f May 16, 2025 22:02
@vvolam
Copy link
Contributor Author

vvolam commented May 16, 2025

@rick-arista could you review the PR

@mssonicbld
Copy link
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link

Azure Pipelines successfully started running 1 pipeline(s).

@r12f
Copy link
Contributor

r12f commented May 19, 2025

@sdszhang for viz.

Copy link
Contributor

@judyjoseph judyjoseph left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Lgtm

@judyjoseph
Copy link
Contributor

/azp run

@azure-pipelines
Copy link

Azure Pipelines successfully started running 1 pipeline(s).

@r12f
Copy link
Contributor

r12f commented May 20, 2025

/azp run

@azure-pipelines
Copy link

Commenter does not have sufficient privileges for PR 22639 in repo sonic-net/sonic-buildimage

@vvolam
Copy link
Contributor Author

vvolam commented May 20, 2025

/azpw run

@mssonicbld
Copy link
Collaborator

/AzurePipelines run

@azure-pipelines
Copy link

Azure Pipelines successfully started running 1 pipeline(s).

@mssonicbld
Copy link
Collaborator

Cherry-pick PR to msft-202412: Azure/sonic-buildimage-msft#1165

@mssonicbld
Copy link
Collaborator

Cherry-pick PR to 202505: #22781

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

9 participants