Skip to content

[Mellanox] read temperature threshold from SDK sysfs for firmware-controlled module#24855

Merged
liat-grozovik merged 9 commits intosonic-net:masterfrom
Junchao-Mellanox:master-retry-threshold
Jan 30, 2026
Merged

[Mellanox] read temperature threshold from SDK sysfs for firmware-controlled module#24855
liat-grozovik merged 9 commits intosonic-net:masterfrom
Junchao-Mellanox:master-retry-threshold

Conversation

@Junchao-Mellanox
Copy link
Copy Markdown
Collaborator

@Junchao-Mellanox Junchao-Mellanox commented Dec 15, 2025

Why I did it

For firmware controlled module, temperature threshold should be read from SDK sysfs to avoid too many I2C access from module.

Work item tracking
  • Microsoft ADO (number only):

How I did it

  1. Read temp threshold from SDK sysfs for fimrware-controlled module
  2. Add a retry mechanism to avoid reading module temperature when it is not ready

How to verify it

unit test
manual test passed on SPC1, SPC3, SPC4 and SPC5
sonic-mgmt regression test passed on all Nvidia platforms

Which release branch to backport (provide reason below if selected)

  • 202205
  • 202211
  • 202305
  • 202311
  • 202405
  • 202411
  • 202505

Tested branch (Please provide the tested image version)

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

@mssonicbld
Copy link
Copy Markdown
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@mssonicbld
Copy link
Copy Markdown
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@mssonicbld
Copy link
Copy Markdown
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@mssonicbld
Copy link
Copy Markdown
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@mssonicbld
Copy link
Copy Markdown
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@mssonicbld
Copy link
Copy Markdown
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@mssonicbld
Copy link
Copy Markdown
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@Junchao-Mellanox
Copy link
Copy Markdown
Collaborator Author

PR for 202511: #24937

@mssonicbld
Copy link
Copy Markdown
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@mssonicbld
Copy link
Copy Markdown
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

liat-grozovik
liat-grozovik previously approved these changes Jan 12, 2026
@mssonicbld
Copy link
Copy Markdown
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@mssonicbld
Copy link
Copy Markdown
Collaborator

/azp run Azure.sonic-buildimage

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@liat-grozovik liat-grozovik merged commit e22967b into sonic-net:master Jan 30, 2026
13 checks passed
xincunli-sonic pushed a commit to xincunli-sonic/sonic-buildimage that referenced this pull request Feb 2, 2026
…trolled module (sonic-net#24855)

- Why I did it
For firmware controlled module, temperature threshold should be read from SDK sysfs to avoid too many I2C access from module.

- How I did it
Read temp threshold from SDK sysfs for fimrware-controlled module
Add a retry mechanism to avoid reading module temperature when it is not ready

- How to verify it
unit test
manual test passed on SPC1, SPC3, SPC4 and SPC5
sonic-mgmt regression test passed on all Nvidia platforms

Signed-off-by: Junchao-Mellanox <[email protected]>
Signed-off-by: Xincun Li <[email protected]>
FengPan-Frank pushed a commit to FengPan-Frank/sonic-buildimage that referenced this pull request Mar 6, 2026
…trolled module (sonic-net#24855)

- Why I did it
For firmware controlled module, temperature threshold should be read from SDK sysfs to avoid too many I2C access from module.

- How I did it
Read temp threshold from SDK sysfs for fimrware-controlled module
Add a retry mechanism to avoid reading module temperature when it is not ready

- How to verify it
unit test
manual test passed on SPC1, SPC3, SPC4 and SPC5
sonic-mgmt regression test passed on all Nvidia platforms

Signed-off-by: Junchao-Mellanox <[email protected]>
Signed-off-by: Feng Pan <[email protected]>
dprital pushed a commit that referenced this pull request Mar 19, 2026
…trolled module (#24855)

- Why I did it
For firmware controlled module, temperature threshold should be read from SDK sysfs to avoid too many I2C access from module.

- How I did it
Read temp threshold from SDK sysfs for fimrware-controlled module
Add a retry mechanism to avoid reading module temperature when it is not ready

- How to verify it
unit test
manual test passed on SPC1, SPC3, SPC4 and SPC5
sonic-mgmt regression test passed on all Nvidia platforms

Signed-off-by: Junchao-Mellanox <[email protected]>
Signed-off-by: dprital <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants