Skip to content

[SmartSwitch] [HA] disable northbound route zmq flag for ha topology #23329

Merged
prsunny merged 1 commit intosonic-net:masterfrom
zjswhhh:zmq
Mar 27, 2026
Merged

[SmartSwitch] [HA] disable northbound route zmq flag for ha topology #23329
prsunny merged 1 commit intosonic-net:masterfrom
zjswhhh:zmq

Conversation

@zjswhhh
Copy link
Contributor

@zjswhhh zjswhhh commented Mar 26, 2026

Description of PR

Summary:
Fixes # (issue)

In smartswitch ha topology, noticed peer loopback0 (npu pa ip) is not pingable if orch_northbond_route_zmq_enabled is set to true - the route failed to install.

This is not a ha specific problem, but it's impacting HA tests as NPU-DPU BFD probing requires loopback reachability. Also, we don't need to enable northbond_route_zmq on NPU config. Hence, adding this change to explicitly disable.

sign-off: Jing Zhang [email protected]

Type of change

  • Bug fix
  • Testbed and Framework(new/improvement)
  • New Test case
    • Skipped for non-supported platforms
  • Test case improvement

Back port request

  • 202205
  • 202305
  • 202311
  • 202405
  • 202411
  • 202505
  • 202511

Approach

What is the motivation for this PR?

To make sure npu - dpu works (BFD), so HA tests can pass.

How did you do it?

In golden config gen, check the topo name, if matching ha, set the flag to false.

How did you verify/test it?

Once the flag is set to false, loopback became reachable.

Any platform specific information?

NA.

Supported testbed topology if it's a new test case?

Na.

Documentation

Signed-off-by: Jing Zhang <[email protected]>
@mssonicbld
Copy link
Collaborator

/azp run

@github-actions github-actions bot requested review from r12f, wangxin and xwjiang-ms March 26, 2026 01:05
@azure-pipelines
Copy link

Azure Pipelines successfully started running 1 pipeline(s).

@zjswhhh zjswhhh added the smartswitch-ha Label for tracking smartswitch HA related changes label Mar 26, 2026
Copy link
Collaborator

@yxieca yxieca left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

AI agent on behalf of Ying. Reviewed; no issues found.

@prsunny
Copy link
Contributor

prsunny commented Mar 26, 2026

@zjswhhh , can you provide details on why this must be disabled and what if its enabled? How is this related to NPU loopback pinging.

@prsunny
Copy link
Contributor

prsunny commented Mar 26, 2026

@prabhataravind to review

@prsunny prsunny requested a review from prabhataravind March 26, 2026 18:10
@zjswhhh
Copy link
Contributor Author

zjswhhh commented Mar 26, 2026

@zjswhhh , can you provide details on why this must be disabled and what if its enabled? How is this related to NPU loopback pinging.

Hi @prsunny - created an issue item to track, details are included: #23328

I don't have the root cause yet, but since it's 100% reproducible, just wanted to add this workaround to unblock test development.

@prabhataravind
Copy link
Contributor

@yue-fred-gao @vivekrnv for viz.

@prabhataravind
Copy link
Contributor

@zjswhhh , can you provide details on why this must be disabled and what if its enabled? How is this related to NPU loopback pinging.

Hi @prsunny - created an issue item to track, details are included: #23328

I don't have the root cause yet, but since it's 100% reproducible, just wanted to add this workaround to unblock test development.

@zjswhhh peer loopback0 shouldn't be any different from other routes. Is the issue seen for all routes?

@zjswhhh
Copy link
Contributor Author

zjswhhh commented Mar 26, 2026

@zjswhhh , can you provide details on why this must be disabled and what if its enabled? How is this related to NPU loopback pinging.

Hi @prsunny - created an issue item to track, details are included: #23328
I don't have the root cause yet, but since it's 100% reproducible, just wanted to add this workaround to unblock test development.

@zjswhhh peer loopback0 shouldn't be any different from other routes. Is the issue seen for all routes?

Only for loopback IP.

if "orch_northbond_route_zmq_enabled" in out:
ori_config_db["DEVICE_METADATA"]["localhost"]["orch_northbond_route_zmq_enabled"] = "true"
if self.topo_name == "t1-smartswitch-ha":
ori_config_db["DEVICE_METADATA"]["localhost"]["orch_northbond_route_zmq_enabled"] = "false"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is this typo - northbond?

@prsunny prsunny merged commit ce53d73 into sonic-net:master Mar 27, 2026
20 checks passed
@prsunny prsunny changed the title disable northbound route zmq flag for ha topology [SmartSwitch] [HA] disable northbound route zmq flag for ha topology Mar 27, 2026
ravaliyel pushed a commit to ravaliyel/sonic-mgmt that referenced this pull request Mar 27, 2026
Description of PR
Summary:
Fixes # (issue)

In smartswitch ha topology, noticed peer loopback0 (npu pa ip) is not pingable if orch_northbond_route_zmq_enabled is set to true - the route failed to install.

This is not a ha specific problem, but it's impacting HA tests as NPU-DPU BFD probing requires loopback reachability. Also, we don't need to enable northbond_route_zmq on NPU config. Hence, adding this change to explicitly disable.

In golden config gen, check the topo name, if matching ha, set the flag to false.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

request for 202511 smartswitch-ha Label for tracking smartswitch HA related changes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants