Re-enable ICMP responder and gratuitous ARP service for active-standby dualtor topologies.#12860
Closed
vivekverma-arista wants to merge 5 commits intosonic-net:masterfrom
Closed
Re-enable ICMP responder and gratuitous ARP service for active-standby dualtor topologies.#12860vivekverma-arista wants to merge 5 commits intosonic-net:masterfrom
vivekverma-arista wants to merge 5 commits intosonic-net:masterfrom
Conversation
… active-standby dualtor
Contributor
Author
|
This has been fixed by modifying the product code therefore closing this pull request: sonic-net/sonic-linkmgrd#250 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Description of PR
Summary:
Fixes #119
Type of change
Back port request
Approach
What is the motivation for this PR?
#221 introduced oscillation logic in active-standby dualtor. These oscillations happen continuously in the testing environment as well because we don't run ICMP responder as it was disabled some time back in #9117.
These continuous oscillations interfere with the testing and has made a lot of traffic tests flaky in active-standby dualtor.
Details can be found in #119
How did you do it?
Re-enabled ICMP responder and gratuitous ARP service in active-standby dualtor topologies.
We have introduced new fixtures
toggle_all_simulator_ports_to_rand_selected_tor_unconditionallyandtoggle_all_simulator_ports_to_enum_rand_one_per_hwsku_frontend_host_unconditionallysimilar toactive-activedualtor to run few tests in active-standby mode where ICMP responder interferes with the testing and we need to pause it.This also affects then following tests which have been fixed
everflow tests: These tests shutdown BGP on the randomly selected ToR and if ICMP responder is running
toggle_all_simulator_ports_to_rand_selected_torfails to toggle the MUX direction towards this ToR because it is now technically unhealthy as it lost routes ( which should be the expected behaviour in the production environment as well ). The fix for this test is very similar to active-active dualtor by using the fixturetoggle_all_simulator_ports_to_rand_selected_tor_unconditionally. If ICMP responder is not running thentoggle_all_simulator_ports_to_rand_selected_torsuccessfully toggles the MUX direction towards the unhealthy ToR which is a testing gap that got introduced due to disabling of ICMP responder.In case of
pfcwd/test_pfcwd_function.pyandarp/test_unknown_mac.py. ICMP responder and GARP interferes with the testing therefore the fix is to selectively pause them for these tests and run them in active-standby mode by using the new fixtures.How did you verify/test it?
Tested on Arista-7260 and Arista-7050 platforms with dualtor and dualtor-120.
Any platform specific information?
Supported testbed topology if it's a new test case?
Documentation