[vlanmgr][202106]Fix for STATE_DB port check logic#1981

Closed

dgsudharsan wants to merge 2 commits intosonic-net:202106from

dgsudharsan:vlanmgrd_202106

Collaborator

dgsudharsan commented Oct 25, 2021

Same as #1980 for 202106
What I did
Updated checks for PORT entry in STATE_DB in vlanmgrd additionally check for presence of "state" attribute.. This is to add Vlanmgrd check similar to #1936

Why I did it
Prior to recent commits for PORT auto-negotiation, 3 daemons in cfgmgr (portmgrd, teammgrd, and intfmgrd) would not allow configuration to proceed for a specific PORT until portsyncd detected the presence of the kernel device (EthernetN) associated with the PORT and created the associated entry for the PORT in the STATE_DB with attribute "state" and value "ok".
With recent commits for PORT auto-negotiation, this logic is now broken due to creation of PORT entry in the STATE_DB by PortsOrch with only "supported_speed" attribute.

This leads to the issue where vlanmgrd might try to access the port even without it created
Oct 21 07:51:42.121276 arc-switch1025 ERR swss#vlanmgrd: :- main: Runtime error: /bin/bash -c "/sbin/ip link set "Ethernet10" master Bridge && /sbin/bridge vlan del vid 1 dev "Ethernet10" && /sbin/bridge vlan add vid 1000 dev "Ethernet10" pvid untagged" :
Oct 21 07:51:42.122339 arc-switch1025 INFO swss#/supervisord: vlanmgrd Cannot find device "Ethernet10"

How I verified it

Details if related

dgsudharsan added 2 commits

October 25, 2021 04:58


          [vlanmgr]Fix for STATE_DB port check logic

Signed-off-by: Sudharsan Dhamal Gopalarathnam <sudharsand@nvidia.com>


          Adding required headers

5734bfb

Signed-off-by: Sudharsan Dhamal Gopalarathnam <sudharsand@nvidia.com>

dgsudharsan requested a review from prsunny as a code owner

October 25, 2021 14:45

Collaborator Author

dgsudharsan commented Oct 25, 2021

/azpw run

Collaborator

mssonicbld commented Oct 25, 2021

/AzurePipelines run

azure-pipelines bot commented Oct 25, 2021

Azure Pipelines successfully started running 1 pipeline(s).

Collaborator Author

dgsudharsan commented Oct 25, 2021

/azpw run

Collaborator

mssonicbld commented Oct 25, 2021

/AzurePipelines run

azure-pipelines bot commented Oct 25, 2021

Azure Pipelines successfully started running 1 pipeline(s).

Collaborator Author

dgsudharsan commented Oct 26, 2021

/azpw run

Collaborator

mssonicbld commented Oct 26, 2021

/AzurePipelines run

azure-pipelines bot commented Oct 26, 2021

Azure Pipelines successfully started running 1 pipeline(s).

Collaborator Author

dgsudharsan commented Oct 26, 2021

/azpw run

Collaborator

mssonicbld commented Oct 26, 2021

/AzurePipelines run

azure-pipelines bot commented Oct 26, 2021

Azure Pipelines successfully started running 1 pipeline(s).

Collaborator Author

dgsudharsan commented Oct 26, 2021

/azpw run

Collaborator

mssonicbld commented Oct 26, 2021

/AzurePipelines run

azure-pipelines bot commented Oct 26, 2021

Azure Pipelines successfully started running 1 pipeline(s).

Collaborator Author

dgsudharsan commented Oct 27, 2021

/azpw run

Collaborator

mssonicbld commented Oct 27, 2021

/AzurePipelines run

azure-pipelines bot commented Oct 27, 2021

Azure Pipelines successfully started running 1 pipeline(s).

Collaborator Author

dgsudharsan commented Oct 27, 2021

/azpw run

Collaborator

mssonicbld commented Oct 27, 2021

/AzurePipelines run

azure-pipelines bot commented Oct 27, 2021

Azure Pipelines successfully started running 1 pipeline(s).

Collaborator Author

dgsudharsan commented Oct 28, 2021

/azpw run

Collaborator

mssonicbld commented Oct 28, 2021

/AzurePipelines run

azure-pipelines bot commented Oct 28, 2021

Azure Pipelines successfully started running 1 pipeline(s).

dgsudharsan closed this

EdenGri pushed a commit to EdenGri/sonic-swss that referenced this pull request


          [GCU] Loading yang-models only once (sonic-net#1981)

3b642c9

#### What I did
Loading sonic-yang models only once, and re-using them. This makes the sorting a lot faster.

How to verify `loadYangModel` took a lot of time?

Add the following snippet to `TestPatchSorter`
```python
from pstats import Stats
import cProfile
class TestPatchSorter(...):
    def setUp(self):
        """init each test"""
        self.pr = cProfile.Profile()
        self.pr.enable()
        print("\n<<<---")
    def tearDown(self):
        """finish any test"""
        p = Stats (self.pr)
        p.strip_dirs()
        p.sort_stats ('cumtime')
        p.print_stats ()
        print("\n--->>>")
    .
    .
    .
    # Also update verify(self, cc_ops=[], tc_ops=[]) by commenting out changes validation to avoid extra calls to loadYangModels 
    def verify(self, cc_ops=[], tc_ops=[]):
        # Arrange
        config_wrapper=ConfigWrapper()
        target_config=jsonpatch.JsonPatch(tc_ops).apply(Files.CROPPED_CONFIG_DB_AS_JSON)
        current_config=jsonpatch.JsonPatch(cc_ops).apply(Files.CROPPED_CONFIG_DB_AS_JSON)
        patch=jsonpatch.make_patch(current_config, target_config)

        # Act
        actual = self.create_patch_sorter(current_config).sort(patch)

        # Assert
        # simulated_config = current_config
        # for move in actual:
        #     simulated_config = move.apply(simulated_config)
        #     self.assertTrue(config_wrapper.validate_config_db_config(simulated_config))
        # self.assertEqual(target_config, simulated_config)

```
Run
```
> python3 -m unittest patch_sorter_test.TestPatchSorter.test_sort__dpb_1_to_4__success 
.
.
.
         48986582 function calls (48933431 primitive calls) in 104.530 seconds

   Ordered by: cumulative time

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
        1    0.000    0.000  104.530  104.530 case.py:549(_callTestMethod)
        1    0.000    0.000  104.530  104.530 patch_sorter_test.py:1889(test_sort__dpb_1_to_4__success)
        1    0.000    0.000  104.529  104.529 patch_sorter_test.py:1933(verify)
        1    0.005    0.005  104.527  104.527 patch_sorter.py:1332(sort)
     32/1    0.006    0.000  104.077  104.077 patch_sorter.py:955(sort)
      334    0.012    0.000   99.498    0.298 patch_sorter.py:310(validate)
      492    2.140    0.004   95.810    0.195 sonic_yang_ext.py:30(loadYangModel)  <=========
```

From the above we can see profiling data about test_sort__dpb_1_to_4__success:
- Took 104.53s to complete
- loadYangModel was called 492 times
- loadYangModel took 95.810s.

loadYangModel is the method that loads the yang models from memory into SonicYang object. The loading of the YANG models should only happen once.

#### How I did it
Moved all calls to create sonic_yang object to ConfigWrapper, and each call to create a new instance just fills in the data for the yang models.

#### How to verify it
unit-test

Running profiling after the update:
```
> python3 -m unittest patch_sorter_test.TestPatchSorter.test_sort__dpb_1_to_4__success 
.
.
.
         702096 function calls (648951 primitive calls) in 2.882 seconds

   Ordered by: cumulative time

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
        1    0.000    0.000    2.882    2.882 case.py:549(_callTestMethod)
        1    0.000    0.000    2.882    2.882 patch_sorter_test.py:1890(test_sort__dpb_1_to_4__success)
        1    0.002    0.002    2.881    2.881 patch_sorter_test.py:1934(verify)
        1    0.000    0.000    2.874    2.874 patch_sorter.py:1332(sort)
     32/1    0.004    0.000    2.705    2.705 patch_sorter.py:955(sort)
      334    0.008    0.000    2.242    0.007 patch_sorter.py:310(validate)
      490    0.080    0.000    1.791    0.004 sonic_yang_ext.py:1043(loadData)
      332    0.043    0.000    1.655    0.005 patch_sorter.py:345(validate)
      332    0.018    0.000    1.509    0.005 gu_common.py:112(validate_config_db_config)
        .
        .
        .
        1    0.002    0.002    0.164    0.164 sonic_yang_ext.py:30(loadYangModel)
```
From the above we can see profiling data about test_sort__dpb_1_to_4__success:
- Took 2.882s to complete
- loadYangModel was called 1time
- loadYangModel took 0.164s.

[profiling-after.txt](https://github.com/Azure/sonic-utilities/files/7757252/profiling-after.txt)

#### Previous command output (if the output of a command-line utility has changed)

#### New command output (if the output of a command-line utility has changed)

dgsudharsan deleted the vlanmgrd_202106 branch

March 9, 2023 02:00

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet