Add loganalyzer ignore regex for GCU #5391
wangxin
left a comment
Totally disabling loganalyzer is not good practice: real issues could be hidden this way. It is better to add the message patterns to the ignore list.
Changed to an ignore regex. I will fix the failures in a future PR; for now, the ignored_regex entries are added so that the PR check passes.
    ".*ERR kernel.*Reset adapter.*", # test_portchannel_interface replace mtu
    ".*ERR swss[0-9]*#orchagent: :- getPortOperSpeed.*", # test_portchannel_interface replace mtu
    ".*ERR.*Failed to apply Json change.*", # validator need updater submodule
    ".*ERR GenericConfigUpdater: Change Applier: service invoked.*", # validator need updater submodule
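To make the effect of these entries concrete, here is a minimal sketch of how such an ignore list can filter syslog lines. The four patterns are copied from the diff above; the function name `filter_unexpected` and the sample syslog lines are hypothetical and only serve the illustration. It does not claim to reproduce how LogAnalyzer itself applies the patterns. Because every pattern starts with `.*`, matching from the start of the line is sufficient here.

```python
import re

# Patterns copied from the ignore list added in this PR.
GCU_IGNORE_REGEX = [
    r".*ERR kernel.*Reset adapter.*",
    r".*ERR swss[0-9]*#orchagent: :- getPortOperSpeed.*",
    r".*ERR.*Failed to apply Json change.*",
    r".*ERR GenericConfigUpdater: Change Applier: service invoked.*",
]


def filter_unexpected(lines, ignore_patterns):
    """Return the lines that match none of the ignore patterns."""
    compiled = [re.compile(p) for p in ignore_patterns]
    return [ln for ln in lines if not any(c.match(ln) for c in compiled)]


# Hypothetical syslog lines, made up for illustration only.
sample = [
    "Apr  1 00:00:01 sonic ERR kernel: [  12.3] e1000: Reset adapter",
    "Apr  1 00:00:02 sonic ERR swss#orchagent: :- getPortOperSpeed: Failed to get speed",
    "Apr  1 00:00:03 sonic ERR bgp#bgpd: something unrelated went wrong",
]

# Only the line that no pattern covers is left over.
unexpected = filter_unexpected(sample, GCU_IGNORE_REGEX)
```

Note that the third pattern, `.*ERR.*Failed to apply Json change.*`, is broad: it would also hide a genuine apply failure, which is the risk raised in the review comment above.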
Are all of these errors triggered only during GCU testing? If they can also be observed in other tests, they need to be added to the common ignore list. Otherwise, this change only "fixes" the GCU testing.
These errors occur only in the GCU tests.
… files (#5193)" (#5433)
This reverts commit 03cccf7. Reverts #5193

After this fix was merged, PR test keeps failing because of errors in syslog. We spent some effort trying to temporarily ignore the errors. However, the list seems endless. Please refer to:
* [loganalyzer]add log patterns to the common ignore #5411
* Add loganalyzer ignore regex for GCU #5391

We need a way to temporarily unblock PR testing. Let's revert this fix for now. Then I'll submit another PR to fix the loganalyzer issue together with a complete ignore list.
What is the motivation for this PR?
Loganalyzer was broken in PR #3235. The issue is that the common config was loaded in the subprocess used for adding marks to syslog. After the subprocess exited, the common config was lost. PR #5193 tried to fix this issue. However, because many new error logs had sneaked in while log analyzer was not working, PR testing started to fail on these error logs after PR #5193 was merged.

PR #5391 and #5411 tried to work around the failures to unblock PR testing. PR #5391 addressed the GCU-related error logs and was merged. PR #5411 tried to add other error logs to the common ignore list, but the effort took too long because the ignore list seemed endless. To unblock PR testing as soon as possible, the original fix #5193 was reverted in #5433. This PR tries to complete the work left over from #5411 and #5433.

How did you do it?
Changes:
* Fix the issue of the log analyzer common config not being loaded.
* Temporarily add error logs to the common ignore list.
* Improve the logging of log analyzer and parallel_run.
* PR testing t0_part2 takes much more time than t0_part1 after the GCU test scripts are added. This change rebalances t0 part1 and part2 testing by moving some of the tests from part2 to part1.
* Sort the PR testing scripts in alphabetical order.

How did you verify/test it?
Ran a few test scripts with log analyzer enabled on a KVM testbed.

Signed-off-by: Xin Wang <[email protected]>
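The root cause described in this commit message (config loaded in a subprocess and lost once it exits) can be reproduced in a few lines. The following is a minimal sketch, not the actual sonic-mgmt code: the `Analyzer` class, `load_common_config`, and `init_in_child` are hypothetical stand-ins, and the `fork` start method assumes a Unix-like host.

```python
import multiprocessing


class Analyzer:
    """Hypothetical stand-in for a log analyzer object; not the real class."""

    def __init__(self):
        self.ignore_regex = []

    def load_common_config(self):
        # Pretend this reads the common ignore file.
        self.ignore_regex.append(r".*ERR example.*")


def init_in_child(analyzer, queue):
    # Runs in the child process: the config is loaded into the child's copy.
    analyzer.load_common_config()
    queue.put(len(analyzer.ignore_regex))


def run_demo():
    ctx = multiprocessing.get_context("fork")  # assumption: Unix-like host
    analyzer = Analyzer()
    queue = ctx.Queue()
    proc = ctx.Process(target=init_in_child, args=(analyzer, queue))
    proc.start()
    child_count = queue.get(timeout=30)
    proc.join()
    # The parent's object was never modified by the child.
    return child_count, len(analyzer.ignore_regex)


child_count, parent_count = run_demo()
```

The child sees one loaded pattern while the parent still sees none, because each process works on its own copy of the object. Loading the config in the parent process, or passing the loaded state back explicitly, avoids the loss.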
Summary: Add LogAnalyzer ignored regex for GCU, because LogAnalyzer reports failures for incorrect orderings that GCU discards.

What is the motivation for this PR?
The recent LogAnalyzer fix reports failures for orderings that GCU discards while applying a jsonpatch. However, GCU does not actually fail, as it explores a correct ordering and applies it. Some of the ignored regexes cover real issues; these will be fixed in a future PR. For now they are added so that the PR check passes.

How did you do it?
Add LogAnalyzer ignored regex for GCU tests.

How did you verify/test it?
Run the GCU tests without the --disable_loganalyzer flag and see if they pass.
Description of PR
Summary: Add LogAnalyzer ignored regex for GCU, because LogAnalyzer reports failures for incorrect orderings that GCU discards.
Fixes # (issue)
Type of change
Back port request
Approach
What is the motivation for this PR?
The recent LogAnalyzer fix reports failures for orderings that GCU discards while applying a jsonpatch. However, GCU does not actually fail, as it explores a correct ordering and applies it.
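As a rough illustration of why error logs can appear even though the overall update succeeds, consider the toy model below. It is not GCU's actual patch sorter: the functions `apply_in_order` and `apply_with_reordering`, the operation tuples, and the logged message are all hypothetical. It only shows that rejected candidate orderings can each emit an error line before a valid ordering is found and applied.

```python
import itertools
import logging

logger = logging.getLogger("toy_gcu")


def apply_in_order(config, ops):
    """Apply ops to a copy of config; raise ValueError if a step is invalid."""
    result = dict(config)
    for key, value, requires in ops:
        if requires is not None and requires not in result:
            raise ValueError(f"{key} requires {requires}")
        result[key] = value
    return result


def apply_with_reordering(config, ops):
    """Try candidate orderings until one applies; count the discarded ones."""
    discarded = 0
    for candidate in itertools.permutations(ops):
        try:
            return apply_in_order(config, candidate), discarded
        except ValueError as exc:
            # This is the kind of line a log analyzer would flag,
            # even though a later ordering succeeds.
            logger.error("Failed to apply change: %s", exc)
            discarded += 1
    raise RuntimeError("no valid ordering found")


# Hypothetical operations: (key, value, key that must already exist).
ops = [
    ("PORTCHANNEL_MEMBER", "Ethernet0", "PORTCHANNEL"),
    ("PORTCHANNEL", "PortChannel0001", None),
]
final_config, discarded = apply_with_reordering({}, ops)
```

In this toy run the first ordering is rejected and logged, and the second one is applied, so the final state is correct while the log still contains one error line.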
How did you do it?
Add LogAnalyzer ignored regex for GCU tests.
How did you verify/test it?
Run the GCU tests without the --disable_loganalyzer flag and see if they pass.
Any platform specific information?
Supported testbed topology if it's a new test case?
Documentation