[action] [PR:23342] fix(console): fix intermittent login failures in dut_console tests by mssonicbld · Pull Request #23396 · sonic-net/sonic-mgmt

mssonicbld · 2026-03-28T17:24:16Z

Fix two intermittent failures in dut_console tests: an accumulation-buffer fix for the Password prompt detection race, and a splitlines()[0] fix for reliable TMOUT value extraction in test_idle_timeout.

Description of PR

Summary:

Fix two independent intermittent failures in dut_console tests:

ssh_console_conn.py — login_stage_2() checked re.search(pwd_pattern, output) where output is only the most recent read_channel() chunk. The DUT's Password: prompt can arrive split across multiple TCP reads (e.g. Pa + ssword:), causing no chunk to match and the password never being sent — resulting in an intermittent "Socket is closed" failure in create_duthost_console (~1 in 5 runs). Fix: check return_msg (accumulated read buffer) instead of output.
test_idle_timeout.py — splitlines()[-1] could return a partial prompt string (e.g. admin@hostname:) instead of the numeric TMOUT value when the prompt was not fully stripped from the command output. Fix: use splitlines()[0] to always read the first output line, which is always the numeric value.
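The first failure above can be reproduced in isolation. The following is a minimal sketch, not the sonic-mgmt code: `PWD_PATTERN`, `matches_per_chunk`, and `matches_accumulated` are hypothetical names, and the real `pwd_pattern` may differ. It only shows why a prompt split as `Pa` + `ssword:` is missed when each chunk is tested alone but found once the chunks are accumulated.

```python
import re

# Assumed pattern for illustration only; the real pwd_pattern may differ.
PWD_PATTERN = r"assword"


def matches_per_chunk(chunks, pattern=PWD_PATTERN):
    """Old behaviour: test each read chunk on its own."""
    return any(re.search(pattern, chunk) for chunk in chunks)


def matches_accumulated(chunks, pattern=PWD_PATTERN):
    """New behaviour: test the running buffer after every read."""
    return_msg = ""
    for chunk in chunks:
        return_msg += chunk
        if re.search(pattern, return_msg):
            return True
    return False


# The prompt arrives in two reads, as described in the summary.
split_prompt = ["login: admin\r\nPa", "ssword: "]
# The prompt arrives in a single read.
whole_prompt = ["login: admin\r\nPassword: "]

print(matches_per_chunk(split_prompt))    # neither chunk contains the pattern
print(matches_accumulated(split_prompt))  # the joined buffer does
```

When the prompt arrives whole, both strategies agree, which is consistent with the failure being intermittent rather than constant.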

Both fixes were validated on internal branch dev/xuliping/20260325_202511_console_login_fix across 5 full test runs with no failures.

Related: follows up on #23295 (blank Enter fix in the same login path — already merged).

Type of change

Back port request

Approach

What is the motivation for this PR?

dut_console tests were failing intermittently (~1 in 5 runs) with "Socket is closed" errors during console login. Root cause: the DUT's Password: prompt sometimes arrives split across multiple TCP reads, so matching the pattern against each chunk separately never succeeds and the password is never sent. A second, independent failure in test_idle_timeout was caused by splitlines()[-1] returning a prompt fragment instead of the numeric TMOUT value.

How did you do it?

ssh_console_conn.py: Changed re.search(pwd_pattern, output) to re.search(pwd_pattern, return_msg) in login_stage_2(), where return_msg is the accumulated read buffer across all chunks.
test_idle_timeout.py: Changed splitlines()[-1] to splitlines()[0] in the TMOUT value extraction, so we always get the first output line regardless of trailing prompt remnants.
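The TMOUT extraction change can be illustrated the same way. This is a sketch under assumptions: `parse_tmout` is a hypothetical helper, the sample strings are made up, and it relies on the PR's statement that the first output line is the numeric value.

```python
def parse_tmout(raw_output):
    """Return the TMOUT value taken from the first line of the output.

    Raises IndexError on empty output and ValueError if the first
    line is not an integer.
    """
    lines = raw_output.strip().splitlines()
    return int(lines[0].strip())


# Output with the prompt fully stripped.
clean = "900\r\n"
# Output where a prompt remnant trails the value.
with_prompt = "900\r\nadmin@hostname:"

print(parse_tmout(clean))
print(parse_tmout(with_prompt))
# The last line is the prompt remnant, which is what [-1] used to pick up.
print(with_prompt.splitlines()[-1])
```

Indexing from the front makes the result independent of whatever follows the value, at the cost of assuming nothing (such as a command echo) precedes it.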

How did you verify/test it?

Ran all dut_console test cases on a physical testbed using internal branch dev/xuliping/20260325_202511_console_login_fix for 5 full iterations. All tests passed with no failures.
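The physical-testbed runs above are the verification the PR reports. As a complementary idea only, the split-prompt race could also be exercised offline with a fake channel. Everything in this sketch is hypothetical (`FakeChannel`, `login`, the `use_accumulated` switch, and the pattern); it does not reflect the actual structure of `login_stage_2()`.

```python
import re


class FakeChannel:
    """Hypothetical stand-in for a console channel that replays fixed chunks."""

    def __init__(self, chunks):
        self._chunks = list(chunks)
        self.sent = []

    def read_channel(self):
        # Return the next chunk, or an empty string once exhausted.
        return self._chunks.pop(0) if self._chunks else ""

    def write_channel(self, data):
        self.sent.append(data)


def login(channel, password, pwd_pattern=r"assword",
          use_accumulated=True, max_reads=10):
    """Send the password once the prompt is seen; return True if it was sent."""
    return_msg = ""
    for _ in range(max_reads):
        output = channel.read_channel()
        return_msg += output
        # Fixed behaviour checks the accumulated buffer; old behaviour the chunk.
        haystack = return_msg if use_accumulated else output
        if re.search(pwd_pattern, haystack):
            channel.write_channel(password + "\n")
            return True
    return False


fixed = FakeChannel(["Pa", "ssword: "])
old = FakeChannel(["Pa", "ssword: "])
fixed_result = login(fixed, "secret")
old_result = login(old, "secret", use_accumulated=False)
print(fixed_result, fixed.sent)
print(old_result, old.sent)
```

Replaying a deterministic split makes the failure reproducible on every run, rather than roughly one run in five.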

Any platform specific information?

None — applies to all platforms using SSH console connections.

Supported testbed topology if it's a new test case?

N/A (bug fix only)

Documentation

N/A


mssonicbld · 2026-03-28T17:24:21Z

Original PR: #23342

mssonicbld · 2026-03-28T17:24:25Z

/azp run

azure-pipelines · 2026-03-28T17:24:38Z

Azure Pipelines successfully started running 1 pipeline(s).

lolyu · 2026-03-30T01:31:24Z

/azp run

azure-pipelines · 2026-03-30T01:31:38Z

Azure Pipelines successfully started running 1 pipeline(s).

…onic-net#23342) (sonic-net#23396)

Signed-off-by: mssonicbld <sonicbld@microsoft.com>
Co-authored-by: Liping Xu <108326363+lipxu@users.noreply.github.com>

mssonicbld requested review from YatishSVC, bingwang-ms, matthew-soulsby, yanmo96 and yutongzhang-microsoft as code owners March 28, 2026 17:24

mssonicbld added the automerge label Mar 28, 2026

mssonicbld mentioned this pull request Mar 28, 2026

fix(console): fix intermittent login failures in dut_console tests #23342

Merged

12 tasks

mssonicbld merged commit db48e3f into sonic-net:202511 Mar 30, 2026
16 checks passed


mssonicbld merged 1 commit into sonic-net:202511 from mssonicbld:cherry/202511/23342

3 participants
