Skip to content

[action] [PR:11738] Memory exhaustion test case takes an indeterminate amount of time to trigger Kernel panic.#12105

Merged
mssonicbld merged 1 commit intosonic-net:202311from
mssonicbld:cherry/202311/11738
Mar 22, 2024
Merged

[action] [PR:11738] Memory exhaustion test case takes an indeterminate amount of time to trigger Kernel panic.#12105
mssonicbld merged 1 commit intosonic-net:202311from
mssonicbld:cherry/202311/11738

Conversation

@mssonicbld
Copy link
Collaborator

Description of PR

Summary:
Fixes # 11737

Type of change

  • [x ] Bug fix
  • Testbed and Framework(new/improvement)
  • Test case(new/improvement)

Back port request

  • 201911
  • 202012
  • 202205
  • 202305
  • 202311

Approach

What is the motivation for this PR?

How did you do it?

It seems the time it takes for the kernel to raise Out of Memory condition and trigger oom_killer is not very deterministic in this test case. Once memory is exhausted in the system, the node becomes very unresponsive as no new processes can be created. Under most cases the test does complete in 10 mts, however for some of the PIDs, the test takes 20, 30 mts or more.

It seems the issue is seen in Linux operation in other scenarios - https://unix.stackexchange.com/questions/373312/oom-killer-doesnt-work-properly-leads-to-a-frozen-os

The solution seems to be to disable the swapping so the kernel raises the OOM condition much faster.

https://askubuntu.com/questions/1188024/how-to-test-oom-killer-from-command-line

How did you verify/test it?

Ran the test case on a number of PIDs.

…trigger Kernel panic. (sonic-net#11738)

* Turned swapping off so kernel catches OOM in a shorter time.
@mssonicbld
Copy link
Collaborator Author

Original PR: #11738

@mssonicbld mssonicbld merged commit a80ced6 into sonic-net:202311 Mar 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants