fix(systemd-run): Switch to use systemd-run instead of direct process and cgroup manipulation by dpoole73 · Pull Request #64 · Azure/applicationhealth-extension-linux

dpoole73 · 2024-04-22T22:05:23Z

Background:

Our tests have been running fine for a long time but suddenly started failing on specific os versions. This was because the process (although initially associated with the correct cgroup that we created) gets moved back to the parent cgroup. This results in the limits being removed.

I did some research and reached out to various people and found that this is something that has previously been seen.

When a process is started with systemd you are not supposed to manage cgroups directly, systemd owns its own hierarchy and can manipulate things within it. Documentation says that you should not modify the cgroups within that slice hierarchy directly but instead you should use systemd-run to launch processes.

The GuestAgent folks saw very similar behavior and switching to systemd-run resolved all their issues.

Changes:

Changed the code to run using systemd-run to launch the vmwatch process. Using the --scope parameter results in the call to wait until the vmwatch process completes.

The process id returned from the call is the actual process id of vmwatch.

I have confirmed that killing vmwatch and killing app health extension still has the same behavior (the PDeathSig integration is working fine) and the aurora tests are working fine with these changes.

NOTE: Because in docker containers, systemd-run is not available, the code falls back to run the process directly and continues to use the old code path in that case. This should also cover and linux distros which don't use systemd where direct cgroup assignment should work fine.

Background: Our tests have been running fine for a long time but suddenly started failing on specific os versions. This was because the process (although initially associated with the correct cgroup that we created) gets moved back to the parent cgroup. This results in the limits being removed. I did some research and reached out to various people and found that this is something that has previously been seen. When a process is started with systemd you are not supposed to manage cgroups directly, systemd owns its own hierarchy and can manipulate things within it. Documentation says that you should not modify the cgroups within that slice hierarchy directly but instead you should use `systemd-run` to launch processes. The GuestAgent folks saw very similar behavior and switching to systemd-run resolved all their issues. Changes: Changed the code to run using `systemd-run` to launch the vmwatch process. Using the `--scope` parameter results in the call to wait until the vmwatch process completes. The process id returned from the call is the actual process id of vmwatch. I have confirmed that killing vmwatch and killing app health extension still has the same behavior (the PDeathSig integration is working fine) and the aurora tests are working fine with these changes. NOTE: Because in docker containers, systemd-run is not available, the code falls back to run the process directly and continues to use the old code path in that case. This should also cover and linux distros which don't use systemd where direct cgroup assignment should work fine.

main/vmWatch.go

klugorosado

Left Comments

frank-pang-msft · 2024-04-24T16:20:47Z

It might be good to move the check of systemd into a method and use it that way, similar to GA, but will leave up to you.

…message can get logged differently

i don't know why this passed before, clearly we kill the process when we fail to assign a cgroup i don't know why it would ever return a different message with this fix test pass locally

klugorosado

LGTM

dpoole73 added 3 commits April 22, 2024 10:02

correct typo

fc430b0

make sure bash files are lf line endings

c864997

zmyzheng approved these changes Apr 22, 2024

View reviewed changes

frank-pang-msft requested changes Apr 23, 2024

View reviewed changes

main/vmWatch.go Outdated Show resolved Hide resolved

feedback

e4c00e9

klugorosado reviewed Apr 24, 2024

View reviewed changes

main/vmWatch.go Outdated Show resolved Hide resolved

klugorosado reviewed Apr 24, 2024

View reviewed changes

main/vmWatch.go Outdated Show resolved Hide resolved

klugorosado suggested changes Apr 24, 2024

View reviewed changes

frank-pang-msft approved these changes Apr 24, 2024

View reviewed changes

dpoole73 added 5 commits April 24, 2024 11:04

feedback

0c9a693

feedback

b30ee91

fix test issue. There seems to be a non-deterministic case where the …

fea2ebd

…message can get logged differently

revert

3e94eb6

correcting the search term

23cb651

i don't know why this passed before, clearly we kill the process when we fail to assign a cgroup i don't know why it would ever return a different message with this fix test pass locally

klugorosado approved these changes Apr 26, 2024

View reviewed changes

dpoole73 merged commit 043d2a2 into feature/v2/bootstrapVMWatch Apr 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(systemd-run): Switch to use systemd-run instead of direct process and cgroup manipulation#64

fix(systemd-run): Switch to use systemd-run instead of direct process and cgroup manipulation#64
dpoole73 merged 9 commits intofeature/v2/bootstrapVMWatchfrom
dev/dpoole/cgroup-using-systemd-run

dpoole73 commented Apr 22, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

klugorosado left a comment

Uh oh!

frank-pang-msft commented Apr 24, 2024

Uh oh!

klugorosado left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

dpoole73 commented Apr 22, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

klugorosado left a comment

Choose a reason for hiding this comment

Uh oh!

frank-pang-msft commented Apr 24, 2024

Uh oh!

klugorosado left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants