Skip to content

Conversation

@filbranden
Copy link

This is a backport of opencontainers/runc#1375 and it fixes kubernetes/kubernetes#61474 (comment) but please note there were some follow ups to that commit in upstream runc, not sure if those should be considered here as well...

Signed-off-by: Qiang Huang [email protected]

(cherry-picked from commit 8430cc4)
Signed-off-by: Filipe Brandenburger [email protected]

Signed-off-by: Qiang Huang <[email protected]>

(cherry-picked from commit 8430cc4)
Signed-off-by: Filipe Brandenburger <[email protected]>
@filbranden
Copy link
Author

/cc @adelton

@vfreex
Copy link

vfreex commented Jun 12, 2018

Because of this issue, OpenShift/Kubernetes nodes can't be run on latest Fedora.
This is definitely a high priority bug. I am surprised this issue doesn't get much attention.

@dustymabe
Copy link

@runcom
Copy link
Collaborator

runcom commented Jun 12, 2018

@mrunalp PTAL

@filbranden
Copy link
Author

So this was fixed with 18eb957 which reverts the backports.

I think you might want to consider adding back the changes around StartTransientUnit, since there seems to be a race condition there. That's opencontainers/runc#1772 and two or three other PRs that precede it. See opencontainers/runc#1683 for the first fix in the series, which discusses the race condition a bit...

This one is no longer relevant, so closing. Thanks!

@filbranden filbranden closed this Jun 12, 2018
@filbranden filbranden deleted the atomic1 branch June 12, 2018 15:43
@runcom
Copy link
Collaborator

runcom commented Jun 12, 2018

So this was fixed with 18eb957 which reverts the backports.

I think you might want to consider adding back the changes around StartTransientUnit, since there seems to be a race condition there. That's opencontainers/runc#1772 and two or three other PRs that precede it. See opencontainers/runc#1683 for the first fix in the series, which discusses the race condition a bit...

This one is no longer relevant, so closing. Thanks!

thanks for the pointer, do you have an ordered list of the fixes that must go in? I can't seem to follow every fix that went in 😕

@runcom
Copy link
Collaborator

runcom commented Jun 12, 2018

@filbranden ^^

@filbranden
Copy link
Author

@runcom If you want to fix the race condition, then you need:

But see also this comment, since it seems the two first PRs were merged here (or in another branch) and later reverted. They introduced bugs that were later fixed by the last two PRs.

You should check with @sjenning, @mrunalp, @vikaschoudhary16 and @derekwaynecarr to see whether these PRs should be backported here or not.

This is about the race with systemd and libcontainer trying to mkdir the directories in the cgroup tree. That's all this patch set fixes.

Cheers,
Filipe

@runcom
Copy link
Collaborator

runcom commented Jun 12, 2018

Thanks a lot!!!

@dustymabe
Copy link

Because of this issue, OpenShift/Kubernetes nodes can't be run on latest Fedora.
This is definitely a high priority bug. I am surprised this issue doesn't get much attention.

@vfreex - can I get you to test/karma https://bodhi.fedoraproject.org/updates/FEDORA-2018-c2e93d5623 ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants