fix: mount retry and logging #3157

z63d · 2025-04-30T07:01:37Z

Description

Fixes mount_into_container handling.
Also, since the function name seems to have changed in #378, I adjusted the test function name to match.

Type of Change

Bug fix (non-breaking change that fixes an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Refactoring (no functional changes)
Performance improvement
Test updates
CI/CD related changes
Other (please describe):

Testing

Added new unit tests
Added new integration tests
Ran existing test suite
Tested manually (please provide steps)

Related Issues

#3132

Additional Context

Copilot

Pull Request Overview

This PR fixes the mount_into_container functionality by adding a retry loop for specific mount errors and updating the test function name to match the new behavior.

Implements a retry mechanism for mount failures due to EINVAL or EBUSY errors.
Renames the test function from test_mount_to_container to test_mount_into_container.

Comments suppressed due to low confidence (1)

crates/libcontainer/src/rootfs/mount.rs:567

Consider adding a logging statement when the mount fails after exhausting retries to provide clearer context on the final error.

return Err(err.into());

utam0k

May I ask you to consider adding a unit test?

utam0k · 2025-04-30T11:13:43Z

crates/libcontainer/src/rootfs/mount.rs

+        let max_retries: u32 = 3;
+        let mut retries = 0;
+        loop {


I'd prefer not to use an infinite loop. Please use a for loop instead, at least.

utam0k · 2025-04-30T11:14:56Z

crates/libcontainer/src/rootfs/mount.rs

+                            && retries < max_retries
+                        {
+                            retries += 1;
+                            std::thread::sleep(std::time::Duration::from_millis(100));


Is 100ms reasonable?

runc seems to set it to 100ms when doing the same thing.
https://github.com/opencontainers/runc/blob/8d90e3dba696ac787ee64de4445517ddf1063b04/libcontainer/rootfs_linux.go#L1232
But I don't know if this is reasonable 🤔

Could you add a comment regarding where 100ms came from?

For unit tests, we don't want to wait 100ms, so please use cfg.

utam0k · 2025-05-01T12:12:41Z

crates/libcontainer/src/rootfs/mount.rs


-        if let Err(err) =
-            self.syscall
+        let max_attempts: u32 = 3;


Please make this a const.

Copilot

Pull Request Overview

This PR fixes an issue with mount handling by adding retries for mounting and updating the associated test function name to reflect a recent rename.

Introduces a constant for maximum mount attempts and wraps the mount syscall in a retry loop.
Updates tests to simulate transient mount failures and verify successful retries or eventual failure.

Comments suppressed due to low confidence (1)

crates/libcontainer/src/rootfs/mount.rs:559

Clarify the rationale for treating EINVAL as a transient error eligible for retry, since EINVAL typically indicates an invalid argument. Consider adding a comment to explain why retrying on EINVAL is safe in this context.

if (matches!(errno, Errno::EINVAL) || matches!(errno, Errno::EBUSY)) && attempt < MAX_MOUNT_ATTEMPTS - 1
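The condition quoted above can be expressed with a single or-pattern, which also leaves an obvious place for the explanatory comment Copilot asks for. The following is a minimal sketch, not the code from this PR: the Errno enum is a hypothetical local stand-in for the errno type used in the diff, should_retry is an invented helper name, and the value of MAX_MOUNT_ATTEMPTS is assumed from the earlier max_attempts of 3.

```rust
/// Hypothetical local stand-in for the errno type used in the diff.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Errno {
    EINVAL,
    EBUSY,
    ENOENT,
}

/// Assumed value; the thread starts from three attempts.
const MAX_MOUNT_ATTEMPTS: u32 = 3;

/// Returns true when a failed attempt (0-based index) should be retried.
fn should_retry(errno: Errno, attempt: u32) -> bool {
    // One or-pattern replaces the two chained matches! calls.
    // A comment explaining why EINVAL counts as transient would go here.
    let retryable = matches!(errno, Errno::EINVAL | Errno::EBUSY);
    // `attempt + 1 < MAX` is the same bound as `attempt < MAX - 1`,
    // but it cannot underflow if the constant were ever set to 0.
    retryable && attempt + 1 < MAX_MOUNT_ATTEMPTS
}
```

Keeping the predicate in one small function means the retry bound and the list of retryable errors can be tested without performing a real mount.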

utam0k

Please add use declarations where appropriate.

utam0k · 2025-05-01T12:36:54Z

crates/libcontainer/src/rootfs/mount.rs


-        if let Err(err) =
-            self.syscall
+        for attempt in 0..MAX_MOUNT_ATTEMPTS {


The retry logic itself is not the concern of this function. Why not extract it into utils or a similar module?

fn retry<F, T, E, P>(
    mut op: F,
    attempts: usize,
    delay: Duration,
    policy: P,
) -> Result<T, E>
where
    F: FnMut() -> Result<T, E>,
    P: Fn(&E) -> bool,
{
    // Maybe "for" is easier to understand.
    fn go<F, T, E, P>(
        mut op: F,
        remaining: usize,
        delay: Duration,
        policy: &P,
    ) -> Result<T, E>
    where
        F: FnMut() -> Result<T, E>,
        P: Fn(&E) -> bool,
    {
        op().or_else(|err| {
            if remaining > 1 && policy(&err) {
                std::thread::sleep(delay);
                go(op, remaining - 1, delay, policy)
            } else {
                Err(err)
            }
        })
    }
    go(&mut op, attempts, delay, &policy)
}
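The reviewer's own aside suggests that a for loop may be easier to follow than the recursive helper. Below is a minimal sketch of that iterative variant, keeping the same signature as the suggestion above. It is an illustration only, not necessarily the version that was merged into utils.rs.

```rust
use std::thread::sleep;
use std::time::Duration;

/// Runs `op` up to `attempts` times, sleeping `delay` before each retry.
/// `policy` decides whether a given error is worth retrying.
fn retry<F, T, E, P>(mut op: F, attempts: usize, delay: Duration, policy: P) -> Result<T, E>
where
    F: FnMut() -> Result<T, E>,
    P: Fn(&E) -> bool,
{
    // The first attempt always runs, even when `attempts` is 0 or 1,
    // which matches the behavior of the recursive suggestion.
    let mut result = op();
    for _ in 1..attempts {
        // Stop early on success or on an error the policy rejects.
        let should_retry = matches!(&result, Err(err) if policy(err));
        if !should_retry {
            break;
        }
        sleep(delay);
        result = op();
    }
    result
}
```

A caller would pass the mount call as the closure and a predicate on the error as the policy, so the decision about which errors count as transient stays at the call site rather than inside the helper.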

YJDoc2 · 2025-05-07T12:32:51Z

@z63d CI is failing, please take a look.

Copilot

Pull Request Overview

This PR fixes mount retry and logging issues for container mounts by introducing a generic retry function and refining the mount error handling logic.

Introduces a generic retry function in utils.rs to encapsulate retry logic.
Updates mount error handling in mount.rs to differentiate between EINVAL and EBUSY errors using retry.
Renames the test function to match the updated function name.

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File	Description
crates/libcontainer/src/utils.rs	Added a generic retry function with delay and policy.
crates/libcontainer/src/rootfs/mount.rs	Updated mount error handling with retry logic and renamed test function.

Comments suppressed due to low confidence (1)

crates/libcontainer/src/rootfs/mount.rs:558

[nitpick] The use of 'mount_option_config.data' here differs from the use of 'Some(&*d)' in the retry branch. If both represent the same mount data, consider unifying the usage for consistency.

self.syscall.mount(
    Some(&*src),
    dest,
    typ,
    mount_option_config.flags,
    Some(&mount_option_config.data),

utam0k · 2025-05-09T11:32:09Z

@z63d It seems a formatting error occurred.

Copilot

Pull Request Overview

This pull request addresses the mount handling issues by introducing a generic retry function and updating the error handling logic for mount operations. Key changes include:

Adding a retry function in utils.rs to allow configurable retry logic.
Modifying mount_into_container in mount.rs to handle EINVAL and EBUSY errors differently with retry support.
Renaming the test function to align with the updated function name.

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File	Description
crates/libcontainer/src/utils.rs	Added a generic retry function with delay and policy support.
crates/libcontainer/src/rootfs/mount.rs	Updated mount error handling using the new retry function and adjusted test names.

utam0k · 2025-05-19T11:33:18Z

crates/libcontainer/src/rootfs/mount.rs

+                    };
+                    // runc has a retry interval of 100ms. We are following this.
+                    // https://github.com/opencontainers/runc/blob/v1.3.0/libcontainer/rootfs_linux.go#L1235
+                    let delay = Duration::from_millis(100);


nit: Please make 100ms a constant.

utam0k

One nit, otherwise looks good.

YJDoc2 · 2025-05-19T12:21:34Z

crates/libcontainer/src/utils.rs

+pub fn retry<F, T, E, P>(
+    mut op: F,
+    attempts: u32,
+    #[cfg_attr(test, allow(unused_variables))] delay: Duration,


Hey, I think it should be OK to keep the delay in tests as well. With a 100 ms delay and 2 retries after the first attempt, each test would take at most 200 ms; even with 10 tests the total delay would be about 2 seconds, which I think is acceptable.

That is certainly true.
I'd like to hear @utam0k's opinion.
#3157 (comment)

OK, I didn't see that comment. @utam0k I personally feel that waiting 100 ms should be fine. If that is not preferable, I'd suggest that instead of changing the fn signature, we use cfg to redefine the const with a different value, i.e.

#[cfg(not(test))]
const MOUNT_RETRY_DELAY_MS: u64 = 100;
#[cfg(test)]
const MOUNT_RETRY_DELAY_MS: u64 = 1;

I think this would be cleaner than using cfg in fn signature.

My concern is that if this function is used in many places, the waiting time will increase. However, since this is not the case now, it is acceptable to just leave the comment and make it const.
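For reference, the cfg-based alternative discussed in this thread could be wired into the retry call roughly as follows. This is a sketch under assumptions: mount_retry_delay is a hypothetical helper name, and the thread concludes that a plain constant with a comment is enough for now, so the two cfg branches are shown only to illustrate the option.

```rust
use std::time::Duration;

// Normal builds wait 100ms between attempts, the interval runc uses.
#[cfg(not(test))]
const MOUNT_RETRY_DELAY_MS: u64 = 100;
// Under `cargo test` the wait shrinks so retry tests stay fast.
#[cfg(test)]
const MOUNT_RETRY_DELAY_MS: u64 = 1;

/// Hypothetical helper: the delay handed to the retry function.
fn mount_retry_delay() -> Duration {
    Duration::from_millis(MOUNT_RETRY_DELAY_MS)
}
```

Because only the constant is conditional, the retry function keeps a single signature in both test and non-test builds.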

Signed-off-by: Kaita Nakamura <[email protected]>

z63d changed the title from "fix: fix: mount retry and logging" to "fix: mount retry and logging" Apr 30, 2025

utam0k requested a review from Copilot April 30, 2025 11:09

Copilot AI reviewed Apr 30, 2025

utam0k added the kind/bug label Apr 30, 2025

utam0k requested changes Apr 30, 2025

z63d force-pushed the fix/mount-into-container-retry branch from 9d350fb to d528cfc Compare May 1, 2025 03:00

z63d requested a review from utam0k May 1, 2025 12:01

utam0k reviewed May 1, 2025

utam0k requested a review from Copilot May 1, 2025 12:13

Copilot AI reviewed May 1, 2025

utam0k reviewed May 1, 2025

z63d force-pushed the fix/mount-into-container-retry branch from 1510251 to 1765e24 Compare May 2, 2025 02:31

z63d requested a review from utam0k May 2, 2025 02:32

utam0k requested a review from Copilot May 9, 2025 11:23

Copilot AI reviewed May 9, 2025

z63d force-pushed the fix/mount-into-container-retry branch from 4681d5f to 560853b Compare May 10, 2025 00:29

z63d requested a review from Copilot May 12, 2025 23:41

Copilot AI reviewed May 12, 2025

utam0k reviewed May 19, 2025

utam0k approved these changes May 19, 2025

z63d force-pushed the fix/mount-into-container-retry branch from 560853b to 5f07e0a Compare May 19, 2025 12:10

YJDoc2 reviewed May 19, 2025

z63d force-pushed the fix/mount-into-container-retry branch from 5f07e0a to 251a1c4 Compare May 21, 2025 13:03

z63d added 2 commits May 21, 2025 22:20

fix: test fn name

a1df06b

Signed-off-by: Kaita Nakamura <[email protected]>

fix: mount retry and logging

216f50a

Signed-off-by: Kaita Nakamura <[email protected]>

z63d force-pushed the fix/mount-into-container-retry branch from 251a1c4 to 216f50a Compare May 21, 2025 13:21

z63d requested a review from utam0k May 21, 2025 13:33

z63d requested a review from YJDoc2 May 21, 2025 13:33

YJDoc2 enabled auto-merge (squash) May 22, 2025 05:37

YJDoc2 merged commit 6b02740 into youki-dev:main May 22, 2025
43 of 45 checks passed

github-actions bot mentioned this pull request May 22, 2025

Release for v0.5.4 #3124

Merged

z63d deleted the fix/mount-into-container-retry branch May 22, 2025 07:07
