Skip to content

[action] [PR:17536] fix: handle UnpicklingError in cache read#17584

Merged
mssonicbld merged 1 commit intosonic-net:202411from
mssonicbld:cherry/202411/17536
Mar 20, 2025
Merged

[action] [PR:17536] fix: handle UnpicklingError in cache read#17584
mssonicbld merged 1 commit intosonic-net:202411from
mssonicbld:cherry/202411/17536

Conversation

@mssonicbld
Copy link
Collaborator

Description of PR

Handle UnpicklingError edge case in FactsCache::read() when parallel run is enabled.

Summary:
Fixes # (issue) Microsoft ADO 31839137

Type of change

  • Bug fix
  • Testbed and Framework(new/improvement)
  • New Test case
  • Skipped for non-supported platforms
  • Test case improvement

Back port request

  • 202012
  • 202205
  • 202305
  • 202311
  • 202405
  • 202411

Approach

What is the motivation for this PR?

When parallel run is enabled, multiple processes may try to read/write the same cache file, so there will be a chance that a file is being read by multiple processes at the same time, causing UnpicklingError in some of the processes. Therefore, we decided to retry to read the file after a short random sleep. If we still get the same error after retrying, we will return NOTEXIST to overwrite the file.

How did you do it?

How did you verify/test it?

Any platform specific information?

Supported testbed topology if it's a new test case?

Documentation

Description of PR
Handle UnpicklingError edge case in FactsCache::read() when parallel run is enabled.

Summary:
Fixes # (issue) Microsoft ADO 31839137

Approach
What is the motivation for this PR?
When parallel run is enabled, multiple processes may try to read/write the same cache file, so there will be a chance that a file is being read by multiple processes at the same time, causing UnpicklingError in some of the processes. Therefore, we decided to retry to read the file after a short random sleep. If we still get the same error after retrying, we will return NOTEXIST to overwrite the file.

co-authorized by: jianquanye@microsoft.com
@mssonicbld
Copy link
Collaborator Author

Original PR: #17536

@mssonicbld
Copy link
Collaborator Author

/azp run

@azure-pipelines
Copy link

Azure Pipelines successfully started running 1 pipeline(s).

@mssonicbld mssonicbld merged commit 6eb330f into sonic-net:202411 Mar 20, 2025
14 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants