Optimize LZ OutWindow.CopyBlock by jdpurcell · Pull Request #907 · adamhathcock/sharpcompress

jdpurcell · 2025-03-19T03:49:27Z

After a bit of profiling, this function seemed like a good candidate for optimization. Testing on a large-ish archive (412 MB), I'm seeing extraction times like:

| CPU | Before | After | % of Original |
|---|---|---|---|
| Apple M3 | 31.1 s | 26.8 s | 86% |
| Core i7-6700k | 37.2 s | 33.8 s | 91% |
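
For anyone wanting to compare their own runs the same way, the "% of Original" column is just the after time divided by the before time. A minimal sketch in Python (not part of the PR; the variable names are made up for illustration), using the timings from the table:

```python
# Extraction times in seconds, copied from the table above: (before, after).
timings = {
    "Apple M3": (31.1, 26.8),
    "Core i7-6700k": (37.2, 33.8),
}

# "% of Original" = after / before, rounded to a whole percent.
percent_of_original = {
    cpu: round(100 * after / before) for cpu, (before, after) in timings.items()
}

for cpu, pct in percent_of_original.items():
    print(f"{cpu}: {pct}% of original time")
```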

jdpurcell · 2025-03-20T03:30:06Z

After more testing, I found that my initial commit, despite helping a lot with the Qt archive, could slow down others. An example is here (426 MB). I investigated the differences and determined that CopyBlock is called in two different ways: directly, and indirectly via CopyPending. Despite my best attempts, I couldn't figure out a way to avoid splitting them up. It's very counterintuitive; it almost seems like CopyPending could benefit more from this optimization since it's often copying more data on average, but I think the issue is that about 7% of the time in my testing it gets called with a 1-length request, whereas CopyBlock never does. And there's no way to work around the performance hit with ifs; the only thing I can guess is that 7% is enough to really mess up the branch prediction. (EDIT: I failed to notice that although CopyPending has the potential to copy more data judging by _pendingLen, it's often limited to just a single byte by the _total vs _limit comparison.) For whatever reason, archives like the Qt one are biased way more toward calling CopyBlock than CopyPending, whereas some others are quite the opposite.
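
To make the EDIT note concrete, here is a minimal Python sketch (not the library's C# code) of an output window that clips each copy to a byte limit and defers the remainder. The attributes `total`, `limit`, and `pending_len` mirror the `_total`, `_limit`, and `_pendingLen` fields mentioned above. The class itself, its 1-based `distance`, the `set_limit` helper, and the lack of a circular buffer are simplifying assumptions made for illustration.

```python
class OutWindowSketch:
    """Toy LZ output window; a hypothetical, simplified stand-in for OutWindow."""

    def __init__(self, history: bytes = b"") -> None:
        self.buf = bytearray(history)
        self.total = 0           # bytes produced so far
        self.limit = 0           # bytes the caller currently allows
        self.pending_len = 0     # bytes of the last match still to copy
        self.pending_dist = 0
        self.request_sizes = []  # effective length of each copy, for inspection

    def set_limit(self, extra: int) -> None:
        # Allow `extra` more bytes beyond what has been produced already.
        self.limit = self.total + extra

    def copy_block(self, distance: int, length: int) -> None:
        # Clip the request to what the limit allows and remember the rest.
        n = min(length, self.limit - self.total)
        self.request_sizes.append(n)
        for _ in range(n):
            # Byte-wise copy, so overlapping matches repeat correctly.
            self.buf.append(self.buf[-distance])
        self.total += n
        self.pending_len = length - n
        self.pending_dist = distance

    def copy_pending(self) -> None:
        # Resume a match that an earlier limit cut short.
        if self.pending_len > 0:
            self.copy_block(self.pending_dist, self.pending_len)


w = OutWindowSketch(b"abcd")
w.set_limit(3)
w.copy_block(4, 10)  # asks for 10 bytes, but the limit allows only 3
w.set_limit(1)
w.copy_pending()     # 7 bytes are pending, but the limit allows only 1
print(w.request_sizes, w.pending_len)  # [3, 1] 6
```

In this toy run the pending copy has 7 bytes outstanding, yet the limit clips it to a single byte, which is the effect described in the EDIT note.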

As much as I hated having to duplicate some of the code, I did my best to clean it up at the same time, so it's not too bad overall in my opinion. After splitting it up like this, I still get the big performance gains on the Qt archive. I found a couple of others that were quicker too, and another handful, including the one I just linked, that at least don't regress in performance anymore (if anything there may be a slight improvement, but it's probably a drop in the bucket). Despite the perhaps-not-as-simple code, I feel it's worth it for the performance potential. If you agree, feel free to merge; if not, no worries. I am already using a custom build and could continue to do so.
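
The diff itself isn't reproduced in this conversation, so the following is only a rough Python sketch of what splitting the two paths could look like in general, not the code that was merged. It assumes the direct path gets a bulk (chunked) copy while the pending path keeps a plain byte loop that is cheap for 1-byte requests. All function names and the 1-based `distance` are hypothetical.

```python
def copy_bytewise(buf: bytearray, distance: int, length: int) -> None:
    """Append `length` bytes, each taken from `distance` bytes behind the end."""
    for _ in range(length):
        buf.append(buf[-distance])


def copy_chunked(buf: bytearray, distance: int, length: int) -> None:
    """Same result as copy_bytewise, but moves up to `distance` bytes per slice."""
    while length > 0:
        start = len(buf) - distance
        n = min(distance, length)
        # The slice is materialized before extend, so source and destination
        # never alias within a single step.
        buf.extend(buf[start:start + n])
        length -= n


def copy_block_direct(buf: bytearray, distance: int, length: int) -> None:
    # Assumed direct path: requests are longer, so bulk copying pays off.
    copy_chunked(buf, distance, length)


def copy_block_pending(buf: bytearray, distance: int, length: int) -> None:
    # Assumed pending path: often a single byte, so keep the simple loop.
    copy_bytewise(buf, distance, length)


direct = bytearray(b"abc")
pending = bytearray(b"abc")
copy_block_direct(direct, 2, 5)    # overlapping match: distance < length
copy_block_pending(pending, 2, 5)
print(direct, direct == pending)
```

Both paths must produce identical bytes, including for overlapping matches where `distance` is smaller than `length`; only the cost profile differs. Note that the chunked variant degrades to one byte per step when `distance` is 1.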

adamhathcock

Copying for a reason is always good.

adamhathcock · 2025-03-20T08:24:36Z

Thanks!

jdpurcell marked this pull request as draft March 19, 2025 06:03

Optimize LZ OutWindow.CopyBlock

c2d9bf9

jdpurcell force-pushed the pr-copyblockoptim branch from 98bb86c to c2d9bf9 March 20, 2025 02:54

jdpurcell marked this pull request as ready for review March 20, 2025 03:30

adamhathcock approved these changes Mar 20, 2025


adamhathcock merged commit 227f66f into adamhathcock:master Mar 20, 2025
4 checks passed

jdpurcell deleted the pr-copyblockoptim branch March 20, 2025 13:47

This was referenced Aug 1, 2025

Bump CliWrap and 9 others futrime/lip#286

Closed

update: Bump SharpCompress and 3 others ethan-hann/CyberRadio-Assistant#95

Merged

dependabot bot mentioned this pull request Aug 18, 2025

update: Bump SharpCompress from 0.39.0 to 0.40.0 ethan-hann/CyberRadio-Assistant#96

Merged

dependabot bot mentioned this pull request Sep 5, 2025

Bump SharpCompress from 0.38.0 to 0.40.0 Kiryuumaru/AbsolutePathHelpers#107

Merged

dependabot bot mentioned this pull request Oct 17, 2025

Bump SharpCompress from 0.32.2 to 0.41.0 mattjohnsonpint/TimeZoneConverter#171

Merged

dependabot bot mentioned this pull request Oct 27, 2025

chore: Bump SharpCompress from 0.39.0 to 0.41.0 aweXpect/aweXpect.Testably#106

Open

dependabot bot mentioned this pull request Nov 10, 2025

chore: Bump SharpCompress from 0.39.0 to 0.41.0 aweXpect/aweXpect#845

Merged

This was referenced Nov 24, 2025

chore: Bump SharpCompress from 0.39.0 to 0.41.0 aweXpect/Mockolate#246

Merged

chore: Bump SharpCompress from 0.39.0 to 0.42.0 aweXpect/aweXpect.Mockolate#38

Merged

dependabot bot mentioned this pull request Dec 22, 2025

Bump SharpCompress from 0.39.0 to 0.42.1 AssetRipper/Tpk#114

Closed

This was referenced Jan 5, 2026

Bump SharpCompress from 0.39.0 to 0.43.0 AssetRipper/Tpk#115

Closed

Bump SharpCompress from 0.39.0 to 0.44.0 AssetRipper/Tpk#116

Closed

dependabot bot mentioned this pull request Jan 19, 2026

Bump SharpCompress from 0.39.0 to 0.44.1 AssetRipper/Tpk#119

Merged

dependabot bot mentioned this pull request Feb 13, 2026

Bump SharpCompress from 0.39.0 to 0.46.0 tomtastisch/FileClassifier#36

Closed

This was referenced Mar 4, 2026

Bump SharpCompress from 0.39.0 to 0.47.0 jas88/PACSify#121

Closed

fix(deps): Bump SharpCompress from 0.39.0 to 0.47.0 nzbdav-dev/nzbdav#339

Open
