igzip/riscv64: Optimize adler32_rvv for VLEN=128 by leiwen2025 · Pull Request #390 · intel/isa-l

leiwen2025 · 2026-02-03T08:27:19Z

This PR optimizes the adler32_rvv implementation for vlen=128.

The optimization has been verified on the SG2044 platform:

SG2044:
        new: adler32_warm: runtime =    3062392 usecs, bandwidth 25988 MB in 3.0624 sec = 8486.24 MB/s
        old: adler32_warm: runtime =    3062471 usecs, bandwidth 23095 MB in 3.0625 sec = 7541.43 MB/s

pablodelara · 2026-02-03T10:33:29Z

@leiwen2025 could you update Release notes saying Adler32 has been optimized for RISCV?

leiwen2025 · 2026-02-05T02:56:28Z

@leiwen2025 could you update Release notes saying Adler32 has been optimized for RISCV?

Done. I've updated the Release notes.

pablodelara · 2026-02-05T12:25:03Z

@sunyuechi could you review this PR? Thanks!

sunyuechi · 2026-02-08T15:38:47Z

igzip/riscv64/igzip_isal_adler32_rvv128.S

-
+    vsetvli zero, t0, e32, m8, ta, ma
+    vmv.v.i v8, 0
+    vmv.v.i v24, 0


sunyuechi · 2026-02-08T16:21:52Z

igzip/riscv64/igzip_isal_adler32_rvv128.S

-    vsetvli zero, t0, e16, m4, ta, ma
+    vsetvli zero, t0, e16, m2, ta, ma
+    vwaddu.wv v24, v24, v16
+    vwaddu.wv v24, v24, v18


The above 2 lines are just vwaddu.vv v24, v16, v18? And vwaddu.vv doesn't require register zero-clearing either.

vwaddu.vv v24, v16 ,v18 cannot implement the correct logic. The logic should be v24 += v16 + v18

Okay, I misunderstood.

sunyuechi · 2026-02-09T06:53:48Z

igzip/riscv64/igzip_isal_adler32_rvv128.S

    li      t0, 32
    bltu    a2, t0, tail_bytes
-
+    vsetvli zero, t0, e32, m8, ta, ma


sunyuechi · 2026-02-09T08:03:33Z

LGTM (please squash commits).

Signed-off-by: WenLei <lei.wen2@zte.com.cn>

pablodelara · 2026-02-09T11:12:50Z

This is merged now, thanks.

sunyuechi reviewed Feb 8, 2026

View reviewed changes

sunyuechi reviewed Feb 9, 2026

View reviewed changes

igzip/riscv64:Optimize adler32_rvv for VLEN=128

9dd9b26

Signed-off-by: WenLei <lei.wen2@zte.com.cn>

leiwen2025 force-pushed the optimize_vlen128_new branch from 3ff06c7 to 9dd9b26 Compare February 9, 2026 08:18

pablodelara closed this Feb 9, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

igzip/riscv64: Optimize adler32_rvv for VLEN=128#390

igzip/riscv64: Optimize adler32_rvv for VLEN=128#390
leiwen2025 wants to merge 1 commit intointel:masterfrom
leiwen2025:optimize_vlen128_new

leiwen2025 commented Feb 3, 2026

Uh oh!

pablodelara commented Feb 3, 2026

Uh oh!

leiwen2025 commented Feb 5, 2026

Uh oh!

pablodelara commented Feb 5, 2026

Uh oh!

sunyuechi Feb 8, 2026 •

edited

Loading

Uh oh!

leiwen2025 Feb 9, 2026

Uh oh!

sunyuechi Feb 8, 2026

Uh oh!

leiwen2025 Feb 9, 2026

Uh oh!

sunyuechi Feb 9, 2026

Uh oh!

sunyuechi Feb 9, 2026

Uh oh!

leiwen2025 Feb 9, 2026

Uh oh!

sunyuechi commented Feb 9, 2026 •

edited

Loading

Uh oh!

pablodelara commented Feb 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

leiwen2025 commented Feb 3, 2026

Uh oh!

pablodelara commented Feb 3, 2026

Uh oh!

leiwen2025 commented Feb 5, 2026

Uh oh!

pablodelara commented Feb 5, 2026

Uh oh!

sunyuechi Feb 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

leiwen2025 Feb 9, 2026

Choose a reason for hiding this comment

Uh oh!

sunyuechi Feb 8, 2026

Choose a reason for hiding this comment

Uh oh!

leiwen2025 Feb 9, 2026

Choose a reason for hiding this comment

Uh oh!

sunyuechi Feb 9, 2026

Choose a reason for hiding this comment

Uh oh!

sunyuechi Feb 9, 2026

Choose a reason for hiding this comment

Uh oh!

leiwen2025 Feb 9, 2026

Choose a reason for hiding this comment

Uh oh!

sunyuechi commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pablodelara commented Feb 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sunyuechi Feb 8, 2026 •

edited

Loading

sunyuechi commented Feb 9, 2026 •

edited

Loading