aarch64: Support FEAT_LSFE by taiki-e · Pull Request #201 · taiki-e/portable-atomic

taiki-e · 2025-01-12T14:18:44Z

Armv9.6 added atomic float instructions for binary{16,32,64} and bfloat16 as FEAT_LSFE (Large System Float Extension).

This PR optimizes AArch64 {16,32,64}-bit atomic float add/sub/max/min when FEAT_LSFE is enabled.
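For context, here is a minimal sketch of the kind of fallback that a native atomic float instruction can replace: a compare-and-swap loop over the float's bit pattern. It uses only `std`; the helper names `fetch_update_f32` and `fetch_add_f32` are hypothetical and this is not the code in this PR.

```rust
use std::sync::atomic::{AtomicU32, Ordering};

/// Hypothetical helper: applies `op` to the f32 stored as raw bits in `a`
/// using a CAS loop, and returns the previous value.
fn fetch_update_f32(a: &AtomicU32, order: Ordering, op: impl Fn(f32) -> f32) -> f32 {
    let prev = a
        .fetch_update(order, Ordering::Relaxed, |bits| {
            // Reinterpret the bits as f32, apply the operation, store the bits back.
            Some(op(f32::from_bits(bits)).to_bits())
        })
        // The closure always returns `Some`, so this cannot fail.
        .unwrap();
    f32::from_bits(prev)
}

/// Hypothetical helper: atomic float add built on the CAS loop above.
fn fetch_add_f32(a: &AtomicU32, val: f32, order: Ordering) -> f32 {
    fetch_update_f32(a, order, |x| x + val)
}
```

The loop retries whenever another thread changes the value between the load and the store. A single instruction that performs the operation atomically avoids that retry.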

LLVM's assembly support for FEAT_LSFE needs LLVM 20 (llvm/llvm-project@67ff5ba), so this PR uses the .inst directive on LLVM 19 or older.
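To illustrate the `.inst` technique in general: the directive emits a raw 32-bit instruction encoding, so the assembler does not need to know the mnemonic. The sketch below uses the AArch64 `NOP` encoding (`0xd503201f`) as a stand-in. The LSFE encodings used by this PR are not reproduced here, and the function name `nop_via_inst` is hypothetical.

```rust
/// Hypothetical example: emits an AArch64 NOP through its raw encoding
/// instead of its mnemonic. Returns true if the raw encoding was executed.
#[cfg(target_arch = "aarch64")]
fn nop_via_inst() -> bool {
    // SAFETY: NOP has no effect on memory, registers, or flags.
    unsafe {
        std::arch::asm!(
            ".inst 0xd503201f", // same as `nop`
            options(nomem, nostack, preserves_flags)
        );
    }
    true
}

/// On other architectures this sketch does nothing.
#[cfg(not(target_arch = "aarch64"))]
fn nop_via_inst() -> bool {
    false
}
```

Because the assembler just copies the word into the output, it does not check that the target supports the instruction. The caller has to guarantee that.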

Run-time detection is also implemented, but for now it is used only in tests. AFAIK no CPUs implement this feature yet, so for now the optimized path is selected only when the feature is enabled at compile time.
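A rough sketch of how a compile-time check and a cached run-time probe can be combined. All names here (`FeatureCache`, `get`, the `probe` closure) are hypothetical, and the real OS-specific probe is left out. This is not the detection code in this PR.

```rust
use std::sync::atomic::{AtomicU8, Ordering};

const UNINIT: u8 = 0;
const ABSENT: u8 = 1;
const PRESENT: u8 = 2;

/// Hypothetical cache for the result of a run-time feature probe.
struct FeatureCache(AtomicU8);

impl FeatureCache {
    const fn new() -> Self {
        Self(AtomicU8::new(UNINIT))
    }

    /// `compile_time` stands for "the feature was enabled when compiling".
    /// `probe` stands for an OS-specific run-time check (assumed, not shown).
    fn get(&self, compile_time: bool, probe: impl FnOnce() -> bool) -> bool {
        if compile_time {
            // Known at build time: no run-time check needed.
            return true;
        }
        match self.0.load(Ordering::Relaxed) {
            PRESENT => true,
            ABSENT => false,
            _ => {
                // First call: run the probe once and remember the answer.
                let found = probe();
                self.0.store(if found { PRESENT } else { ABSENT }, Ordering::Relaxed);
                found
            }
        }
    }
}
```

With only the compile-time branch in use, the check folds to a constant and costs nothing at run time.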

taiki-e added the O-aarch64 label (Target: Armv8-A, Armv8-R, or later processors in AArch64 mode) Jan 12, 2025

taiki-e mentioned this pull request Jan 12, 2025

Optimize atomic floats on nvptx #34

Open

taiki-e force-pushed the aarch64-lsfe branch from f666498 to 8ef05df Compare January 12, 2025 14:21

taiki-e added the A-float label (Area: related to atomic float) Jan 12, 2025

taiki-e force-pushed the aarch64-lsfe branch from 8ef05df to b942dd3 Compare January 12, 2025 16:48

taiki-e force-pushed the main branch 5 times, most recently from 53c8409 to 378f6cd Compare January 15, 2025 15:07

taiki-e force-pushed the main branch 8 times, most recently from 52836df to 4a9ffc4 Compare February 5, 2025 17:50

taiki-e force-pushed the main branch 5 times, most recently from a368389 to eeb0235 Compare February 24, 2025 12:09

taiki-e force-pushed the main branch 5 times, most recently from 77c5d0d to 813bf8f Compare March 7, 2025 16:12

taiki-e force-pushed the aarch64-lsfe branch 2 times, most recently from 74aec13 to fdd02b9 Compare March 8, 2025 13:46

taiki-e force-pushed the aarch64-lsfe branch 3 times, most recently from 94aeaa1 to cc03496 Compare March 8, 2025 14:20

aarch64: Support FEAT_LSFE

564c517

taiki-e force-pushed the aarch64-lsfe branch from cc03496 to 564c517 Compare March 8, 2025 15:11

taiki-e merged commit 05bef02 into main Mar 8, 2025
119 checks passed

taiki-e deleted the aarch64-lsfe branch March 8, 2025 17:04
