Solving bank conflict via padding and TMA 3D store #78

Merged: LyricZhao merged 3 commits into main from tma-3d-padding on Apr 3, 2025

Conversation

@LyricZhao (Collaborator)

This optimization should make general cases about 1% faster and cases with small K about 10% faster.
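For readers unfamiliar with the padding technique named in the title, here is a minimal host-side sketch of why padding removes shared-memory bank conflicts. It is a generic illustration and not the code from this PR: the function name `max_conflict_degree` and the one-word padding are assumptions chosen for the example, and the TMA 3D store part is not modeled. It relies only on the fact that NVIDIA shared memory is organized as 32 banks of 4-byte words.

```python
# Host-side model of shared-memory bank mapping (illustrative only, not kernel code).
NUM_BANKS = 32  # NVIDIA shared memory: 32 banks, each 4 bytes wide


def max_conflict_degree(row_stride_words: int, num_threads: int = 32, col: int = 0) -> int:
    """Worst-case number of threads that hit the same bank when thread t
    accesses word (row=t, col=col) of a tile whose consecutive rows are
    row_stride_words 4-byte words apart. (Hypothetical helper.)"""
    hits = [0] * NUM_BANKS
    for t in range(num_threads):
        word_index = t * row_stride_words + col
        hits[word_index % NUM_BANKS] += 1
    return max(hits)


print(max_conflict_degree(32))  # unpadded row of 32 words -> 32 (fully serialized)
print(max_conflict_degree(33))  # row padded by one word   -> 1 (conflict-free)
```

With a 32-word row stride, every thread in the warp lands on the same bank; adding one word of padding per row shifts each row by one bank, so a column access touches 32 distinct banks.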

LyricZhao requested a review from zheanxu on April 3, 2025, 07:59
LyricZhao self-assigned this on April 3, 2025
LyricZhao merged commit c187c23 into main on April 3, 2025
LyricZhao deleted the tma-3d-padding branch on April 11, 2025, 03:35

2 participants