Skip to content

Revert "Better CPU prompt processing performance for SWA models (#696)"#701

Merged
ikawrakow merged 1 commit intomainfrom
ik/reverts
Aug 17, 2025
Merged

Revert "Better CPU prompt processing performance for SWA models (#696)"#701
ikawrakow merged 1 commit intomainfrom
ik/reverts

Conversation

@ikawrakow
Copy link
Copy Markdown
Owner

Clearly I did not test well enough. The PR leads to a segmentation fault with hybrid GPU/CPU inference.

@ikawrakow ikawrakow merged commit a3a5230 into main Aug 17, 2025
@Ph0rk0z
Copy link
Copy Markdown

Ph0rk0z commented Aug 17, 2025

Me too. Was just in the process of recompiling with the commit removed to see if that's it. GLM-4.5 gave segfault.

@usrlocalben
Copy link
Copy Markdown
Contributor

here as well, was just about to report

@calvin2021y
Copy link
Copy Markdown

pure cpu also crash.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants