webui: Fix selecting generated output issues during active streaming#18091

Merged
allozaur merged 16 commits into ggml-org:master from allozaur:17132-select-message-during-generation
Dec 18, 2025

Conversation

@allozaur (Contributor) commented Dec 16, 2025

Close #17132

  • Implements incremental rendering (initial draft created by @ServeurpersoCom): splits markdown content into stable blocks and a single unstable block:
    • Stable blocks (all but the last) are cached and only rendered once
    • Only the last block (unstable) is re-rendered during streaming
    • This prevents DOM reconstruction of already-rendered content, enabling smooth text selection
    • Uses HAST node positions to identify stable blocks
  • Reduces unnecessary re-processing of unchanged markdown blocks
  • Improves code organization in the MarkdownContent component
demo.mp4
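The stable/unstable split described above can be sketched as pure logic. This is a minimal illustration, not the component's actual code: it uses naive blank-line block splitting and a toy renderer, whereas the real webui identifies blocks via HAST node positions.

```typescript
// Hypothetical sketch of incremental rendering with stable blocks:
// every block except the last is treated as stable and its rendered HTML
// is cached, so streaming only re-renders the unstable tail block.
type Rendered = { source: string; html: string };

const cache = new Map<number, Rendered>(); // keyed by block index

// Toy "renderer"; the real webui renders markdown through a HAST pipeline.
const render = (md: string) => `<p>${md}</p>`;

function renderIncremental(markdown: string): string[] {
  const blocks = markdown.split(/\n{2,}/); // simplistic block splitting
  return blocks.map((src, i) => {
    const isLast = i === blocks.length - 1;
    const hit = cache.get(i);
    // Stable block whose source is unchanged: reuse the cached HTML.
    if (!isLast && hit && hit.source === src) return hit.html;
    // Unstable (or changed) block: re-render, cache it once it is stable.
    const html = render(src);
    if (!isLast) cache.set(i, { source: src, html });
    return html;
  });
}
```

Because stable blocks return the exact cached strings, a keyed renderer on top of this would leave their DOM nodes untouched, which is what keeps an in-progress text selection alive.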

@allozaur (author) commented:

@ggerganov @ngxson @ServeurpersoCom

Please do some testing on your end as well, and let me know whether this PR is missing anything needed to address the issue.

@ServeurpersoCom (Contributor) commented Dec 16, 2025

Fantastic! I absolutely must stress test it with an MoE A3B model on a 5090, because my draft kept crashing after a while!

Edit: we hit the same edge-case bug as in the POC/draft (video watched together). We need to narrow it down by reproducing the content that triggers it.

@ServeurpersoCom (Contributor) commented Dec 16, 2025

Running GPT-OSS-20B at 330 tok/s, inference was faster than rendering, which made it easy to trigger the race condition between stable/unstable block updates. The solution is to `await tick()` to force a DOM sync.
I can no longer reproduce the bug with this patch.
I haven't noticed any performance drop, although rendering could later be throttled to at most one update per `requestAnimationFrame` (or a submultiple of it).
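The race and the `await tick()` fix can be modeled outside Svelte. This is a hypothetical simulation, not the webui's code: `setBlocks`/`flush` stand in for a framework whose state-to-DOM flush is asynchronous, and `tick` mimics Svelte's `tick()` by resolving after pending flushes have run.

```typescript
// Why awaiting a DOM flush between block updates avoids the
// stable/unstable race: without it, a fast producer can overwrite
// state before the DOM reflects the previous update.
type Block = { id: number; html: string };

let dom: string[] = [];              // simulated rendered DOM
let pending: Block[] | null = null;  // state waiting to be flushed

function setBlocks(blocks: Block[]) {
  pending = blocks;
  queueMicrotask(flush); // flushes asynchronously, like a framework would
}

function flush() {
  if (pending) {
    dom = pending.map((b) => b.html);
    pending = null;
  }
}

// Resolves after already-queued microtasks (i.e. the pending flush) ran.
const tick = () => new Promise<void>((r) => queueMicrotask(() => r()));

async function streamChunks(chunks: string[][]): Promise<string[]> {
  for (const chunk of chunks) {
    setBlocks(chunk.map((html, id) => ({ id, html })));
    await tick(); // force the DOM to catch up before the next chunk
  }
  return dom;
}
```

With the `await tick()` removed, the loop would finish before any flush ran, which is the shape of the corruption seen at 330 tok/s.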

@ggerganov (Member) commented:

Generally this also works on my end, but I occasionally see bigger selections than expected. Maybe it is related to the race that @ServeurpersoCom found:

webui-selection-0.mp4

allozaur force-pushed the 17132-select-message-during-generation branch from eb39de1 to 511a426 on December 17, 2025 at 09:50
@ggerganov (Member) commented:

Actually, I did some more testing: the problem I observed occurs only when trying to select text inside a code block that is currently being generated. After the block is closed, selecting within it works fine.

I think this is acceptable.

@allozaur (author) commented:

> Actually, I did some more testing and the problem that I observed occurs only when I am trying to select text inside a code block that is currently being generated. After it gets closed, then selecting for that block works ok.
>
> I think this is acceptable.

@ggerganov @ServeurpersoCom I've added some changes after @ngxson's review. Please re-test this on your end.

@ServeurpersoCom (Contributor) commented:

No regression on my end: more than 10 long, rich markdown generations succeeded, whereas the corruption previously occurred systematically after 2 or 3 generations.
Testing: GPT-OSS-20B
Prompt: "Write a long and rich markdown"

  • Smartphone test OK

@ggerganov (Member) commented:

Found a bug introduced here: when you use "Copy code", it drops the whitespace:

image

@ngxson (Contributor) commented:

Tested on my side; it works except inside a code block that is still being generated, as Georgi spotted earlier.

We can improve this in the future by somehow avoiding setting innerHTML via {@html block.html}. Instead, I think it's best to have a system that takes a HastRoot and uses depth-first search to compute the diff between the two virtual DOMs, updating only the changed nodes in the HTML. There are probably libraries that already do all of this heavy lifting for us, but we can look into it later.

For now, I think this PR is good to merge.
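The diff idea above can be sketched as a depth-first walk over a simplified HAST-like tree. The `HNode` type and `diff` helper are illustrative only, not an actual library API: both trees are walked in lockstep and the paths of differing nodes are recorded, so only those subtrees would be patched in the real DOM.

```typescript
// Hypothetical DFS diff between two virtual trees: collect paths of
// changed nodes instead of replacing the whole rendered output.
type HNode = { tag: string; text?: string; children?: HNode[] };

function diff(
  a: HNode | undefined,
  b: HNode | undefined,
  path = "0",
  out: string[] = []
): string[] {
  // Node added, removed, or changed: mark this path for patching and
  // treat the whole subtree as replaced (no need to descend further).
  if (!a || !b || a.tag !== b.tag || a.text !== b.text) {
    out.push(path);
    return out;
  }
  const ac = a.children ?? [];
  const bc = b.children ?? [];
  const n = Math.max(ac.length, bc.length);
  for (let i = 0; i < n; i++) diff(ac[i], bc[i], `${path}.${i}`, out);
  return out;
}
```

A production version would also need keyed matching so an inserted sibling does not cascade into diffs for every node after it; that is the heavy lifting an existing library would ideally provide.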

@ServeurpersoCom (Contributor) commented:

Same on Windows: copying and pasting yields no \n (or \r\n) line endings.
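The symptom both reports describe is consistent with joining per-line fragments of a highlighted code block without a separator. A minimal sketch, hypothetical rather than the webui's actual extraction code:

```typescript
// Hypothetical illustration of the "Copy code" whitespace bug:
// highlighted code is often rendered as one fragment per line, and
// joining the fragments without a separator drops the newlines the
// user expects to get on paste.
const lines = ["function f() {", "  return 1;", "}"];

const broken = lines.join("");   // newlines lost: "function f() {  return 1;}"
const fixed = lines.join("\n");  // preserves line structure for the clipboard
```

Reading the whole `<code>` element's `textContent` (which keeps the text nodes' newlines) instead of concatenating per-line fragments avoids the same pitfall in real DOM extraction.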

allozaur force-pushed the 17132-select-message-during-generation branch from 6768dec to aa461e8 on December 18, 2025 at 00:59
@allozaur (author) commented:

> Found a bug introduced here - when you "Copy code", it forgets the whitespaces:
> image

@ggerganov @ServeurpersoCom this should be fixed with 30c2c18

allozaur force-pushed the 17132-select-message-during-generation branch from aa461e8 to b4aa66a on December 18, 2025 at 10:13
allozaur merged commit 9ce64ae into ggml-org:master on Dec 18, 2025
10 checks passed
allozaur deleted the 17132-select-message-during-generation branch on December 18, 2025 at 10:17
@thomasjfox (Contributor) commented:

Thanks so much for this one! 🥳

It fixes a usability annoyance for good. The users will love it.

@allozaur (author) commented:

> Thanks so much for this one! 🥳
>
> It fixes a usability annoyance for good. The users will love it.

Great to hear! It's still not in a perfect state and we are planning a better strategy for rendering the generated content, but it solves the most pressing issue.

Anico2 added a commit to Anico2/llama.cpp that referenced this pull request Jan 15, 2026
…gml-org#18091)

* draft: incremental markdown rendering with stable blocks

* refactor: Logic improvements

* refactor: DRY Markdown post-processing logic

* refactor: ID generation improvements

* fix: Remove runes

* refactor: Clean up & add JSDocs

* chore: update webui static output

* fix: Add tick to prevent race conditions for rendering Markdown blocks

Suggestion from @ServeurpersoCom

Co-authored-by: Pascal <[email protected]>

* chore: Run `npm audit fix`

* chore: update webui static output

* feat: Improve performance using global counter & id instead of UUID

* refactor: Enhance Markdown rendering with link and code features

* chore: update webui static output

* fix: Code block content extraction

* chore: update webui static output

* chore: update webui static output

---------

Co-authored-by: Pascal <[email protected]>
blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026

Development

Successfully merging this pull request may close these issues.

Misc. bug: Webui: Problems selecting text while generating

5 participants