Conversation

@dinhlongviolin1 (Contributor)

Describe Your Changes

Fixes Issues

  • Closes #
  • Closes #

Self Checklist

  • Added relevant comments, especially in complex areas
  • Updated docs (for bug fixes / features)
  • Created issues for follow-up changes or refactoring needed

urmauur and others added 30 commits September 12, 2025 10:58
…-dom

fix: avoid error when validating nested DOM (#6431)

* fix: correct context shift flag handling in LlamaCPP extension

The previous implementation added the `--no-context-shift` flag when `cfg.ctx_shift` was disabled, which conflicted with the llama.cpp CLI where the presence of `--context-shift` enables the feature.
The logic is updated to push `--context-shift` only when `cfg.ctx_shift` is true, ensuring the extension passes the correct argument and behaves as expected.
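A minimal sketch of the corrected argument construction, assuming a `cfg.ctx_shift` boolean and an `args` array as the commit describes:

```typescript
// Assumed config shape; only the ctx_shift handling is shown.
interface LlamaCppConfig {
  ctx_shift: boolean
}

function buildArgs(cfg: LlamaCppConfig): string[] {
  const args: string[] = []
  // Previously: args.push('--no-context-shift') when ctx_shift was false.
  // The llama.cpp CLI enables the feature by the *presence* of
  // --context-shift, so the flag is pushed only when ctx_shift is true.
  if (cfg.ctx_shift) {
    args.push('--context-shift')
  }
  return args
}
```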

* feat: detect model out of context during generation

---------

Co-authored-by: Dinh Long Nguyen <[email protected]>
Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
- Removed the unused `getKVCachePerToken` helper and replaced it with a unified `estimateKVCache` that returns both the total size and the per-token size (see the sketch after this list).
- Fixed the KV cache size calculation to account for all layers, correcting a previous under-estimation.
- Added proper clamping of user-requested context lengths to the model's maximum.
- Refactored VRAM budgeting: introduced explicit reserves, a fixed engine overhead, and separate multipliers for VRAM and system RAM based on memory mode.
- Implemented a more robust planning flow with clear GPU, Hybrid, and CPU pathways, including fallback configurations when resources are insufficient.
- Updated default context length handling and safety buffers to prevent OOM situations.
- Adjusted the usable memory percentage to 90% and refined logging for easier debugging.
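A sketch of the unified estimator; the field names and the f16-cache assumption are illustrative, not the extension's actual code:

```typescript
// Illustrative model metadata; real values come from GGUF headers.
interface ModelInfo {
  nLayers: number    // all layers count toward the cache
  nKvHeads: number   // KV attention heads
  headDim: number    // dimension per head
  maxContext: number // model's trained maximum context length
}

interface KVCacheEstimate {
  totalBytes: number
  bytesPerToken: number
}

function estimateKVCache(model: ModelInfo, requestedCtx: number): KVCacheEstimate {
  // Clamp the user-requested context length to the model's maximum.
  const ctx = Math.min(requestedCtx, model.maxContext)
  // K and V each store nKvHeads * headDim values per layer per token;
  // 2 bytes per value assumes an f16 cache.
  const bytesPerToken = 2 * model.nLayers * model.nKvHeads * model.headDim * 2
  return { totalBytes: bytesPerToken * ctx, bytesPerToken }
}
```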
The llama.cpp backend can emit the phrase "failed to allocate" when it runs out of memory. Adding this check ensures such messages are correctly classified as out-of-memory errors, providing more accurate error handling for CPU backends.
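A minimal sketch of the classification; the 'out of memory' pattern is an illustrative placeholder for checks assumed to already exist, while 'failed to allocate' is the one this change adds:

```typescript
// Classify a llama.cpp backend error message as out-of-memory.
function isOutOfMemoryError(message: string): boolean {
  const oomPatterns = ['out of memory', 'failed to allocate']
  const lower = message.toLowerCase()
  return oomPatterns.some((pattern) => lower.includes(pattern))
}
```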
Use fallback value 'high' for memory_util config and remove unused GgufMetadata import.
fix: immediately update value on model selection
fix: validate mmproj from general basename
github-roushan and others added 20 commits September 19, 2025 11:36
…nd-videos

docs: update URL for GIFs and videos
fix(number-input): preserve '0.0x' format when typing (#6520)
feat: fix remote provider vision capability
* fix: prevent consecutive messages with same role

* fix: tests

* fix: first message should not be assistant

* fix: tests
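Read together, these commits suggest a normalization pass over the outgoing message list. A hedged sketch of the two rules named above, with the message shape assumed:

```typescript
type ChatMessage = { role: 'system' | 'user' | 'assistant'; content: string }

function normalizeMessages(messages: ChatMessage[]): ChatMessage[] {
  const result: ChatMessage[] = []
  for (const msg of messages) {
    const prev = result[result.length - 1]
    if (prev && prev.role === msg.role) {
      // Merge consecutive same-role messages instead of sending both.
      prev.content += '\n' + msg.content
    } else {
      result.push({ ...msg })
    }
  }
  // The first non-system message should not be an assistant turn; dropping
  // it is one possible fix (the PR does not show the exact handling).
  const firstIdx = result.findIndex((m) => m.role !== 'system')
  if (firstIdx !== -1 && result[firstIdx].role === 'assistant') {
    result.splice(firstIdx, 1)
  }
  return result
}
```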
…e-mcp

enhancement: toaster when deleting an MCP server
* feat: Prompt progress when streaming

- BE changes:
    - Add a `return_progress` flag to `chatCompletionRequest` and a corresponding `prompt_progress` payload in `chatCompletionChunk`. Introduce a `chatCompletionPromptProgress` interface to capture cache, processed, time, and total token counts (see the sketch below).
    - Update the Llamacpp extension to always request progress data when streaming, enabling UI components to display real-time generation progress and leverage llama.cpp's built-in progress reporting.
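A sketch of the shapes the commit describes; exact definitions in the extension may differ:

```typescript
interface chatCompletionPromptProgress {
  cache: number      // prompt tokens reused from the cache
  processed: number  // prompt tokens processed so far
  time: number       // elapsed processing time
  total: number      // total prompt tokens
}

interface chatCompletionRequest {
  stream?: boolean
  return_progress?: boolean // made optional in a follow-up commit below
  // ...other completion fields
}

interface chatCompletionChunk {
  prompt_progress?: chatCompletionPromptProgress
  // ...delta/content fields
}
```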

* Make return_progress optional

* chore: update ui prompt progress before streaming content

* chore: remove log

* chore: remove progress when percentage >= 100

* chore: set timeout prompt progress

* chore: move prompt progress outside streaming content

* fix: tests

---------

Co-authored-by: Faisal Amir <[email protected]>
Co-authored-by: Louis <[email protected]>
* feat: add getTokensCount method to compute token usage

Implemented a new async `getTokensCount` function in the LLaMA.cpp extension.
The method validates the model session, checks process health, applies the request template, and tokenizes the resulting prompt to return the token count. It includes detailed error handling for crashed models and API failures, enabling callers to assess token usage before sending completions.
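A condensed sketch of that flow; `findSession`, `isProcessAlive`, and `applyChatTemplate` are hypothetical stand-ins for the extension's internals, while `/tokenize` is the llama.cpp server's tokenizer endpoint:

```typescript
type Message = { role: string; content: string }

declare function findSession(id: string): { pid: number; port: number } | undefined
declare function isProcessAlive(pid: number): Promise<boolean>
declare function applyChatTemplate(session: { port: number }, messages: Message[]): Promise<string>

async function getTokensCount(sessionId: string, messages: Message[]): Promise<number> {
  // Validate the model session and check process health before calling the API.
  const session = findSession(sessionId)
  if (!session) throw new Error('Model is not loaded')
  if (!(await isProcessAlive(session.pid))) {
    throw new Error('Model process has crashed')
  }

  // Render the prompt with the model's chat template, then tokenize it.
  const prompt = await applyChatTemplate(session, messages)
  const res = await fetch(`http://localhost:${session.port}/tokenize`, {
    method: 'POST',
    body: JSON.stringify({ content: prompt }),
  })
  if (!res.ok) throw new Error(`Tokenize request failed: ${res.status}`)
  const { tokens } = (await res.json()) as { tokens: number[] }
  return tokens.length
}
```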

* Fix: typos

* chore: update ui token usage

* chore: remove unused code

* feat: add image token handling for multimodal LlamaCPP models

Implemented support for counting image tokens when using vision-enabled models (see the sketch after this list):
- Extended `SessionInfo` with an optional `mmprojPath` to store the multimodal projector file.
- Propagated `mmproj_path` from the Tauri plugin into the session info.
- Added an import of `chatCompletionRequestMessage` and enhanced the token calculation logic in the LlamaCPP extension:
    - Detects image content in messages.
    - Reads GGUF metadata from `mmprojPath` to compute accurate image token counts.
    - Provides a fallback estimation if metadata reading fails.
    - Returns the sum of text and image tokens.
- Introduced helper methods `calculateImageTokens` and `estimateImageTokensFallback`.
- Minor clean-ups such as comment capitalization and debug logging.
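A sketch of the combined flow under assumed message and session shapes; the declared helpers stand in for the extension's internals:

```typescript
interface SessionInfo { mmprojPath?: string }
type ContentPart =
  | { type: 'text'; text: string }
  | { type: 'image_url'; image_url: { url: string } }
interface ChatMessage { role: string; content: string | ContentPart[] }

declare function tokenizeTextParts(s: SessionInfo, m: ChatMessage[]): Promise<number>
declare function readGgufMetadata(path: string): Promise<Record<string, unknown>>
declare function calculateImageTokens(meta: Record<string, unknown>): number
declare function estimateImageTokensFallback(): number

async function countPromptTokens(session: SessionInfo, messages: ChatMessage[]): Promise<number> {
  const textTokens = await tokenizeTextParts(session, messages)

  // Detect image content in the messages.
  const imageCount = messages
    .flatMap((m) => (Array.isArray(m.content) ? m.content : []))
    .filter((p) => p.type === 'image_url').length
  if (imageCount === 0 || !session.mmprojPath) return textTokens

  let perImage: number
  try {
    // Read GGUF metadata from the multimodal projector file.
    const meta = await readGgufMetadata(session.mmprojPath)
    perImage = calculateImageTokens(meta)
  } catch {
    perImage = estimateImageTokensFallback() // fallback estimate
  }
  return textTokens + imageCount * perImage
}
```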

* chore: update FE to send message params including content type `image_url`

* fix mmproj path from session info and num tokens calculation

* fix: Correct image token estimation calculation in llamacpp extension

This commit addresses an inaccurate token count for images in the llama.cpp extension.

The previous logic incorrectly calculated the token count based on image patch size and dimensions. It has been replaced with a more precise method that uses the `clip.vision.projection_dim` value from the model metadata.

Additionally, unnecessary debug logging was removed, and a new log was added to show the mmproj metadata for improved visibility.
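The PR does not show the exact derivation, so this sketch simply reads the metadata key and treats its value as the per-image token count; that mapping is an assumption:

```typescript
// Per the commit, the per-image count comes from clip.vision.projection_dim
// in the mmproj metadata instead of patch-size geometry. Using the value
// directly is an assumption for illustration only.
function calculateImageTokens(meta: Record<string, unknown>): number {
  const dim = meta['clip.vision.projection_dim']
  if (typeof dim !== 'number') {
    throw new Error('mmproj metadata is missing clip.vision.projection_dim')
  }
  return dim
}
```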

* fix per image calc

* fix: crash due to force unwrap

---------

Co-authored-by: Faisal Amir <[email protected]>
Co-authored-by: Louis <[email protected]>
* fix: custom fetch for all providers

* fix: running in development should use the built-in fetch
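A minimal sketch of that selection; `customFetch` and the environment check are illustrative assumptions (the real check might be a build-time flag):

```typescript
declare const customFetch: typeof fetch

const isDev = process.env.NODE_ENV === 'development'

// All providers share one implementation: the custom fetch in production
// builds, the runtime's built-in fetch during development.
const providerFetch: typeof fetch = isDev
  ? globalThis.fetch.bind(globalThis)
  : customFetch
```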
* fix: prevent relocation to root directories

* Update web-app/src/locales/zh-TW/settings.json

Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>

---------

Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
* feat: implement conversation endpoint

* use conversation aware endpoint

* fetch message correctly

* preserve first message

* fix logout

* fix broadcast issue locally + auth not refreshing profile on other tabs + clean up and sync messages

* add is dev tag

ellipsis-dev bot commented Sep 23, 2025

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at [email protected]


Generated with ❤️ by ellipsis.dev

@louis-jan (Contributor) left a comment

LGTM

@dinhlongviolin1 merged commit 7413f13 into dev-web on Sep 23, 2025 (10 of 11 checks passed)
github-project-automation bot moved this to QA in Jan on Sep 23, 2025
github-actions bot added this to the v0.7.0 milestone on Sep 23, 2025
@github-actions bot commented

Preview URL: https://6dce4553.docs-9ba.pages.dev
