Sync Release/v0.6.6 into dev #5973
Merged
Conversation
sync: commits from dev into release/v0.7.0
Sync dev into Release/v0.7.0
…5087)
* add pull and abortPull
* add model import (download only)
* write model.yaml; support local model import
* remove cortex-related command
* add TODO
* remove cortex-related command
* fix: update UI version_backend, mem usage hardware
* chore: hide GPU from system monitor on mac
* chore: fix GPUs VRAM
This commit addresses a potential race condition that could lead to "connection errors" when unloading a llamacpp model. The issue arose because the `activeSessions` map still held the session info for the model during unload, so an in-flight request could hit the backend while it was still taking time to unload. The fix involves:
1. **Deleting the `pid` from `activeSessions` before calling the backend's unload:** this ensures the model is cleared from the map before unloading starts, so no new requests are routed to it.
2. **Failure handling:** if the backend fails to unload, the session info for that model is added back so the map stays consistent with the still-loaded model.
This improves the robustness and reliability of the unloading process by preventing these conflicts.
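A minimal TypeScript sketch of the ordering described above, assuming an `activeSessions` map keyed by `pid`; the surrounding class, `SessionInfo` shape, and `backendUnload` helper are illustrative, not the actual extension code.

```typescript
// Hypothetical shapes, for illustration only.
interface SessionInfo {
  pid: number;
  modelId: string;
  port: number;
}

class LlamacppSessionManager {
  private activeSessions = new Map<number, SessionInfo>();

  async unload(pid: number): Promise<void> {
    const session = this.activeSessions.get(pid);
    if (!session) return; // nothing to unload

    // 1. Remove the session first so concurrent requests no longer
    //    route to a backend that is about to go away.
    this.activeSessions.delete(pid);

    try {
      await this.backendUnload(session);
    } catch (err) {
      // 2. On failure, restore the entry so the map stays consistent
      //    with the model that is still loaded.
      this.activeSessions.set(pid, session);
      throw err;
    }
  }

  private async backendUnload(session: SessionInfo): Promise<void> {
    // Placeholder for the real backend unload call.
  }
}
```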
* fix: migrate app settings to the new version
* fix: edge cases
* fix: migrate HF import model on Windows
* fix: hardware page broken after downgrade
* test: correct test
* fix: backward compatible hardware info
* fix: selected openrouter model does not work
* test: add tests to cover new change
…ng (#5947)
This commit addresses a race condition where, with "Auto-Unload Old Models" enabled, rapidly attempting to load multiple models could result in more than one model being loaded simultaneously. Previously, the unloading logic did not account for models that were still in the process of loading when a new load operation was initiated. This allowed new models to start loading before the previous ones had fully completed their unload cycle. To resolve this:
- A `loadingModels` map has been introduced to track promises for models currently in the loading state.
- The `load` method now checks if a model is already being loaded and, if so, returns the existing promise, preventing duplicate load operations for the same model.
- The `performLoad` method (which encapsulates the actual loading logic) now ensures that when `autoUnload` is active, it waits for any *other* models that are concurrently loading to finish before proceeding to unload all currently loaded models.
This guarantees that the auto-unload mechanism properly unloads all models, including those initiated in quick succession, thereby preventing the race condition. This fixes the issue where clicking the start button very fast on multiple models would bypass the auto-unload functionality.
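The deduplication and wait-for-other-loads logic might look roughly like this TypeScript sketch. The names `loadingModels`, `load`, `performLoad`, and `autoUnload` follow the description above; the surrounding class, `loadedModels` set, and `unload` helper are hypothetical.

```typescript
class ModelLoader {
  private loadingModels = new Map<string, Promise<void>>();
  private loadedModels = new Set<string>();
  private autoUnload = true;

  // Return the in-flight promise if this model is already loading,
  // so rapid repeated clicks do not start duplicate loads.
  load(modelId: string): Promise<void> {
    const existing = this.loadingModels.get(modelId);
    if (existing) return existing;

    const promise = this.performLoad(modelId).finally(() => {
      this.loadingModels.delete(modelId);
    });
    this.loadingModels.set(modelId, promise);
    return promise;
  }

  private async performLoad(modelId: string): Promise<void> {
    if (this.autoUnload) {
      // Wait for any *other* in-flight loads to finish so they are
      // visible in loadedModels before we unload everything.
      const others = [...this.loadingModels.entries()]
        .filter(([id]) => id !== modelId)
        .map(([, p]) => p.catch(() => {}));
      await Promise.all(others);

      // Unload every model that is currently loaded.
      await Promise.all([...this.loadedModels].map((id) => this.unload(id)));
    }

    // ...actual backend load of modelId would happen here...
    this.loadedModels.add(modelId);
  }

  private async unload(modelId: string): Promise<void> {
    // Placeholder for the real backend unload call.
    this.loadedModels.delete(modelId);
  }
}
```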
Previously, the `autoUnload` flag was not being updated when set via config, causing models to be auto-unloaded regardless of the intended behavior. This patch ensures the setting is respected at runtime.
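As a small illustration of the fix described above (a sketch only; the config shape and class are hypothetical, and the `auto_unload` key follows the setting name in the PR title), the runtime flag simply needs to track the persisted setting:

```typescript
// Hypothetical config shape; only the relevant field is shown.
interface LlamacppConfig {
  auto_unload: boolean;
}

class LlamacppExtension {
  private autoUnload = true; // default before config is applied

  applyConfig(config: LlamacppConfig): void {
    // Previously the flag kept its initial value; now the configured
    // value is applied so auto-unload behaves as the user intended.
    this.autoUnload = config.auto_unload;
  }
}
```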
* fix: update edge case experimental feature MCP
* Update web-app/src/routes/settings/mcp-servers.tsx

Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
The variable was not initialized, which resulted in it always being set to true on startup. This change fixes it.
* fix: factory reset fails due to access denied error
* fix: unused import
* fix: tests
* fix: assistant with last used and fix metadata
* chore: revert instruction and desc
* chore: fix current assistant state
* chore: update metadata message assistant
* chore: update test case
…mg-src directive (#5967)
* fix: csp including img.shields.io in img-src directive
* fix: add huggingface upload cdn to img-src directive
This change improves the robustness of the llama.cpp extension's server port selection. Previously, the `getRandomPort()` method only checked for ports already in use by active sessions, which could lead to model load failures if the chosen port was occupied by another external process. This change introduces a new Tauri command, `is_port_available`, which performs a system-level check to ensure the randomly selected port is truly free before attempting to start the llama-server. It also adds a retry mechanism with a maximum number of attempts (20,000) to find an available port, throwing an error if no suitable port is found within the specified range after all attempts. This enhancement prevents port conflicts and improves the reliability and user experience of the llama.cpp extension within Jan. Closes #5965
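A rough TypeScript sketch of the retry loop described above, assuming the Tauri `invoke` helper from `@tauri-apps/api/core` and the `is_port_available` command mentioned in the change; the port range, session set, and the command's argument shape are assumptions for illustration.

```typescript
import { invoke } from '@tauri-apps/api/core';

const MAX_ATTEMPTS = 20_000;  // maximum attempts, per the description above
const PORT_MIN = 3000;        // illustrative range, not the real bounds
const PORT_MAX = 4000;

// Ports already claimed by active llama-server sessions.
const portsInUse = new Set<number>();

async function getRandomPort(): Promise<number> {
  for (let attempt = 0; attempt < MAX_ATTEMPTS; attempt++) {
    const port =
      PORT_MIN + Math.floor(Math.random() * (PORT_MAX - PORT_MIN + 1));

    // Skip ports already used by our own sessions.
    if (portsInUse.has(port)) continue;

    // System-level check via the Tauri command, so ports held by
    // external processes are rejected too (argument name assumed).
    const free = await invoke<boolean>('is_port_available', { port });
    if (free) return port;
  }
  throw new Error('No available port found in the configured range');
}
```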
…ows (#5972)
* fix: remove CREATE_NEW_PROCESS_GROUP flag for proper Ctrl-C handling. CREATE_NEW_PROCESS_GROUP prevented GenerateConsoleCtrlEvent from working, causing graceful shutdown failures; removed to enable proper signal handling.
* Revert "fix: remove CREATE_NEW_PROCESS_GROUP flag for proper Ctrl-C handling". This reverts commit 82ace3e.
* fix: use direct process termination instead of console events. Simplified Windows process cleanup by removing the console attachment logic and using the direct `child.kill()` method, which is more reliable for headless processes.
* Fix missing imports
* Switch to tokio::time
* Don't wait while forcefully terminating a process via the kill API on Windows. Disabled use of the windows-sys crate, as graceful shutdown on Windows is unreliable in this context. Updated cleanup.rs and server.rs to directly call `child.kill().await` for terminating processes on Windows, and improved logging for process termination and error handling during kill and wait. Removed the timeout-based graceful shutdown attempt on Windows since TerminateProcess is inherently forceful and immediate. This ensures more predictable process cleanup behavior on Windows.
* Final cleanups
urmauur approved these changes (Jul 30, 2025)
qnixsynapse approved these changes (Jul 30, 2025)
Changes
* fix: correctly apply `auto_unload` setting from config (#5953) @qnixsynapse (Contributor)
Contributors: @Minh141120, @bytrangle, @dependabot, @dependabot[bot], @ethanova, @gary149, @hiento09, @louis-menlo, @qnixsynapse, @ramonpzg and @urmauur