jan loads/unloads the LLM model into memory for each query #6446

MRDOCTOROO · 2025-09-14T03:13:39Z

Jan unloads the model after only a few seconds of inactivity, so every new query has to load it again. Each load takes a long time, and the overall cost is higher. Also, the model list cannot be obtained through the API service when the model is not started. I added a whitelist and started the server on 0.0.0.0, but the model still cannot be accessed cross-origin from outside the local machine.
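To help separate a network or binding problem from a browser cross-origin (CORS) problem, here is a minimal sketch that requests the model list from another machine without a browser. It assumes the API service is OpenAI-compatible and exposes `/v1/models`, and that it listens on port 1337; the port, the path, the `Authorization: Bearer` header, and the helper names are all assumptions for illustration, not confirmed Jan behavior.

```python
import json
import urllib.request
from typing import List, Optional


def build_url(host: str, port: int, path: str) -> str:
    """Join host, port and path into a plain http URL (no TLS assumed)."""
    return f"http://{host}:{port}/{path.lstrip('/')}"


def parse_model_ids(payload: dict) -> List[str]:
    """Extract ids from an OpenAI-style list response: {"data": [{"id": ...}]}."""
    return [item["id"] for item in payload.get("data", []) if "id" in item]


def list_models(host: str, port: int = 1337,
                api_key: Optional[str] = None,
                timeout: float = 5.0) -> List[str]:
    """Fetch the model list; port 1337 and /v1/models are assumptions."""
    req = urllib.request.Request(build_url(host, port, "/v1/models"))
    if api_key:
        # Assumed auth scheme; drop this header if the server needs no key.
        req.add_header("Authorization", f"Bearer {api_key}")
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return parse_model_ids(json.load(resp))
```

Calling `list_models("<server-ip>")` from a second machine raises `urllib.error.URLError` if the server is unreachable (for example, if it is bound only to localhost or blocked by a firewall). If the call succeeds but the same request fails from a web page, the cause is more likely the server's cross-origin settings, since `urllib` does not enforce CORS.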

Replies: 0 comments
