
Conversation

@mudler (Owner) commented Feb 13, 2025

Description

This pull request changes backend/cpp/llama/grpc-server.cpp to handle context capacity and remove redundant checks. The most important changes are a new check for when the context capacity is reached and improved logging when the context is exhausted.

Context capacity handling:

  • Added a check that sets the truncated and stopped_limit flags and logs a message when the context capacity is reached (backend/cpp/llama/grpc-server.cpp); a simplified sketch follows below.

Redundant check removal:

  • Removed redundant checks and improved handling when the context is exhausted: the slot is now released and an error message is logged (backend/cpp/llama/grpc-server.cpp).
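
For reviewers who want a concrete picture, here is a minimal, self-contained sketch of the kind of check described above. The server_slot struct, the field names (n_past, n_ctx, truncated, stopped_limit), and the check_context_capacity helper are simplified stand-ins modeled loosely on the llama.cpp server-slot pattern that grpc-server.cpp follows; this is not the actual patched code.

```cpp
#include <cstdio>

struct server_slot {
    int  n_past         = 0;     // tokens already evaluated for this slot
    int  n_ctx          = 0;     // context window available to this slot
    bool truncated      = false; // output was cut short
    bool stopped_limit  = false; // generation stopped because a limit was hit
    bool has_next_token = true;

    void release() { has_next_token = false; }
};

// Called once per generated token; returns false when generation must stop
// because the slot has run out of context capacity.
bool check_context_capacity(server_slot & slot) {
    if (slot.n_past + 1 >= slot.n_ctx) {
        // Context capacity reached: mark the output as truncated, record the
        // limit stop, release the slot, and log the condition.
        slot.truncated     = true;
        slot.stopped_limit = true;
        slot.release();
        std::fprintf(stderr, "context exhausted: n_past=%d n_ctx=%d\n",
                     slot.n_past, slot.n_ctx);
        return false;
    }
    return true;
}

int main() {
    server_slot slot;
    slot.n_ctx  = 8;
    slot.n_past = 7;   // the next token would exceed the window
    const bool ok = check_context_capacity(slot);
    std::printf("ok=%d truncated=%d stopped_limit=%d\n",
                ok, slot.truncated, slot.stopped_limit);
    return 0;
}
```

The point of centralizing the decision in one place is that the later, redundant checks this PR removes are no longer needed: once the slot is released and the flags are set, no further tokens are sampled for it.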

Notes for Reviewers

Signed commits

  • Yes, I signed my commits.

netlify bot commented Feb 13, 2025

Deploy Preview for localai ready!

Name Link
🔨 Latest commit 652d4b6
🔍 Latest deploy log https://app.netlify.com/sites/localai/deploys/67af310860aaa80008ea81f9
😎 Deploy Preview https://deploy-preview-4820--localai.netlify.app

@mudler mudler force-pushed the feat/improve_context_shift branch from bd0c800 to 652d4b6 on February 14, 2025 at 12:03
@mudler mudler added the bug Something isn't working label Feb 14, 2025
@mudler mudler merged commit 9e32fda into master Feb 14, 2025
25 checks passed
@mudler mudler deleted the feat/improve_context_shift branch February 14, 2025 13:55