server : add kill switch when server is stuck#20277

Merged
ggerganov merged 1 commit into master from gg/server-kill-switch
Mar 9, 2026

@ggerganov
Member

ref #20087 (comment)

Sometimes the server enters an infinite loop of empty batches. This change makes it easier to debug such cases.
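
For readers unfamiliar with the idea, a kill switch of this kind can be sketched as a counter of consecutive empty batches that trips once a limit is reached. This is a minimal illustration, not the code merged in this PR: the `empty_batch_guard` type, the `first_stuck_iteration` helper, and the threshold `MAX_EMPTY_CONSECUTIVE = 3` are all hypothetical names and values chosen for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical limit, chosen for illustration only.
constexpr int MAX_EMPTY_CONSECUTIVE = 3;

// Tracks how many empty batches were seen in a row.
struct empty_batch_guard {
    int n_empty_consecutive = 0;

    // Record one loop iteration. Returns true once the number of
    // consecutive empty batches reaches the limit.
    bool update(int n_batch_tokens) {
        assert(n_batch_tokens >= 0);
        if (n_batch_tokens > 0) {
            n_empty_consecutive = 0; // progress was made, reset the counter
            return false;
        }
        n_empty_consecutive++;
        return n_empty_consecutive >= MAX_EMPTY_CONSECUTIVE;
    }
};

// Returns the index of the iteration at which the guard trips,
// or -1 if it never does.
int first_stuck_iteration(const std::vector<int> & batch_sizes) {
    empty_batch_guard guard;
    for (std::size_t i = 0; i < batch_sizes.size(); ++i) {
        if (guard.update(batch_sizes[i])) {
            return static_cast<int>(i);
        }
    }
    return -1;
}
```

In a server loop, the point where `update` returns true is where one would log the slot state and abort, so that the stuck process leaves a clear trace instead of spinning silently.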

@ggerganov ggerganov requested a review from ngxson as a code owner March 9, 2026 07:37
@aldehir
Contributor

aldehir commented Mar 9, 2026

I managed to reproduce this reliably with gpt-oss-20b and 120b.

Windows & Nvidia RTX 3090.

repro.py
server.log 1
server.log 2

The reproducer runs an agentic scenario, and then runs it again. The second run is what triggers the loop.

Let me know if there's anything else I can provide.

@ggerganov ggerganov merged commit 107d599 into master Mar 9, 2026
71 of 75 checks passed
@ggerganov ggerganov deleted the gg/server-kill-switch branch March 9, 2026 08:33
std::vector<server_slot> slots;

int slots_debug = 0;
int n_empty_consequtive = 0;
Contributor

@ggerganov
Did you mean consecutive?

bartowski1182 pushed a commit to bartowski1182/llama.cpp that referenced this pull request Mar 10, 2026
alam0rt added a commit to alam0rt/nix-config that referenced this pull request Mar 14, 2026
…ound)

- -c 32768 → 65536: openclaw compaction was failing at 41784 tokens
- Remove --ctx-checkpoints 1: triggers empty-batch loop bug in b8255
  which causes the kill-switch abort (server-context.cpp:2634)
  See: ggml-org/llama.cpp#20277
Ethan-a2 pushed a commit to Ethan-a2/llama.cpp that referenced this pull request Mar 20, 2026
3 participants