
UPSTREAM PR #17775: server: support multiple generations from one prompt (OAI "n" option) #444

Open
loci-dev wants to merge 2 commits into main from upstream-PR17775-branch_ngxson-xsn/add_n_support
loci-dev commented Dec 5, 2025

Mirrored from ggml-org/llama.cpp#17775

Fix ggml-org/llama.cpp#11142

TODO:

  • only release parent slot when all children are done
  • do not allow context shifting
  • add OAI output format
  • add tests
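
The first TODO item (release the parent slot only once every child is done) can be illustrated with a small sketch. This is not the llama.cpp implementation: `Slot`, `spawn_children`, and `finish` are hypothetical names, and the sketch assumes the parent slot produces one of the `n` generations itself while `n - 1` child slots produce the rest.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Slot:
    """Hypothetical stand-in for a server slot; not the llama.cpp struct."""
    slot_id: int
    parent: Slot | None = None
    children: list[Slot] = field(default_factory=list)
    done: bool = False
    released: bool = False


def spawn_children(parent: Slot, n: int) -> list[Slot]:
    """Create n - 1 children (assumption: the parent handles one generation)."""
    for i in range(1, n):
        parent.children.append(Slot(slot_id=parent.slot_id + i, parent=parent))
    return parent.children


def finish(slot: Slot) -> None:
    """Mark a slot as finished and release whatever is now safe to release."""
    slot.done = True
    root = slot.parent if slot.parent is not None else slot
    if slot.parent is not None:
        # A child holds no state that others depend on, so free it right away.
        slot.released = True
    # The parent is released only when it and all of its children are done.
    if root.done and all(child.done for child in root.children):
        root.released = True
```

With `n = 3`, finishing the parent first leaves it unreleased until both children have also finished; with `n = 1` there are no children and the parent is released as soon as it finishes.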

loci-dev force-pushed the main branch 27 times, most recently from 32aa2bc to 0044ef5, on December 8, 2025 13:19
loci-dev force-pushed the main branch 30 times, most recently from de9b0c0 to b28744d, on December 13, 2025 10:08