Ying1123 (Member) commented Oct 4, 2024

A major performance bug can occur in non-streaming scenarios with small batch sizes. do_not_get_new_batch is manipulated incorrectly there, which can break continuous batching.

This PR fixes the issue.
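
To illustrate why a mishandled "do not fetch a new batch" flag hurts throughput, here is a toy simulation. It is not the scheduler code from this PR: the function `simulate`, the `sticky_flag` parameter, and the way the flag is set are all hypothetical, and only the name do_not_get_new_batch comes from the description above. It assumes the failure mode is that the flag stays set while any request is still running, so queued requests cannot join the running batch.

```python
from collections import deque


def simulate(request_lengths, max_batch_size, sticky_flag):
    """Toy decode loop; returns the number of decode steps to finish all requests.

    sticky_flag=True models a hypothetical bug: the flag that suppresses
    fetching new requests stays set as long as anything is still running.
    """
    waiting = deque(request_lengths)  # remaining decode steps per queued request
    running = []                      # remaining decode steps per active request
    steps = 0
    do_not_get_new_batch = False

    while waiting or running:
        # Continuous batching: fill free slots before every decode step.
        if not do_not_get_new_batch:
            while waiting and len(running) < max_batch_size:
                running.append(waiting.popleft())

        # One decode step: every active request produces one token.
        steps += 1
        running = [r - 1 for r in running if r > 1]

        if sticky_flag:
            # Hypothetical bug: never admit new work until the batch drains.
            do_not_get_new_batch = bool(running)
        else:
            do_not_get_new_batch = False

    return steps


if __name__ == "__main__":
    lengths = [8, 1, 1, 1, 1, 1, 1]  # one long request, six short ones
    print("healthy:", simulate(lengths, max_batch_size=2, sticky_flag=False))
    print("sticky: ", simulate(lengths, max_batch_size=2, sticky_flag=True))
```

With one long request and six short ones at a batch size of 2, the healthy loop finishes in 8 steps while the sticky-flag loop needs 11, because the short requests wait for the long one to finish instead of sharing its decode steps.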

Ying1123 force-pushed the ying-perf-fix branch 2 times, most recently from 1559c1e to e6113bd, on October 4, 2024 01:08
Ying1123 changed the title from "[Fix] Fix major performance bug" to "[Fix] Fix major performance bug in certain cases" on Oct 4, 2024
Ying1123 marked this pull request as draft on October 4, 2024 01:24
Ying1123 force-pushed the ying-perf-fix branch 8 times, most recently from eba3ac8 to f58a5cf, on October 4, 2024 04:26
Ying1123 requested a review from hnyls2002 on October 4, 2024 04:26
Ying1123 marked this pull request as ready for review on October 4, 2024 04:33
hnyls2002 enabled auto-merge (squash) on October 4, 2024 08:33
hnyls2002 merged commit 04b262c into main on Oct 4, 2024
hnyls2002 deleted the ying-perf-fix branch on October 4, 2024 08:51
zhyncs mentioned this pull request on Nov 17, 2024 (3 tasks)
timethink pushed a commit to timethink/sglang that referenced this pull request on Mar 9, 2025

3 participants