Skip to content

[Optimization] Update estimated_num_new_pages logic in TokenToKVPoolAllocator#8794

Merged
merrymercy merged 3 commits intosgl-project:mainfrom
YiXR:main
Aug 10, 2025
Merged

[Optimization] Update estimated_num_new_pages logic in TokenToKVPoolAllocator#8794
merrymercy merged 3 commits intosgl-project:mainfrom
YiXR:main

Conversation

@YiXR
Copy link
Copy Markdown
Contributor

@YiXR YiXR commented Aug 5, 2025

Motivation

This PR update estimated_num_new_pages logic and format code which is introduced by the previous PR: #8133

Modifications

Accuracy Test

Benchmark & Profiling

Checklist

…llocator

Signed-off-by: Xingrui Yi <yixingrui@linux.alibaba.com>
@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

Copy link
Copy Markdown
Collaborator

@ShangmingCai ShangmingCai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

Copy link
Copy Markdown
Contributor

@merrymercy merrymercy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me.
It seems this sort is only useful for PD case. Should we add a flag so that it only turns on during PD?

@YiXR
Copy link
Copy Markdown
Contributor Author

YiXR commented Aug 7, 2025

Looks good to me. It seems this sort is only useful for PD case. Should we add a flag so that it only turns on during PD?

done

Signed-off-by: Xingrui Yi <yixingrui@linux.alibaba.com>
@merrymercy merrymercy merged commit 0418b9d into sgl-project:main Aug 10, 2025
60 of 66 checks passed
narutolhy pushed a commit to narutolhy/sglang that referenced this pull request Aug 17, 2025
…llocator (sgl-project#8794)

Signed-off-by: Xingrui Yi <yixingrui@linux.alibaba.com>
Co-authored-by: Xingrui Yi <yixingrui@linux.alibaba.com>
MahmoudAshraf97 pushed a commit to MahmoudAshraf97/sglang that referenced this pull request Sep 8, 2025
…llocator (sgl-project#8794)

Signed-off-by: Xingrui Yi <yixingrui@linux.alibaba.com>
Co-authored-by: Xingrui Yi <yixingrui@linux.alibaba.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants