-
Notifications
You must be signed in to change notification settings - Fork 5.1k
feat: mtp support dp-attention #6081
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
zhyncs
merged 66 commits into
sgl-project:main
from
u4lr451:feature_mtp_support_dp_attention
Jun 17, 2025
Merged
Changes from all commits
Commits
Show all changes
66 commits
Select commit
Hold shift + click to select a range
054ed81
feat: mtp support dp-attention with cuda-graph (#6080)
a602a29
fix dp+mtp bugs
ed6b060
Merge branch 'main' into feature_mtp_support_dp_attention
u4lr451 6cc38e7
fix: MTP+cudagraph+DPAtten and fa3
TianQiLin666666 a526032
Merge remote-tracking branch 'github/main' into feature_mtp_support_d…
672d6be
feat:Enable CUDA Graph for draft_extend while supporting dp-attention…
b130867
fix: Adjust the init_cuda_graph_state and fixbug (#6081)
35fe3df
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
3ceedbe
Performance: Eliminate performance impact in non-dp-attention+mtp sce…
04ede24
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
96b7209
fix bugs for mtp (#6081)
54dd1f7
fix enable cuda graph for draft_extend stage while supporting dp-atte…
b01de94
Merge branch 'main' into feature_mtp_support_dp_attention
u4lr451 5805662
Merge branch 'main' into feature_mtp_support_dp_attention
ch-wan 990fe38
Merge branch 'main' into feature_mtp_support_dp_attention
Qiaolin-Yu 658fd39
Added test cases for dp-attention + mtp (#6081)
8e47432
Merge commit '60fdad7cf343333e956a3889c12956396a1516bf' into u4lr451:…
57e8f1c
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
e15db54
Update mtp+dp-attention test cases (#6081)
64cc457
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
ed7d4e2
Merge branch 'main' into feature_mtp_support_dp_attention
u4lr451 5cba657
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
b54f934
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
c336c53
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
76f6cde
compatibility for fa3 (#6081)
23f82db
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
9dff016
fix
Qiaolin-Yu cc124fb
fix
Qiaolin-Yu 55aefb7
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
7d44df1
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
767ff45
Merge branch 'main' into feature_mtp_support_dp_attention
ch-wan 4982404
Merge branch 'main' into feature_mtp_support_dp_attention
u4lr451 9be85b7
Remove redundant code (#6081)
6690410
Merge branch 'main' into feature_mtp_support_dp_attention
Qiaolin-Yu d4ec8c8
nit update
ch-wan 1218312
nit fix (#6081)
42d2403
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
6f9478a
Merge branch 'main' into feature_mtp_support_dp_attention
ch-wan 4e54751
Merge branch 'main' into feature_mtp_support_dp_attention
zhyncs ec987fc
update scheduler and eagle worker
ch-wan 9c86afe
update eagle_worker (#6081)
b0cb235
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
973edde
update forward_batch_speculative_generation
4f299ae
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
d2a162f
Merge commit '55561e25533f195e6d6b11e1c3d2449bc9908495' into pr/u4lr4…
ch-wan 6e7c69e
polish global sync
ch-wan 37af1a2
refactor eagle_worker.py
ch-wan 64cc292
fix
ch-wan 5c6b93e
Merge branch 'main' into feature_mtp_support_dp_attention
ch-wan 3744a72
Merge remote-tracking branch 'origin/HEAD' into pr/u4lr451/6081
ch-wan ab26c11
format
ch-wan c07ba77
fix refactor bug
ff07187
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
97f531b
fix enable_dp_lm_head when dp-size == tp-size
3f686b1
Performance: Support enabling CUDA graph when idle batches exist
5ae3c3d
Merge remote-tracking branch 'github/main' into u4lrssh.feature_mtp_s…
f3854ee
Merge remote-tracking branch 'github/main' into u4lr451:feature_mtp_s…
841defa
refine code for dp lm head
ch-wan 2f64ad7
Merge branch 'main' into feature_mtp_support_dp_attention
zhyncs a279680
Revert "Performance: Support enabling CUDA graph when idle batches ex…
ch-wan 038ca0f
add a note
ch-wan 3bc16e4
Merge commit '873ae12cee348dcb579a4c7456d789ef4441f3bf' into pr/u4lr4…
ch-wan 16f8a63
Merge branch 'main' into feature_mtp_support_dp_attention
zhyncs e4bf571
fix merge error
ch-wan 3a5b9d5
clean code and add comments
ch-wan a2effc0
Merge branch 'main' into feature_mtp_support_dp_attention
zhyncs File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.