Commit 13c355c
wenfeng.wf
Merge branch 'main' into nixl
* main: (29 commits)
reduce moe_align_block_size_kernel small batch mode overhead (sgl-project#5086)
Fix DeepSeek error when using DeepEP mode (sgl-project#5190)
[metrics] Add in queue metrics (sgl-project#4444)
fix: log warning when disable cuda graph (sgl-project#5209)
Add H20 dtype fp8_w8a8 fused MoE kernel tuning configs for DeepSeek V3/R1 (sgl-project#5196)
sgl-kernel use cutlass latest version for fp8 blockwise gemm (sgl-project#5207)
update grok test (sgl-project#5171)
model: support mllama4 (sgl-project#5144)
[ci] fix ci test fused_moe op (sgl-project#5102)
Support Llama4 fp8 inference (sgl-project#5194)
Optimize topk operation in llama4 (sgl-project#5128)
Fix ci test "test_eval_fp8_accuracy" failed (sgl-project#5185)
[Misc] clean up vllm in sgl-kernel test (sgl-project#5189)
Let `bench_one_batch` support `enable_dp_attention` (sgl-project#4058)
[DeepEP] fix: import buffer error (sgl-project#5179)
fix: use DeepEPDispatcher on CUDA (sgl-project#5180)
feat: add DeepGEMM build warning (sgl-project#5176)
docs: remove the use of Downward API for LWS_WORKER_INDEX (sgl-project#5110)
[Fix] DeepEP Compatibility with Low Latency (sgl-project#5068)
[Bugfix] Fix index out of bounds in local attention with large sequences (sgl-project#5173)
...
# Conflicts:
# python/sglang/srt/disaggregation/mini_lb.py
# python/sglang/srt/managers/scheduler.py83 files changed
Lines changed: 10523 additions & 1638 deletions
File tree
- .github/workflows
- docs
- backend
- references
- python/sglang
- srt
- configs
- disaggregation
- entrypoints
- layers
- attention
- moe
- ep_moe
- fused_moe_triton
- configs
- quantization
- compressed_tensors
- managers
- multimodal_processors
- metrics
- model_executor
- model_loader
- models
- speculative
- test
- sgl-kernel
- benchmark
- csrc
- cpu
- cutlass_extensions/gemm
- collective
- gemm
- moe
- tests
- test/srt
- models
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
56 | 56 | | |
57 | 57 | | |
58 | 58 | | |
59 | | - | |
60 | | - | |
| 59 | + | |
| 60 | + | |
61 | 61 | | |
62 | 62 | | |
63 | 63 | | |
64 | 64 | | |
65 | 65 | | |
66 | 66 | | |
67 | 67 | | |
68 | | - | |
69 | 68 | | |
70 | 69 | | |
71 | 70 | | |
| |||
105 | 104 | | |
106 | 105 | | |
107 | 106 | | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
108 | 149 | | |
109 | 150 | | |
110 | 151 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
371 | 371 | | |
372 | 372 | | |
373 | 373 | | |
374 | | - | |
| 374 | + | |
| 375 | + | |
| 376 | + | |
375 | 377 | | |
376 | 378 | | |
377 | 379 | | |
| |||
412 | 414 | | |
413 | 415 | | |
414 | 416 | | |
415 | | - | |
416 | | - | |
417 | | - | |
| 417 | + | |
| 418 | + | |
| 419 | + | |
| 420 | + | |
| 421 | + | |
| 422 | + | |
| 423 | + | |
418 | 424 | | |
419 | 425 | | |
420 | 426 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
17 | | - | |
| 17 | + | |
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
| |||
95 | 95 | | |
96 | 96 | | |
97 | 97 | | |
98 | | - | |
99 | | - | |
100 | | - | |
101 | | - | |
102 | 98 | | |
103 | 99 | | |
104 | 100 | | |
| |||
164 | 160 | | |
165 | 161 | | |
166 | 162 | | |
167 | | - | |
168 | | - | |
169 | | - | |
170 | | - | |
171 | 163 | | |
172 | 164 | | |
173 | 165 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
60 | 60 | | |
61 | 61 | | |
62 | 62 | | |
| 63 | + | |
63 | 64 | | |
64 | 65 | | |
65 | 66 | | |
| |||
184 | 185 | | |
185 | 186 | | |
186 | 187 | | |
| 188 | + | |
187 | 189 | | |
188 | 190 | | |
189 | 191 | | |
| |||
199 | 201 | | |
200 | 202 | | |
201 | 203 | | |
| 204 | + | |
202 | 205 | | |
203 | 206 | | |
204 | 207 | | |
| |||
220 | 223 | | |
221 | 224 | | |
222 | 225 | | |
| 226 | + | |
223 | 227 | | |
224 | 228 | | |
225 | 229 | | |
| |||
238 | 242 | | |
239 | 243 | | |
240 | 244 | | |
| 245 | + | |
241 | 246 | | |
242 | 247 | | |
243 | 248 | | |
| |||
249 | 254 | | |
250 | 255 | | |
251 | 256 | | |
| 257 | + | |
252 | 258 | | |
253 | 259 | | |
254 | 260 | | |
255 | 261 | | |
256 | 262 | | |
257 | 263 | | |
258 | 264 | | |
| 265 | + | |
| 266 | + | |
| 267 | + | |
| 268 | + | |
| 269 | + | |
| 270 | + | |
| 271 | + | |
| 272 | + | |
| 273 | + | |
| 274 | + | |
| 275 | + | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
259 | 279 | | |
260 | 280 | | |
261 | 281 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
279 | 279 | | |
280 | 280 | | |
281 | 281 | | |
| 282 | + | |
282 | 283 | | |
283 | 284 | | |
284 | 285 | | |
| |||
485 | 486 | | |
486 | 487 | | |
487 | 488 | | |
488 | | - | |
489 | 489 | | |
| 490 | + | |
490 | 491 | | |
491 | 492 | | |
492 | 493 | | |
| |||
0 commit comments