Commit 05319dc
wangweiwei
Merge remote-tracking branch 'www/pr/ks' into pr/ks
* www/pr/ks: (265 commits)
[BugFix][PD]Fix metadata_buffer_index leak when aborted in PD (sgl-project#17483)
Refactoring Mooncake TE as a shared distributed component (sgl-project#17810)
[ModelOPT] Support Qwen 3 Next Coder NVFP4 (sgl-project#18224)
Update author information in pyproject.toml (sgl-project#18453)
[Kimi-K2.5] Fix missing `quant_config` in `KimiK25` (sgl-project#18440)
Add tensor parallelism support to LFM2 ShortConv layers (sgl-project#17777)
[diffusion] chore: revise process title (sgl-project#18446)
Fix TRT-LLM MLA backend applying k_scale to BF16 KV cache in BMM1 (sgl-project#18396)
[diffusion] refactor: group component loaders under the component_loaders/ directory (sgl-project#18438)
[ModelOpt] Fix broken Qwen3-235B-A22B-Instruct-2507-NVFP4 launch (sgl-project#18189)
[diffusion] feat: support efficient sequence shard (sgl-project#18161)
[CI] fix: notebook ci may not working (sgl-project#18417)
fix: sync server_args.kv_cache_dtype when detecting FP8 KV cache (sgl-project#18394)
[Fix] Fix backend selection after flashinfer version update (sgl-project#18364)
[diffusion] platform: support WAN/FLUX/Qwen-Image/Qwen-Image-edit on Ascend (sgl-project#13662)
fix: fix NVFP4 Kimi-K2.5 weight mapping and exclude list (sgl-project#18370)
[diffusion] feat: support saving videos directly on the server to avoid the overhead of tensor transfer (sgl-project#18253)
[diffusion] fix: respect dist_timeout option (sgl-project#18386)
[Doc] Fix outdated `--fp4-gemm-backend` documentation (sgl-project#18350)
[diffusion] fix: remove unnecessary norm_type argument from GLM-Image dits (sgl-project#18382)
...File tree
671 files changed
+43948
-16540
lines changed- .github
- workflows
- benchmark
- bench_rope
- kernels/fused_moe_triton
- docker
- docs
- advanced_features
- basic_usage
- developer_guide
- get_started
- platforms
- references
- frontend
- multi_node_deployment/lws_pd
- supported_models
- extending
- image_generation
- retrieval_ranking
- specialized
- text_generation
- python
- sglang
- jit_kernel
- benchmark
- csrc
- elementwise
- gemm/marlin
- diffusion/cutedsl
- common
- flash_attention/cute
- include/sgl_kernel
- tests
- multimodal_gen
- configs
- models
- bridges
- dits
- encoders
- vaes
- pipeline_configs
- sample
- docs
- runtime
- cache
- distributed
- device_communicators
- entrypoints
- openai
- layers
- attention/backends
- loader
- component_loaders
- managers
- models
- bridges
- dits
- encoders
- schedulers
- vaes
- pipelines_core
- stages
- model_specific_stages
- pipelines
- platforms
- utils
- test
- scripts
- server
- ascend
- srt
- batch_overlap
- compilation
- configs
- disaggregation
- ascend
- common
- mooncake
- mori
- nixl
- distributed
- device_communicators
- entrypoints
- openai
- function_call
- hardware_backend/npu
- attention
- modules
- moe
- quantization
- layers
- attention
- fla
- mamba
- nsa
- triton_ops
- deep_gemm_wrapper
- moe
- ep_moe
- fused_moe_triton
- configs/triton_3_5_1
- moe_runner
- token_dispatcher
- quantization
- compressed_tensors
- utils
- lora
- managers
- mem_cache
- storage/mooncake_store
- metrics
- model_executor
- model_loader
- models
- deepseek_common
- multimodal
- processors
- multiplex
- parser
- speculative
- tracing
- utils
- test
- ascend
- attention
- kits
- server_fixtures
- scripts
- ci
- amd
- cuda
- utils
- playground
- sgl-kernel
- csrc
- allreduce
- cpu
- elementwise
- gemm
- python/sgl_kernel
- sgl-model-gateway
- src
- mesh
- proto
- routers
- mesh
- tests
- test
- lm_eval_configs
- manual
- layers/attention/nsa
- models
- registered
- 4-gpu-models
- 8-gpu-models
- amd
- accuracy
- mi30x
- mi35x
- disaggregation
- perf
- mi30x
- mi35x
- ascend
- basic_function
- interface
- parameter
- llm_models
- vlm_models
- attention
- core
- distributed
- ep
- function_call
- hicache
- kernels
- layers
- mamba
- lora
- model_loading
- models
- moe
- openai_server/basic
- profiling
- quant
- radix_cache
- rl
- sampling
- scheduler
- spec
- eagle
- utils
- utils
- srt
- ascend
- cpu
- xpu
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
671 files changed
+43948
-16540
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
209 | 209 | | |
210 | 210 | | |
211 | 211 | | |
| 212 | + | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
| 218 | + | |
212 | 219 | | |
213 | 220 | | |
214 | 221 | | |
| |||
412 | 419 | | |
413 | 420 | | |
414 | 421 | | |
| 422 | + | |
| 423 | + | |
| 424 | + | |
| 425 | + | |
| 426 | + | |
| 427 | + | |
| 428 | + | |
415 | 429 | | |
416 | 430 | | |
417 | 431 | | |
| |||
811 | 825 | | |
812 | 826 | | |
813 | 827 | | |
| 828 | + | |
| 829 | + | |
| 830 | + | |
| 831 | + | |
| 832 | + | |
| 833 | + | |
| 834 | + | |
814 | 835 | | |
815 | 836 | | |
816 | 837 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3 | 3 | | |
4 | 4 | | |
5 | 5 | | |
| 6 | + | |
6 | 7 | | |
7 | | - | |
8 | | - | |
| 8 | + | |
| 9 | + | |
9 | 10 | | |
10 | 11 | | |
11 | 12 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
108 | 108 | | |
109 | 109 | | |
110 | 110 | | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3 | 3 | | |
4 | 4 | | |
5 | 5 | | |
| 6 | + | |
6 | 7 | | |
7 | 8 | | |
8 | 9 | | |
| 10 | + | |
| 11 | + | |
9 | 12 | | |
10 | 13 | | |
11 | 14 | | |
12 | 15 | | |
13 | 16 | | |
14 | 17 | | |
15 | 18 | | |
| 19 | + | |
| 20 | + | |
16 | 21 | | |
17 | 22 | | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
18 | 29 | | |
| 30 | + | |
19 | 31 | | |
20 | | - | |
| 32 | + | |
21 | 33 | | |
22 | 34 | | |
23 | 35 | | |
| |||
43 | 55 | | |
44 | 56 | | |
45 | 57 | | |
| 58 | + | |
46 | 59 | | |
47 | 60 | | |
48 | 61 | | |
| 62 | + | |
49 | 63 | | |
50 | 64 | | |
51 | 65 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
34 | 34 | | |
35 | 35 | | |
36 | 36 | | |
| 37 | + | |
37 | 38 | | |
38 | 39 | | |
39 | 40 | | |
| |||
582 | 583 | | |
583 | 584 | | |
584 | 585 | | |
585 | | - | |
| 586 | + | |
586 | 587 | | |
587 | 588 | | |
588 | 589 | | |
589 | 590 | | |
590 | 591 | | |
591 | | - | |
| 592 | + | |
592 | 593 | | |
593 | 594 | | |
594 | 595 | | |
| |||
793 | 794 | | |
794 | 795 | | |
795 | 796 | | |
| 797 | + | |
| 798 | + | |
| 799 | + | |
| 800 | + | |
| 801 | + | |
| 802 | + | |
| 803 | + | |
| 804 | + | |
| 805 | + | |
| 806 | + | |
| 807 | + | |
| 808 | + | |
| 809 | + | |
| 810 | + | |
| 811 | + | |
| 812 | + | |
| 813 | + | |
| 814 | + | |
| 815 | + | |
| 816 | + | |
| 817 | + | |
| 818 | + | |
| 819 | + | |
| 820 | + | |
| 821 | + | |
| 822 | + | |
| 823 | + | |
| 824 | + | |
| 825 | + | |
| 826 | + | |
| 827 | + | |
| 828 | + | |
| 829 | + | |
796 | 830 | | |
797 | 831 | | |
798 | 832 | | |
| |||
834 | 868 | | |
835 | 869 | | |
836 | 870 | | |
837 | | - | |
838 | | - | |
839 | | - | |
840 | 871 | | |
| 872 | + | |
| 873 | + | |
| 874 | + | |
841 | 875 | | |
842 | 876 | | |
843 | 877 | | |
| |||
853 | 887 | | |
854 | 888 | | |
855 | 889 | | |
| 890 | + | |
856 | 891 | | |
857 | 892 | | |
858 | 893 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
165 | 165 | | |
166 | 166 | | |
167 | 167 | | |
168 | | - | |
| 168 | + | |
169 | 169 | | |
170 | 170 | | |
171 | 171 | | |
| |||
178 | 178 | | |
179 | 179 | | |
180 | 180 | | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
| 202 | + | |
| 203 | + | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
| 207 | + | |
| 208 | + | |
| 209 | + | |
| 210 | + | |
| 211 | + | |
| 212 | + | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
| 218 | + | |
| 219 | + | |
| 220 | + | |
| 221 | + | |
| 222 | + | |
| 223 | + | |
| 224 | + | |
| 225 | + | |
| 226 | + | |
| 227 | + | |
| 228 | + | |
| 229 | + | |
| 230 | + | |
| 231 | + | |
| 232 | + | |
| 233 | + | |
| 234 | + | |
| 235 | + | |
| 236 | + | |
| 237 | + | |
| 238 | + | |
| 239 | + | |
| 240 | + | |
| 241 | + | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
| 245 | + | |
| 246 | + | |
| 247 | + | |
| 248 | + | |
| 249 | + | |
| 250 | + | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
| 255 | + | |
| 256 | + | |
| 257 | + | |
| 258 | + | |
| 259 | + | |
| 260 | + | |
| 261 | + | |
| 262 | + | |
| 263 | + | |
| 264 | + | |
| 265 | + | |
| 266 | + | |
| 267 | + | |
| 268 | + | |
| 269 | + | |
| 270 | + | |
| 271 | + | |
| 272 | + | |
| 273 | + | |
| 274 | + | |
| 275 | + | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
| 279 | + | |
| 280 | + | |
| 281 | + | |
| 282 | + | |
| 283 | + | |
| 284 | + | |
| 285 | + | |
| 286 | + | |
| 287 | + | |
| 288 | + | |
| 289 | + | |
| 290 | + | |
| 291 | + | |
| 292 | + | |
| 293 | + | |
| 294 | + | |
| 295 | + | |
| 296 | + | |
181 | 297 | | |
182 | 298 | | |
183 | 299 | | |
184 | 300 | | |
| 301 | + | |
185 | 302 | | |
| 303 | + | |
| 304 | + | |
186 | 305 | | |
187 | 306 | | |
188 | 307 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
95 | 95 | | |
96 | 96 | | |
97 | 97 | | |
98 | | - | |
| 98 | + | |
99 | 99 | | |
100 | 100 | | |
101 | 101 | | |
| |||
118 | 118 | | |
119 | 119 | | |
120 | 120 | | |
121 | | - | |
| 121 | + | |
122 | 122 | | |
123 | 123 | | |
124 | 124 | | |
| |||
179 | 179 | | |
180 | 180 | | |
181 | 181 | | |
182 | | - | |
| 182 | + | |
183 | 183 | | |
184 | 184 | | |
185 | 185 | | |
| |||
199 | 199 | | |
200 | 200 | | |
201 | 201 | | |
202 | | - | |
| 202 | + | |
203 | 203 | | |
204 | 204 | | |
205 | 205 | | |
| |||
0 commit comments