Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix ROCm build to respect PYTORCH_ROCM_ARCH for GPU_TARGETS (issue #22590) ci/build documentation Improvements or additions to documentation nvidia rocm Related to AMD ROCm v1
#31079 opened Dec 20, 2025 by westers Loading…
5 tasks done
[Doc] Add warning regarding GPU profiling limitations on WSL2 documentation Improvements or additions to documentation
#31078 opened Dec 20, 2025 by kjuuii Loading…
Fix ROCm CUDA graph replay synchronization bug (issue #29521) ci/build documentation Improvements or additions to documentation nvidia rocm Related to AMD ROCm v1
#31077 opened Dec 20, 2025 by westers Loading…
5 tasks done
[ROCm][CI/Build] Fix Dockerfile.rocm to set VLLM_TARGET_DEVICE=rocm ci/build documentation Improvements or additions to documentation rocm Related to AMD ROCm v1
#31075 opened Dec 20, 2025 by westers Loading…
Fix formatting of softmax equation in documentation documentation Improvements or additions to documentation
#31074 opened Dec 20, 2025 by ssaketh-ch Loading…
5 tasks
[ROCm][Test] Skip RTN quantization tests on ROCm documentation Improvements or additions to documentation rocm Related to AMD ROCm v1
#31072 opened Dec 20, 2025 by westers Loading…
[Doc] Clarify FP8 KV cache computation workflow documentation Improvements or additions to documentation v1
#31071 opened Dec 20, 2025 by westers Loading…
[Doc] Fix image rendering in paged_attention.md documentation Improvements or additions to documentation v1
#31070 opened Dec 20, 2025 by westers Loading…
Fix LoRA prefix cache corruption by using lora_int_id ci/build rocm Related to AMD ROCm v1
#31069 opened Dec 20, 2025 by westers Loading…
[misc] allow overriding the TAG variable in auto_tune.sh performance Performance-related issues
#31065 opened Dec 20, 2025 by kkr16 Loading…
3 of 4 tasks
[ROCm][Docker] Add gfx1103 support to Docker builds ci/build rocm Related to AMD ROCm
#31062 opened Dec 20, 2025 by westers Loading…
[Quantization] add marlin w4a8/w8a8 check
#31061 opened Dec 20, 2025 by jinzhen-lin Loading…
[Frontend] add logprob, compression_rate to 'verbose_json' features documentation Improvements or additions to documentation frontend
#31059 opened Dec 20, 2025 by sangbumlikeagod Loading…
5 tasks
[CI] Fix H200 Distributed test ci/build documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#31054 opened Dec 20, 2025 by LucasWilkinson Loading…
[MoE Refactor] Use modular kernel for unquantized Triton MoE ready ONLY add when PR is ready to merge/full CI is needed
#31052 opened Dec 20, 2025 by zyongye Loading…
[MoE Refactor] Split invoke_fused_moe_kernel ready ONLY add when PR is ready to merge/full CI is needed
#31050 opened Dec 20, 2025 by zyongye Loading…
[CI] Add Qwen3-Next-FP8 to Blackwell model tests qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#31049 opened Dec 19, 2025 by vadiklyutiy Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.