Skip to content

Pull requests: alibaba/rtp-llm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: seperate different model flashinfer params
#697 opened Feb 11, 2026 by Vinkle-hzt Loading…
feat: optimize memory connector
#695 opened Feb 10, 2026 by li-xiao-qing Loading…
feat: MTP-compatible Streaming Parsing detectors
#694 opened Feb 10, 2026 by soaringk Loading…
Develop/hybrid kvcache refactor
#693 opened Feb 10, 2026 by xinfei-shi Loading…
feat: add acclbarex envs in cache store
#689 opened Feb 9, 2026 by intermezzi Loading…
Qwen3 next speculative decoding support
#688 opened Feb 9, 2026 by Vinkle-hzt Loading…
feat: update cuda 12.9
#687 opened Feb 9, 2026 by Bruce-Lee-LY Loading…
add metrics for allocator
#686 opened Feb 9, 2026 by jianglan89 Loading…
Feat: Qwen2.5 VIT hipblaslt swizzle
#685 opened Feb 9, 2026 by yuzho-amd Loading…
feat: add master queue mechanism
#684 opened Feb 8, 2026 by sunmiaozju Loading…
feat: ROCm mha batch prefill
#683 opened Feb 6, 2026 by yuzho-amd Loading…
[WIP] feat: support elastic ep downscale in DP
#681 opened Feb 6, 2026 by intermezzi Loading…
feature - support deepgemm warmup
#679 opened Feb 6, 2026 by jianglan89 Loading…
Feat/support ds 32
#677 opened Feb 5, 2026 by Nancheng-11 Loading…
Feature/refactor global info 1
#674 opened Feb 4, 2026 by wanglining97 Loading…
feat: rocm python mtp
#668 opened Feb 3, 2026 by amd-yilizhao Loading…
add python atrex pa
#667 opened Feb 2, 2026 by yanglf1121 Loading…
feat: support hipgraph on python mode
#662 opened Feb 2, 2026 by muse-coder Loading…
feat: add remote_connector
#660 opened Feb 2, 2026 by MMeecatfish Loading…
fix - w13 fused
#657 opened Jan 30, 2026 by jianglan89 Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.