-
Notifications
You must be signed in to change notification settings - Fork 181
Pull requests: SemiAnalysisAI/InferenceX
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Throwaway: conc-64 gsm8k eval for DEP8+MTP3 dispatch token bug
non-canary-full-sweep-enabled
Run the full sweep without the canary gate (full search space, no trim)
#1659
opened Jun 3, 2026 by
Oseltamivir
Collaborator
Loading…
[NV] Add Kimi K2.5 FP4 B200/B300 EP sweep
full-sweep-enabled
#1658
opened Jun 3, 2026 by
jasonlizhengjian
Collaborator
Loading…
AMD - gpt-oss vllm mxfp4: AITER tuning + n-gram spec decode + server …
AMD
#1657
opened Jun 3, 2026 by
nehaprakriya
Loading…
[WIP] Update Dsv4 B300 configs
full-sweep-enabled
#1656
opened Jun 3, 2026 by
wzhao18
Collaborator
Loading…
[WIP] Update B200 Dsv4 configs
full-sweep-enabled
#1655
opened Jun 3, 2026 by
wzhao18
Collaborator
Loading…
fix(power): classify zero-decode-GPU multinode runs as aggregated
#1646
opened Jun 2, 2026 by
arygupt
Collaborator
Loading…
dsr1-fp4-mi355x-sglang: bump ROCm 7.0->7.2 image + add TP4 search-space
AMD
#1645
opened Jun 2, 2026 by
JohnQinAMD
Collaborator
Loading…
Use official TRT-LLM image (1.3.0rc15.post1) for DSv4 B300 TRT (non-MTP + MTP)
full-sweep-enabled
#1636
opened Jun 1, 2026 by
Oseltamivir
Collaborator
Loading…
feat(power): vendor-agnostic GPU power/telemetry aggregation core
#1635
opened Jun 1, 2026 by
arygupt
Collaborator
Loading…
2 of 3 tasks
Enable Rust frontend (VLLM_USE_RUST_FRONTEND=1)
full-sweep-enabled
#1634
opened Jun 1, 2026 by
chunfangamd
Collaborator
Loading…
[Klaud Cold] Update gptoss-fp4-mi300x-vllm vLLM ROCm image to v0.22.0
full-sweep-enabled
#1621
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update minimaxm2.5-fp8-mi300x-vllm vLLM ROCm image to v0.22.0
full-sweep-enabled
#1618
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update kimik2.5-int4-mi300x-vllm vLLM ROCm image to v0.22.0
full-sweep-enabled
#1615
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update kimik2.5-int4-mi325x-vllm vLLM ROCm image to v0.22.0
full-sweep-enabled
#1614
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update kimik2.5-int4-mi355x-vllm vLLM ROCm image to v0.22.0
full-sweep-enabled
#1613
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update minimaxm2.5-fp4-b300-vllm vLLM image to v0.22.0
full-sweep-enabled
#1612
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update minimaxm2.5-fp8-b300-vllm vLLM image to v0.22.0
full-sweep-enabled
#1608
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update gptoss-fp4-h100-vllm vLLM image to v0.22.0
full-sweep-enabled
#1605
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update kimik2.5-fp4-b300-vllm vLLM image to v0.22.0
full-sweep-enabled
#1603
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
[Klaud Cold] Update kimik2.5-int4-h100-vllm vLLM image to v0.22.0
full-sweep-enabled
#1601
opened May 30, 2026 by
functionstackx
Collaborator
Loading…
1 task
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-05-03.