SemiAnalysisAI / InferenceX Public

Notifications You must be signed in to change notification settings
Fork 181
Star 1k

Code
Issues 106
Pull requests 82
Discussions
Actions
Projects
Models
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Models
Security and quality
Insights

Pull requests: SemiAnalysisAI/InferenceX

Labels 40 Milestones 6

New pull request New

82 Open 1,210 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Throwaway: conc-64 gsm8k eval for DEP8+MTP3 dispatch token bug non-canary-full-sweep-enabled

Run the full sweep without the canary gate (full search space, no trim)

#1659 opened Jun 3, 2026 by Oseltamivir Collaborator

Loading…

[NV] Add Kimi K2.5 FP4 B200/B300 EP sweep full-sweep-enabled

#1658 opened Jun 3, 2026 by jasonlizhengjian Collaborator

Loading…

AMD - gpt-oss vllm mxfp4: AITER tuning + n-gram spec decode + server … AMD

#1657 opened Jun 3, 2026 by nehaprakriya

Loading…

[WIP] Update Dsv4 B300 configs full-sweep-enabled

#1656 opened Jun 3, 2026 by wzhao18 Collaborator

Loading…

[WIP] Update B200 Dsv4 configs full-sweep-enabled

#1655 opened Jun 3, 2026 by wzhao18 Collaborator

Loading…

[DNM][AMD] agentx-v0.4

#1654 opened Jun 3, 2026 by seungrokj Collaborator

Loading…

[NV] Add GitHub Action to collect SPEED-Bench AL matrix

#1650 opened Jun 2, 2026 by qiching • Draft

3 tasks done

fix(power): classify zero-decode-GPU multinode runs as aggregated

#1646 opened Jun 2, 2026 by arygupt Collaborator

Loading…

dsr1-fp4-mi355x-sglang: bump ROCm 7.0->7.2 image + add TP4 search-space AMD

#1645 opened Jun 2, 2026 by JohnQinAMD Collaborator

Loading…

[WIP] agentX v0.4

#1640 opened Jun 2, 2026 by cquil11 Collaborator • Draft

[AMD][MI355X] update model for gpt-oss

#1638 opened Jun 2, 2026 by ukannika

Loading…

Use official TRT-LLM image (1.3.0rc15.post1) for DSv4 B300 TRT (non-MTP + MTP) full-sweep-enabled

#1636 opened Jun 1, 2026 by Oseltamivir Collaborator

Loading…

feat(power): vendor-agnostic GPU power/telemetry aggregation core

#1635 opened Jun 1, 2026 by arygupt Collaborator

Loading…

2 of 3 tasks

Enable Rust frontend (VLLM_USE_RUST_FRONTEND=1) full-sweep-enabled

#1634 opened Jun 1, 2026 by chunfangamd Collaborator

Loading…

Update new fixed-AR-MTP CI workflow for kimik2.5_int4, kimik2.5_fp4, …

#1633 opened Jun 1, 2026 by haic0 Collaborator • Draft

[Klaud Cold] Update gptoss-fp4-mi300x-vllm vLLM ROCm image to v0.22.0 full-sweep-enabled

#1621 opened May 30, 2026 by functionstackx Collaborator

Loading…

1 task

[Klaud Cold] Update minimaxm2.5-fp8-mi300x-vllm vLLM ROCm image to v0.22.0 full-sweep-enabled

#1618 opened May 30, 2026 by functionstackx Collaborator

Loading…

1 task

[Klaud Cold] Update kimik2.5-int4-mi300x-vllm vLLM ROCm image to v0.22.0 full-sweep-enabled

#1615 opened May 30, 2026 by functionstackx Collaborator

Loading…

1 task

[Klaud Cold] Update kimik2.5-int4-mi325x-vllm vLLM ROCm image to v0.22.0 full-sweep-enabled

#1614 opened May 30, 2026 by functionstackx Collaborator

Loading…

1 task

[Klaud Cold] Update kimik2.5-int4-mi355x-vllm vLLM ROCm image to v0.22.0 full-sweep-enabled

#1613 opened May 30, 2026 by functionstackx Collaborator

Loading…

1 task

[Klaud Cold] Update minimaxm2.5-fp4-b300-vllm vLLM image to v0.22.0 full-sweep-enabled

#1612 opened May 30, 2026 by functionstackx Collaborator

Loading…

1 task

[Klaud Cold] Update minimaxm2.5-fp8-b300-vllm vLLM image to v0.22.0 full-sweep-enabled

#1608 opened May 30, 2026 by functionstackx Collaborator

Loading…

1 task

[Klaud Cold] Update gptoss-fp4-h100-vllm vLLM image to v0.22.0 full-sweep-enabled

#1605 opened May 30, 2026 by functionstackx Collaborator

Loading…

1 task

[Klaud Cold] Update kimik2.5-fp4-b300-vllm vLLM image to v0.22.0 full-sweep-enabled

#1603 opened May 30, 2026 by functionstackx Collaborator

Loading…

1 task

[Klaud Cold] Update kimik2.5-int4-h100-vllm vLLM image to v0.22.0 full-sweep-enabled

#1601 opened May 30, 2026 by functionstackx Collaborator

Loading…

1 task

Previous 1 2 3 4 Next

Previous Next

ProTip! What’s not been updated in a month: updated:<2026-05-03.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!