-
Notifications
You must be signed in to change notification settings - Fork 95
Pull requests: SemiAnalysisAI/InferenceX
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add Kimi-K2.5 INT4 vLLM v0.16.0 benchmark for MI300X
AMD
sweep-enabled
#860
opened Mar 3, 2026 by
functionstackx
Loading…
Add Kimi K2.5 INT4 single-node MI325X vLLM benchmark (TP8)
AMD
sweep-enabled
#857
opened Mar 3, 2026 by
functionstackx
Loading…
[Do Not Merge] [WIP till AMD releases MXFP4 of MiniMax M2.5] Add MiniMax M2.1 MXFP4 benchmark for MI355x vLLM (TP=2,4)
AMD
#827
opened Mar 1, 2026 by
functionstackx
Loading…
[NV] Qwen3.5 B200 SGLang FP4 configs
NVIDIA
sweep-enabled
#820
opened Feb 27, 2026 by
kedarpotdar-nv
Loading…
[NVIDIA] Update NVIDIA single-node DSR1 SGLang images from v0.5.6-v0.5.8 to v0.5.9
image update
NVIDIA
sweep-enabled
#814
opened Feb 26, 2026 by
cquil11
Loading…
Performance Improvements for MI300X with GEMM and FP8 Enhancements
#811
opened Feb 26, 2026 by
chunfangamd
Loading…
[NVIDIA] Update NVIDIA GPT-OSS vLLM image from v0.15.1 to v0.16.0
NVIDIA
#800
opened Feb 26, 2026 by
cquil11
Loading…
feat: add GLM-5 FP8 SGLang benchmark for MI355X
AMD
sweep-enabled
#762
opened Feb 19, 2026 by
functionstackx
Loading…
Add MiniMax-M2.5 FP8 vLLM benchmark for B200
NVIDIA
#757
opened Feb 19, 2026 by
functionstackx
Loading…
Add auto perf-changelog generation on PR merge
#656
opened Feb 6, 2026 by
Klaud-Cold
Loading…
4 tasks
[WIP] [NV] Updates SGLang DSR1-FP4 GB300 1k8k configurations (STP only)
NVIDIA
#637
opened Feb 5, 2026 by
yunzhoul-nv
Loading…
[WIP] [NV] Updates SGLang DSR1-FP4 GB200 1k8k (STP only)
NVIDIA
#634
opened Feb 5, 2026 by
yunzhoul-nv
•
Draft
[NV] update DSR1 SGLang MTP configs on single node B200
NVIDIA
sweep-enabled
#631
opened Feb 4, 2026 by
zbpatel
Loading…
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.