forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 29
Pull requests: ROCm/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add TritonScaledMMLinearKernel to fix broken support for int8 models
#377
opened Jan 21, 2025 by
rasmith
Loading…
[Cleanup] Remove obsolete patches and references and test CI
#354
opened Jan 9, 2025 by
hongxiayang
Loading…
[FEAT] Improved PagedAttention FP8 (faster kvcache dequant v1)
#346
opened Dec 24, 2024 by
tjtanaa
Loading…
[Kernel] Upload a MoE config file for Mixtral8x7B 8GPU on AMD_Instinct_MI300X_OAM machine (fp16)
#261
opened Nov 4, 2024 by
Jacob0226
Loading…
multi-gpu fused_moe tuning support
stale
#143
opened Aug 16, 2024 by
divakar-amd
Loading…
1 task done
ProTip!
no:milestone will show everything without a milestone.