Actions: vllm-project/vllm
Actions
2,933 workflow runs
2,933 workflow runs
process_after_weight_loading
for W4A16 MoE Group Act Order
Cleanup PR Body
#2802:
Pull request #11528
edited
by
dsikka
cutlass_scaled_mm
to support 2d group (blockwise) scaling
Cleanup PR Body
#2800:
Pull request #11868
edited
by
LucasWilkinson
process_after_weight_loading
for W4A16 MoE Group Act Order
Cleanup PR Body
#2799:
Pull request #11528
edited
by
dsikka
process_after_weight_loading
for W4A16 MoE Group Act Order
Cleanup PR Body
#2796:
Pull request #11528
edited
by
dsikka
HfExampleModels.find_hf_info
Cleanup PR Body
#2793:
Pull request #12223
opened
by
DarkLight1337
transformers
backend support
Cleanup PR Body
#2790:
Pull request #11330
edited
by
Isotr0py
attention
to impl backend
Cleanup PR Body
#2787:
Pull request #12218
opened
by
wangxiyuan
_get_cache_block_size
Cleanup PR Body
#2785:
Pull request #12214
opened
by
heheda12345