[Kernel] Implement fallback for FP8 channelwise using torch._scaled_mm #6552

Merged
robertgshaw2-redhat merged 6 commits into vllm-project:main from neuralmagic:tms/fp8_scaled_mm_channelwise on Jul 18, 2024

Commits on Jul 18, 2024