[Kernel] Implement fallback for FP8 channelwise using torch._scaled_mm#6552
Merged
robertgshaw2-redhat merged 6 commits intovllm-project:mainfrom neuralmagic:tms/fp8_scaled_mm_channelwiseJul 18, 2024
+40-21
Commits
Commits on Jul 18, 2024
- committed
- committed
- committed
- committed
- committed
- committed