Skip to content

[Kernel] optimize moe_align_block_size for cuda graph and large num_experts (e.g. DeepSeek-V3) #2791

[Kernel] optimize moe_align_block_size for cuda graph and large num_experts (e.g. DeepSeek-V3)

[Kernel] optimize moe_align_block_size for cuda graph and large num_experts (e.g. DeepSeek-V3) #2791

Triggered via pull request January 20, 2025 12:48
Status Success
Total duration 14s
Artifacts

cleanup_pr_body.yml

on: pull_request_target
update-description
6s
update-description
Fit to window
Zoom out
Zoom in

Annotations

1 warning
update-description
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636