[Kernel] optimize moe_align_block_size for cuda graph and large num_experts (e.g. DeepSeek-V3) #2791
Triggered via pull request
January 20, 2025 12:48
jinzhen-lin
edited
#12222
Status
Success
Total duration
14s
Artifacts
–
cleanup_pr_body.yml
on: pull_request_target
update-description
6s
Annotations
1 warning
update-description
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|