Skip to content

Actions: stanford-crfm/helm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
4,355 workflow runs
4,355 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Scenario tests
Scenario tests #236: Scheduled
January 15, 2025 15:34 9m 19s main
January 15, 2025 15:34 9m 19s
Misc cleanup for HELM Capabilities (#3274)
Test #7881: Commit 6d70e98 pushed by yifanmai
January 15, 2025 06:16 10m 14s main
January 15, 2025 06:16 10m 14s
Misc cleanup for HELM Capabilities
Test #7880: Pull request #3274 synchronize by yifanmai
January 15, 2025 06:06 9m 24s yifanmai/fix-capabilities-cleanup
January 15, 2025 06:06 9m 24s
Add general info metrics to Capabilities run specs (#3273)
Test #7878: Commit 716e523 pushed by yifanmai
January 15, 2025 01:03 10m 49s main
January 15, 2025 01:03 10m 49s
Allow running on all subjects for MMLU-Pro (#3272)
Test #7876: Commit b069c5a pushed by yifanmai
January 14, 2025 23:42 10m 12s main
January 14, 2025 23:42 10m 12s
Return more information in Omni-MATH annotations (#3271)
Test #7875: Commit 61a9bc0 pushed by yifanmai
January 14, 2025 23:29 10m 26s main
January 14, 2025 23:29 10m 26s
Allow running on all subjects for MMLU-Pro
Test #7874: Pull request #3272 opened by yifanmai
January 14, 2025 23:28 10m 13s yifanmai/fix-mmlu-pro-all
January 14, 2025 23:28 10m 13s
Move run specs for HELM capabilities to its module (#3270)
Test #7873: Commit 6989b81 pushed by yifanmai
January 14, 2025 23:15 9m 43s main
January 14, 2025 23:15 9m 43s
Scenario tests
Scenario tests #235: Scheduled
January 14, 2025 15:34 9m 28s main
January 14, 2025 15:34 9m 28s
Update run entries for Unitxt tables benchmark (#3268)
Test #7870: Commit e34b3eb pushed by yifanmai
January 14, 2025 04:37 9m 47s main
January 14, 2025 04:37 9m 47s
Allow running recipes from the Unitxt catalog (#3267)
Test #7869: Commit 1bc7bd0 pushed by yifanmai
January 14, 2025 04:36 10m 20s main
January 14, 2025 04:36 10m 20s
increase the max token number to 2000 (#3266)
Test #7866: Commit 5f5c17e pushed by yifanmai
January 13, 2025 21:09 9m 51s main
January 13, 2025 21:09 9m 51s
Add support for Granite 3.1 model family (IBM) (#3261)
Test #7865: Commit b9ad574 pushed by yifanmai
January 13, 2025 21:09 10m 19s main
January 13, 2025 21:09 10m 19s
Scenario tests
Scenario tests #234: Scheduled
January 13, 2025 15:35 8m 38s main
January 13, 2025 15:35 8m 38s
Scenario tests
Scenario tests #233: Scheduled
January 12, 2025 15:34 8m 28s main
January 12, 2025 15:34 8m 28s
Scenario tests
Scenario tests #232: Scheduled
January 11, 2025 15:34 11m 26s main
January 11, 2025 15:34 11m 26s
Release MMLU v1.13.0 (#3265)
Build Frontend #162: Commit 12ab30b pushed by yifanmai
January 10, 2025 18:52 51s main
January 10, 2025 18:52 51s
Release MMLU v1.13.0 (#3265)
Frontend #701: Commit 12ab30b pushed by yifanmai
January 10, 2025 18:52 1m 1s main
January 10, 2025 18:52 1m 1s