Actions: EricLBuehler/mistral.rs

docs

1,186 workflow runs

Fix GGUF auto device mapping block count (#1047)
docs #1186: Commit 8347e21 pushed by EricLBuehler
January 10, 2025 12:25 7m 27s master
Support loading models without ISQ using device map (#1045)
docs #1185: Commit dba4c76 pushed by EricLBuehler
January 10, 2025 04:37 7m 30s master
Fix cuda build
docs #1184: Commit 5e1a615 pushed by EricLBuehler
January 10, 2025 01:34 7m 45s master
Support automatic device mapping for gguf models (#1044)
docs #1183: Commit b1db3ac pushed by EricLBuehler
January 10, 2025 01:18 7m 52s master
Automatic device mapping support (#1042)
docs #1182: Commit 729b81b pushed by EricLBuehler
January 9, 2025 23:34 7m 56s master
Remove debug
docs #1181: Commit 524a8d9 pushed by EricLBuehler
January 8, 2025 23:16 7m 28s master
Faster F16 support for mllama (#1041)
docs #1180: Commit 9b6c592 pushed by EricLBuehler
January 8, 2025 20:36 7m 25s master
CUDA dequant kernels conditional compilation (#1039)
docs #1179: Commit 017b109 pushed by EricLBuehler
January 8, 2025 17:33 7m 23s master
Rename MemoryGpuConfig::Amount->MbAmount (#1038)
docs #1178: Commit 894bd9b pushed by EricLBuehler
January 8, 2025 00:50 7m 28s master
Remove cudarc transient dep (#1037)
docs #1177: Commit 8c255b8 pushed by EricLBuehler
January 7, 2025 23:57 7m 26s master
Allocate paged attn cache as empty instead of zeros (#1036)
docs #1176: Commit c26e6c8 pushed by EricLBuehler
January 7, 2025 22:08 9m 54s master
Patch prefix caching to fix incorrect outputs (#1035)
docs #1175: Commit 3df9622 pushed by EricLBuehler
January 7, 2025 21:50 8m 51s master
Use float8 mistralrs_cudarc_fork feature (#1034)
docs #1174: Commit d2692f9 pushed by EricLBuehler
January 7, 2025 20:53 22m 47s master
Fix metal paged attn phi3 (#1033)
docs #1173: Commit a853cdd pushed by EricLBuehler
January 7, 2025 20:18 7m 24s master
Use cudarc fork to fix CUDA build on Windows (#1032)
docs #1172: Commit 80beed4 pushed by EricLBuehler
January 7, 2025 19:52 10m 17s master
Implement DeepSeekV2 (#1010)
docs #1171: Commit a562fd0 pushed by EricLBuehler
January 5, 2025 00:35 7m 23s master
Remove debug
docs #1170: Commit a06f937 pushed by EricLBuehler
January 2, 2025 16:11 7m 36s master
Update license for 2025 (#1024)
docs #1169: Commit e06f0af pushed by EricLBuehler
January 2, 2025 15:14 7m 33s master
Support uqff load/save for idefics3 (#1023)
docs #1168: Commit 27ab495 pushed by EricLBuehler
January 2, 2025 14:10 7m 36s master
Cleaner pipeline no prefix cache setting (#1022)
docs #1167: Commit 0875194 pushed by EricLBuehler
January 2, 2025 12:16 7m 23s master
Bump version to v0.3.5 (#1021)
docs #1166: Commit f1c3a36 pushed by EricLBuehler
January 2, 2025 03:14 7m 29s master
Support uqff for idefics3 (#1020)
docs #1165: Commit 8ee15d4 pushed by EricLBuehler
January 2, 2025 02:25 7m 39s master
More fixes for the prefix cacher (#1019)
docs #1164: Commit bd7af28 pushed by EricLBuehler
January 2, 2025 02:14 7m 30s master
Prefix cacher fixes (#1018)
docs #1163: Commit d116d4d pushed by EricLBuehler
January 1, 2025 19:16 7m 37s master
Support device mapping for Paged Attention (#1011)
docs #1162: Commit c345954 pushed by EricLBuehler
January 1, 2025 11:05 7m 21s master