Actions: vllm-project/vllm

Add label on auto-merge enabled

1,333 workflow runs

[Bugfix] Fix vocab_size field access in LLaVA models
Add label on auto-merge enabled #58: Pull request #6624 auto_merge_enabled by DarkLight1337
July 22, 2024 03:40 14s
[ CI ] Awq Marlin Integration Tests
Add label on auto-merge enabled #57: Pull request #6627 auto_merge_enabled by robertgshaw2-redhat
July 22, 2024 01:01 10s
[ Kernel ] Enable fp8-marlin for fbgemm-fp8 models
Add label on auto-merge enabled #56: Pull request #6606 auto_merge_enabled by mgoin
July 20, 2024 18:32 11s
[Misc] Consolidate and optimize logic for building padded tensors
Add label on auto-merge enabled #55: Pull request #6541 auto_merge_enabled by DarkLight1337
July 20, 2024 03:37 11s
[ Misc ] fbgemm checkpoints
Add label on auto-merge enabled #54: Pull request #6559 auto_merge_enabled by mgoin
July 20, 2024 01:37 10s
[ Kernel ] FP8 Dynamic Per Token Quant - Add scale_ub
Add label on auto-merge enabled #53: Pull request #6593 auto_merge_enabled by mgoin
July 20, 2024 00:35 12s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #52: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 23:34 14s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #51: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 22:27 10s
[Bugfix] [SpecDecode] AsyncMetricsCollector: update time since last collection
Add label on auto-merge enabled #50: Pull request #6578 auto_merge_enabled by cadedaniel
July 19, 2024 20:53 11s
[ Kernel ] Enable Dynamic Per Token fp8
Add label on auto-merge enabled #49: Pull request #6547 auto_merge_enabled by robertgshaw2-redhat
July 19, 2024 18:34 13s
[Misc] Fix input_scale typing in w8a8_utils.py
Add label on auto-merge enabled #48: Pull request #6579 auto_merge_enabled by mgoin
July 19, 2024 14:31 11s
[Bugfix][Frontend] remove duplicate init logger
Add label on auto-merge enabled #47: Pull request #6581 auto_merge_enabled by DarkLight1337
July 19, 2024 14:19 13s
[BUGFIX] Raise an error for no draft token case when draft_tp>1
Add label on auto-merge enabled #46: Pull request #6369 auto_merge_enabled by cadedaniel
July 19, 2024 08:21 9s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #45: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 05:06 11s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #44: Pull request #6557 auto_merge_enabled by comaniac
July 19, 2024 04:57 12s
[Bugfix][Frontend] Fix missing /metrics endpoint
Add label on auto-merge enabled #43: Pull request #6463 auto_merge_enabled by simon-mo
July 19, 2024 02:00 11s
Add support for a rope extension method
Add label on auto-merge enabled #42: Pull request #6553 auto_merge_enabled by simon-mo
July 19, 2024 01:16 1m 17s
[Kernel] Implement fallback for FP8 channelwise using torch._scaled_mm
Add label on auto-merge enabled #41: Pull request #6552 auto_merge_enabled by robertgshaw2-redhat
July 18, 2024 22:00 13s
[ci][test] add correctness test for cpu offloading
Add label on auto-merge enabled #40: Pull request #6549 auto_merge_enabled by youkaichao
July 18, 2024 21:49 12s
[Misc] Small perf improvements
Add label on auto-merge enabled #39: Pull request #6520 auto_merge_enabled by Yard1
July 18, 2024 20:13 15s
[Model] Support Mistral-Nemo
Add label on auto-merge enabled #38: Pull request #6548 auto_merge_enabled by mgoin
July 18, 2024 19:35 15s
[Bugfix] Update flashinfer.py with PagedAttention forwards - Fixes Gemma2 OpenAI Server Crash
Add label on auto-merge enabled #37: Pull request #6501 auto_merge_enabled by comaniac
July 18, 2024 06:10 12s
[Misc] Minor patch for draft model runner
Add label on auto-merge enabled #36: Pull request #6523 auto_merge_enabled by cadedaniel
July 18, 2024 05:39 10s
[BugFix] Avoid secondary error in ShmRingBuffer destructor
Add label on auto-merge enabled #35: Pull request #6530 auto_merge_enabled by youkaichao
July 18, 2024 04:04 15s
[BugFix][Frontend] Use LoRA tokenizer in OpenAI APIs
Add label on auto-merge enabled #34: Pull request #6227 auto_merge_enabled by DarkLight1337
July 18, 2024 00:55 14s