Skip to content

Actions: vllm-project/vllm

Add label on auto-merge enabled

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,333 workflow runs
1,333 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add Nemotron to PP_SUPPORTED_MODELS
Add label on auto-merge enabled #108: Pull request #6863 auto_merge_enabled by mgoin
July 27, 2024 21:52 13s
July 27, 2024 21:52 13s
[Kernel] Remove scaled_fp8_quant kernel padding footgun
Add label on auto-merge enabled #107: Pull request #6842 auto_merge_enabled by DarkLight1337
July 27, 2024 10:27 10s
July 27, 2024 10:27 10s
[Model] Initial support for BLIP-2
Add label on auto-merge enabled #106: Pull request #5920 auto_merge_enabled by DarkLight1337
July 27, 2024 10:00 11s
July 27, 2024 10:00 11s
[bugfix] make args.stream work
Add label on auto-merge enabled #105: Pull request #6831 auto_merge_enabled by DarkLight1337
July 27, 2024 09:04 1m 42s
July 27, 2024 09:04 1m 42s
[CI/Build][Doc] Update CI and Doc for VLM example changes
Add label on auto-merge enabled #104: Pull request #6860 auto_merge_enabled by ywang96
July 27, 2024 07:22 10s
July 27, 2024 07:22 10s
[ROCm] Upgrade PyTorch nightly version
Add label on auto-merge enabled #103: Pull request #6845 auto_merge_enabled by WoosukKwon
July 27, 2024 03:15 11s
July 27, 2024 03:15 11s
[Bug Fix] Illegal memory access, FP8 Llama 3.1 405b
Add label on auto-merge enabled #102: Pull request #6852 auto_merge_enabled by comaniac
July 27, 2024 02:18 9s
July 27, 2024 02:18 9s
Update README.md
Add label on auto-merge enabled #101: Pull request #6847 auto_merge_enabled by zhuohan123
July 27, 2024 00:25 12s
July 27, 2024 00:25 12s
[TPU] Support collective communications in XLA devices
Add label on auto-merge enabled #100: Pull request #6813 auto_merge_enabled by WoosukKwon
July 26, 2024 23:01 14s
July 26, 2024 23:01 14s
enforce eager mode with bnb quantization temporarily
Add label on auto-merge enabled #99: Pull request #6846 auto_merge_enabled by mgoin
July 26, 2024 21:57 10s
July 26, 2024 21:57 10s
[Bugfix][Kernel] Promote another index to int64_t
Add label on auto-merge enabled #98: Pull request #6838 auto_merge_enabled by WoosukKwon
July 26, 2024 17:43 13s
July 26, 2024 17:43 13s
[Misc][TPU] Support TPU in initialize_ray_cluster
Add label on auto-merge enabled #97: Pull request #6812 auto_merge_enabled by WoosukKwon
July 26, 2024 17:28 11s
July 26, 2024 17:28 11s
[Doc] Add missing mock import to docs conf.py
Add label on auto-merge enabled #96: Pull request #6834 auto_merge_enabled by DarkLight1337
July 26, 2024 15:57 11s
July 26, 2024 15:57 11s
[Doc] Add missing mock import to docs conf.py
Add label on auto-merge enabled #95: Pull request #6834 auto_merge_enabled by DarkLight1337
July 26, 2024 13:29 16s
July 26, 2024 13:29 16s
[doc][debugging] add known issues for hangs
Add label on auto-merge enabled #94: Pull request #6816 auto_merge_enabled by DarkLight1337
July 26, 2024 04:45 9s
July 26, 2024 04:45 9s
[Frontend] New allowed_token_ids decoding request parameter
Add label on auto-merge enabled #93: Pull request #6753 auto_merge_enabled by DarkLight1337
July 26, 2024 02:26 11s
July 26, 2024 02:26 11s
[Docs] Publish 5th meetup slides
Add label on auto-merge enabled #92: Pull request #6799 auto_merge_enabled by zhuohan123
July 25, 2024 23:46 12s
July 25, 2024 23:46 12s
[Bugfix] Fix empty (nullptr) channelwise scales when loading wNa16 using compressed tensors
Add label on auto-merge enabled #91: Pull request #6798 auto_merge_enabled by mgoin
July 25, 2024 21:28 13s
July 25, 2024 21:28 13s
Fix ReplicatedLinear weight loading
Add label on auto-merge enabled #90: Pull request #6793 auto_merge_enabled by comaniac
July 25, 2024 20:08 11s
July 25, 2024 20:08 11s
[Core] Fix ray forward_dag error mssg
Add label on auto-merge enabled #89: Pull request #6792 auto_merge_enabled by simon-mo
July 25, 2024 17:04 12s
July 25, 2024 17:04 12s
[Bugfix] Add image placeholder for OpenAI Compatible Server of MiniCPM-V
Add label on auto-merge enabled #88: Pull request #6787 auto_merge_enabled by DarkLight1337
July 25, 2024 16:30 17s
July 25, 2024 16:30 17s
[Bugfix] Fix kv_cache_dtype=fp8 without scales for FP8 checkpoints
Add label on auto-merge enabled #87: Pull request #6761 auto_merge_enabled by mgoin
July 25, 2024 14:51 12s
July 25, 2024 14:51 12s
[Bugfix] Fix encoding_format in examples/openai_embedding_client.py
Add label on auto-merge enabled #86: Pull request #6755 auto_merge_enabled by DarkLight1337
July 25, 2024 02:21 11s
July 25, 2024 02:21 11s
[ Misc ] fp8-marlin channelwise via compressed-tensors
Add label on auto-merge enabled #85: Pull request #6524 auto_merge_enabled by mgoin
July 25, 2024 00:45 11s
July 25, 2024 00:45 11s
[Bugfix] Fix decode tokens w. CUDA graph
Add label on auto-merge enabled #84: Pull request #6757 auto_merge_enabled by comaniac
July 24, 2024 20:40 13s
July 24, 2024 20:40 13s
ProTip! You can narrow down the results and go further in time using created:<2024-07-24 or the other filters available.