[V1][Bugfix] Fix data item ordering in mixed-modality inference #12259
Conversation
Signed-off-by: Roger Wang <[email protected]>
LGTM if you don't plan to change https://github.com/vllm-project/vllm/pull/12259/files#r1923428521
Due to the V1 re-architecture and the design of `get_input_embeddings` not being modality-aware, the order of embeddings in `encoder_outputs` needs to match the order of multimodal inputs (`MultiModalKwargs`) in the current batch. However, batching `MultiModalKwargs` when there are multiple modalities in the same batch does not preserve their original order, and this PR adds a temporary workaround for this issue.
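The ordering issue described above can be sketched roughly as follows. This is a minimal illustration, not vLLM's actual implementation: all function and variable names here are hypothetical. Grouping items by modality loses the original interleaving across modalities, so the workaround is to remember each item's original position and restore it after grouping.

```python
# Minimal sketch (illustrative names, not vLLM internals) of why
# per-modality batching loses the original order of mixed-modality
# inputs, and how tracking original indices recovers it.

def batch_by_modality(items):
    """Group (modality, data) pairs by modality, as a batcher might,
    remembering each item's original position in the batch."""
    grouped = {}
    for idx, (modality, data) in enumerate(items):
        grouped.setdefault(modality, []).append((idx, data))
    return grouped

def restore_original_order(grouped):
    """Flatten the grouped items back into their original batch order
    by sorting on the remembered indices."""
    flat = [pair for pairs in grouped.values() for pair in pairs]
    flat.sort(key=lambda pair: pair[0])
    return [data for _, data in flat]

# Example: a mixed-modality batch of three items.
items = [("image", "img0"), ("audio", "aud0"), ("image", "img1")]
grouped = batch_by_modality(items)
# Grouping alone yields img0, img1, aud0 -- the interleaving is lost.
# Restoring by original index recovers img0, aud0, img1.
print(restore_original_order(grouped))
```

Under this sketch's assumptions, the encoder outputs produced from the grouped items can then be re-sorted into the per-request order that `get_input_embeddings` expects.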