[New Model] Support Command-A-Vision #22660

dongluw · 2025-08-11T16:55:31Z

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

Purpose

Support Command-A-Vision https://huggingface.co/CohereLabs/command-a-vision-07-2025/blob/main/config.json

Test Plan

python examples/offline_inference/vision_language_multi_image.py --model-type command_a_vision

--------------------------------------------------
The first image shows a **Mallard duck** swimming in calm blue water. The duck has a vibrant green head, a yellow bill, and a body with brown, white, and black feathers. Its reflection is visible in the water, and the surface has gentle ripples.

The second image features a **lion** sitting in a field of tall, golden-brown grass. The lion has a thick, dark mane and a focused expression, looking slightly to the side. The background is softly blurred, emphasizing the lion's majestic presence in its natural habitat.
--------------------------------------------------

python examples/offline_inference/vision_language.py --model-type command_a_vision

--------------------------------------------------
The image captures a stunning view of the Tokyo Tower, a prominent landmark in Tokyo, Japan, framed by the delicate pink blossoms of cherry trees. The tower, with its white and orange structure, stands tall against a clear blue sky, creating a striking contrast. The cherry blossoms, in full bloom, dominate the foreground
--------------------------------------------------
The image captures a stunning view of the Tokyo Tower, a prominent landmark in Tokyo, Japan, framed by cherry blossoms in full bloom. The tower, with its distinctive white and orange color scheme, stands tall against a clear blue sky. The cherry blossom trees, or sakura, are in the foreground, their delicate
--------------------------------------------------
The image captures a stunning view of the Tokyo Tower, a prominent landmark in Tokyo, Japan, framed by the delicate blossoms of cherry trees in full bloom. The cherry blossoms, known as *sakura* in Japanese, are a quintessential symbol of spring and are celebrated for their fleeting beauty. The contrast between the soft
--------------------------------------------------
The image captures a stunning view of the Tokyo Tower, a prominent landmark in Tokyo, Japan, framed by the delicate blossoms of cherry trees. The tower, with its distinctive white and orange color scheme, stands tall against a clear blue sky. The cherry blossoms, in full bloom, are a vibrant pink and dominate the
--------------------------------------------------

online chat

vllm serve CohereLabs/command-a-vision-07-2025 --disable-log-requests  --tensor-parallel-size 4 --max_model_len 32000 --max-num-seqs 32

python examples/online_serving/openai_chat_completion_client_for_multimodal.py --chat-type multi-image


Chat completion output: The images feature two animals: a mallard duck and an African lion. 

1. **Mallard Duck**: The first image shows a mallard duck swimming in water. Mallards are one of the most recognizable and widespread duck species, known for their iridescent green heads (in males), white collars, and

Test Result

(Optional) Documentation Update

github-actions · 2025-08-11T16:55:43Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

gemini-code-assist

Code Review

This pull request introduces support for the Command-A-Vision model. The changes include a new model implementation file, along with additions to example scripts and model registries. The core logic seems sound and follows existing patterns in the codebase. However, I've identified a critical bug in the new model implementation related to an incorrect function call that will cause a runtime error. Additionally, there's a high-severity issue where a value is hardcoded, which compromises the code's generality and maintainability. Addressing these points will improve the robustness and quality of the new model support.

vllm/model_executor/models/cohere2_vision.py

examples/offline_inference/vision_language.py

vllm/model_executor/models/cohere2_vision.py

Signed-off-by: donglu <[email protected]>

DarkLight1337

Thanks, LGTM!

cc @hmellor can you measure the perf difference compared to Transformers backend? In the future we may be able to switch fully to Transformers backend if the performance becomes on par with the custom implementation.

hmellor · 2025-08-11T18:00:10Z

can you measure the perf difference compared to Transformers backend?

Sure, testing now

hmellor · 2025-08-11T20:40:35Z

Looks like running Command A Vision in the Transformers backend will require #22673 and a change on the Transformers side

Signed-off-by: donglu <[email protected]> Signed-off-by: Paul Pak <[email protected]>

Signed-off-by: donglu <[email protected]> Signed-off-by: Diego-Castan <[email protected]>

Signed-off-by: donglu <[email protected]>

Signed-off-by: donglu <[email protected]> Signed-off-by: Xiao Yu <[email protected]>

Signed-off-by: donglu <[email protected]>

dongluw requested review from DarkLight1337 and ywang96 as code owners August 11, 2025 16:55

mergify bot added documentation Improvements or additions to documentation new-model Requests to new models labels Aug 11, 2025

gemini-code-assist bot reviewed Aug 11, 2025

View reviewed changes

vllm/model_executor/models/cohere2_vision.py Outdated Show resolved Hide resolved

vllm/model_executor/models/cohere2_vision.py Outdated Show resolved Hide resolved

DarkLight1337 reviewed Aug 11, 2025

View reviewed changes

examples/offline_inference/vision_language.py Outdated Show resolved Hide resolved

DarkLight1337 reviewed Aug 11, 2025

View reviewed changes

vllm/model_executor/models/cohere2_vision.py Outdated Show resolved Hide resolved

dongluw marked this pull request as draft August 11, 2025 17:10

command a vision

f02103d

Signed-off-by: donglu <[email protected]>

dongluw force-pushed the command_a_vision branch from 4ef9e2a to f02103d Compare August 11, 2025 17:52

dongluw marked this pull request as ready for review August 11, 2025 17:53

dongluw requested a review from hmellor as a code owner August 11, 2025 17:53

DarkLight1337 approved these changes Aug 11, 2025

View reviewed changes

DarkLight1337 enabled auto-merge (squash) August 11, 2025 17:55

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 11, 2025

vllm-bot merged commit 9f909b8 into vllm-project:main Aug 12, 2025
39 of 47 checks passed

paulpak58 pushed a commit to paulpak58/vllm that referenced this pull request Aug 13, 2025

[New Model] Support Command-A-Vision (vllm-project#22660)

10a2401

Signed-off-by: donglu <[email protected]> Signed-off-by: Paul Pak <[email protected]>

diegocastanibm pushed a commit to diegocastanibm/vllm that referenced this pull request Aug 15, 2025

[New Model] Support Command-A-Vision (vllm-project#22660)

df9ba08

Signed-off-by: donglu <[email protected]> Signed-off-by: Diego-Castan <[email protected]>

yiliu30 pushed a commit to yiliu30/vllm-fork that referenced this pull request Aug 19, 2025

[New Model] Support Command-A-Vision (vllm-project#22660)

1691ea2

Signed-off-by: donglu <[email protected]>

simon-mo mentioned this pull request Aug 19, 2025

[Doc]: Missing Mamba1 and Jamba V1 support in Release v0.10.1 Highlights #23170

Closed

1 task

epwalsh pushed a commit to epwalsh/vllm that referenced this pull request Aug 28, 2025

[New Model] Support Command-A-Vision (vllm-project#22660)

66e7532

Signed-off-by: donglu <[email protected]>

xiao-llm pushed a commit to xiao-llm/vllm that referenced this pull request Aug 28, 2025

[New Model] Support Command-A-Vision (vllm-project#22660)

e3b8b5b

Signed-off-by: donglu <[email protected]> Signed-off-by: Xiao Yu <[email protected]>

zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Aug 28, 2025

[New Model] Support Command-A-Vision (vllm-project#22660)

fa3b8e3

Signed-off-by: donglu <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[New Model] Support Command-A-Vision #22660

[New Model] Support Command-A-Vision #22660

Uh oh!

dongluw commented Aug 11, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Aug 11, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

DarkLight1337 left a comment

Uh oh!

hmellor commented Aug 11, 2025

Uh oh!

hmellor commented Aug 11, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[New Model] Support Command-A-Vision #22660

[New Model] Support Command-A-Vision #22660

Uh oh!

Conversation

dongluw commented Aug 11, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Essential Elements of an Effective PR Description Checklist

Purpose

Test Plan

Test Result

(Optional) Documentation Update

Uh oh!

github-actions bot commented Aug 11, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

DarkLight1337 left a comment

Choose a reason for hiding this comment

Uh oh!

hmellor commented Aug 11, 2025

Uh oh!

hmellor commented Aug 11, 2025

Uh oh!

Uh oh!

Uh oh!

dongluw commented Aug 11, 2025 •

edited by github-actions bot

Loading