Skip to content

Add vision id for Qwen3-VL#4183

Merged
lvhan028 merged 6 commits intoInternLM:mainfrom
CUHKSZzxy:add-vision-id
Dec 8, 2025
Merged

Add vision id for Qwen3-VL#4183
lvhan028 merged 6 commits intoInternLM:mainfrom
CUHKSZzxy:add-vision-id

Conversation

@CUHKSZzxy
Copy link
Copy Markdown
Collaborator

@CUHKSZzxy CUHKSZzxy commented Dec 5, 2025

Motivation

For multiple visual inputs, Qwen3-VL should be able to add index for each input automatically, via its chat template.
The prompt would be formatted like Picture1 xxx, Picture2 xxx, ... Picture n.
As claimed, it may improve multi-image benchmarking.

https://github.com/QwenLM/Qwen3-VL?tab=readme-ov-file#add-ids-for-multiple-visual-inputs

Usage

User can add vision id by adding "add_vision_id": true in the message, as follows:

curl http://0.0.0.0:23333/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen/Qwen3-VL-8B-Instruct",
    "messages": [
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "Can you describe these images?"
                },
                {
                    "type": "image_url",
                    "image_url": {
                        "url": "https://raw.githubusercontent.com/open-mmlab/mmdeploy/main/tests/data/tiger.jpeg"
                    }
                },
                {
                    "type": "image_url",
                    "image_url": {
                        "url": "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-VL/assets/demo.jpeg"
                    }
                }
            ]
        }
    ],
    "add_vision_id": true
  }'

@CUHKSZzxy CUHKSZzxy requested a review from lvhan028 December 5, 2025 07:32
@lvhan028 lvhan028 merged commit 322b133 into InternLM:main Dec 8, 2025
5 checks passed
@CUHKSZzxy CUHKSZzxy deleted the add-vision-id branch December 8, 2025 06:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants