Does VLM support logprobs output? #3243

Open
hxyghostor opened this issue Mar 12, 2025 · 6 comments
@hxyghostor

Can the VLM (Qwen2-vl-2b-4bit) support logprobs output? I always get None.

[screenshot attached]

@lvhan028
Collaborator

Can you share code to reproduce the issue?

@hxyghostor
Author

hxyghostor commented Mar 12, 2025

Server:

lmdeploy serve api_server /question_classification/qwen2-vl-2b-4bit --server-port $PORT0 --backend turbomind --model-format awq --enable-prefix-caching --quant-policy 8

Client:

import base64
import requests

api_url = "http://localhost:10516/v1/chat/completions"

# Read the local image and encode it as a base64 data URL.
image_path = ""
with open(image_path, "rb") as f:
    encoded_image = base64.b64encode(f.read())
encoded_image_text = encoded_image.decode("utf-8")
base64_qwen = f"data:image;base64,{encoded_image_text}"

data = {
    "model": "/question_classification/qwen2-vl-2b-4bit",
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {
            "role": "user",
            "content": [
                {"type": "image_url", "image_url": {"url": base64_qwen}},
                {"type": "text", "text": ""},
            ],
        },
    ],
    # Request log probabilities for the generated tokens.
    "logprobs": True,
    "top_logprobs": 3,
}

chat_response = requests.post(api_url, json=data).json()
print("Chat response:", chat_response)

@lvhan028
Collaborator

Sorry, the PyTorch engine doesn't support logprobs output yet.

@hxyghostor
Author

Is there a time estimate for when turbomind will support Qwen2-VL?

@lvhan028
Collaborator

Probably by the end of this month. I am reviewing #3164. Once it is merged into the main branch, @irexyc can submit the implementation for qwen2-vl and qwen2.5-vl.

@hxyghostor
Author

OK, thanks for your reply.
