Conversation
This reverts commit c979730.
|
Improved the config check part, tested with internvl / intern-s1 / qwen3vl / qwen3 / qwen2.5vl / glm4.1v, seems good. |
|
May share the evalution test results |
|
LLM evaluation test failed by following #4094 |
Can reproduce the which is the same as the one mentioned in Therefore, I would conclude that this is a bug related to scheduling, rather than the current qwen3vl codes. |
Tested with VLMEvalKit, dataset: OCRBench, temperature: 0.7, max_new_tokens: 16384
Official ACC refers to: |
|
After merging main, benchmark serving by |
|
I have also deployed two additional models: Qwen/Qwen3-8B and OpenGVLab/InternVL3_5-8B. After benchmarking both services, I confirmed that they functioned properly. Therefore, I suspect that this PR may contain potential issues. |
@lvhan028 Tested with Qwen/Qwen3-8B, with the following benchmark settings, appears that the main branch code still triggers the
Therefore, I think does not really resolve the scheduling bug. Additionally, for pure text inputs, the qwen3vl text parts work as a class inherited from qwen3, unlikely to cause scheduling errors. I will insist on the view that this is not a bug caused by the current qwen3vl code. |
cc @grimoire |


Related
TODO