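About accelerating GLM-4V inference with vLLM
The code is as follows:
How can I run batch inference, processing 4 images at once?
Single-image inference runs at 38 tokens/s, but when I iterate over a folder of images, the speed drops to 8 tokens/s. Has anyone run into this problem? Thanks!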
This model only supports a single image.
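If the limit is one image per request, one possible reading is that several single-image requests can still be grouped into a batch. The sketch below only builds such batches. It assumes the prompt dict shape from vLLM's multimodal documentation (`prompt` plus `multi_modal_data`). The helper names `chunked` and `build_requests`, the prompt text, and the file names are hypothetical and do not come from this thread. Whether GLM-4V accepts batched requests in a given vLLM version is not confirmed here.

```python
from typing import Any, Callable, Iterator


def chunked(items: list, size: int) -> Iterator[list]:
    """Yield consecutive slices of `items` with at most `size` elements."""
    if size < 1:
        raise ValueError("size must be >= 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def build_requests(
    image_paths: list, prompt: str, load_image: Callable[[str], Any]
) -> list:
    """Build one request per image, so each request carries exactly one image."""
    return [
        {"prompt": prompt, "multi_modal_data": {"image": load_image(path)}}
        for path in image_paths
    ]


# Hypothetical file names; `str` stands in for a real loader such as PIL.Image.open.
paths = [f"img_{i}.jpg" for i in range(10)]
batches = [
    build_requests(chunk, "Describe this image.", load_image=str)
    for chunk in chunked(paths, 4)
]
# Assumption: with vLLM, each batch would then be passed as the prompt list,
# e.g. llm.generate(batch, sampling_params), instead of one call per image.
```

Each batch here holds up to four requests, and every request still contains a single image.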
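Do you mean it does not support running inference on multiple image-text conversations at the same time?
Also, single-image inference runs at 38 tokens/s, but when I iterate over a folder of images, the speed drops to 8 tokens/s. What might be the cause? Thanks!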
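One way to narrow down a drop like this is to time image loading and generation separately, so the tokens/s figure for the folder loop can be compared with the single-image figure on the same basis. The sketch below is library-agnostic and does not identify the cause. `measure_throughput`, `load`, and `generate` are hypothetical names, and `generate` is assumed to return the number of tokens it produced.

```python
import time
from typing import Any, Callable, Iterable


def measure_throughput(
    items: Iterable[Any],
    load: Callable[[Any], Any],
    generate: Callable[[Any], int],
    clock: Callable[[], float] = time.perf_counter,
) -> dict:
    """Accumulate load time and generation time separately over all items."""
    load_s = 0.0
    gen_s = 0.0
    tokens = 0
    for item in items:
        t0 = clock()
        data = load(item)          # e.g. read and preprocess one image
        t1 = clock()
        tokens += generate(data)   # must return the count of generated tokens
        t2 = clock()
        load_s += t1 - t0
        gen_s += t2 - t1
    total = load_s + gen_s
    return {
        "tokens": tokens,
        "load_seconds": load_s,
        "generate_seconds": gen_s,
        # Throughput of generation alone, excluding loading.
        "generate_tokens_per_s": tokens / gen_s if gen_s > 0 else float("nan"),
        # Throughput as seen by the whole loop, including loading.
        "end_to_end_tokens_per_s": tokens / total if total > 0 else float("nan"),
    }
```

If the two rates differ a lot, the time is going into loading rather than generation. If they are close and both low, the slowdown is inside the generation call itself.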