-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
推理速度 #4
Comments
您好,这可能是由于当前推理代码作为开源版本存在一部分mismatch的地方,我已经请 @Uranusxer 检查一下问题,多谢 |
您好,能否进一步提供详细代码和命令行信息?关于推理速度,部署模型本身需要1min左右的时间,单个问题则仅需1-2s,可以部署模型后批量推理;另外对同一个图片/视频做多次问答时,可以对图像/视频embedding在transformer层做kv cache加速,我们未来也计划做进一步更新。 |
-- encoding: utf-8 --import os, sys MASK_INDICES = [0, 1, 2] # Indices of mask features in original list def cal_score(args,image_path,prompt,model,text_processor_infer,image_processor): def main():
if name == "main": 使用上面的代码遇到同样的问题 |
以下是代码:
以下是命令行信息
|
你好,推理的时候输入768x1024的图片,大概要一分半钟出结果
另外会出现重复回答的问题
请问如何解决?
The text was updated successfully, but these errors were encountered: