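MT Photos: standalone deployment for AI recognition tasks

Text recognition (OCR) API built on PaddleOCR
Image and text feature-extraction API built on Chinese-CLIP (the Chinese version of OpenAI's CLIP model)

Changelog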

V1.2.0

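Upgraded rapidocr in the onnx and openvino builds to 1.3.26, fixing high memory usage during OCR.
Changed the default port from 8000 to 8060, because port 8000 frequently conflicts with other services.

Directory layout

openvino: runs recognition tasks with the rapidocr_openvino library. For Intel Xeon and Core CPUs.
onnx: runs recognition tasks with the rapidocr_onnxruntime library. Works on any CPU.
cuda: runs recognition tasks with the official paddleocr library. For GPUs that support CUDA.
coreml: accelerates CLIP tasks with CoreML. For Macs with M-series processors.

On Intel CPUs, the OpenVINO version runs much faster.

For more RapidOCR configuration options, see the official repository: https://github.com/RapidAI/RapidOCR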

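Docker images

Docker Hub repository: https://hub.docker.com/r/mtphotos/mt-photos-ai

Image tags:

latest: built from the openvino folder. Recommended for machines with Intel CPUs.
onnx: built from the onnx folder. Recommended for machines with AMD CPUs.

No prebuilt CUDA image is published, because it would bundle a large number of driver-related files. Build it yourself if you need it.

Building the Docker image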

cd cuda
docker build  . -t mt-photos-ai:cuda-latest

cuda is the folder name; replace it with onnx or openvino as needed.

Running the Docker container

docker run -i -p 8060:8060 -e API_AUTH_KEY=mt_photos_ai_extra_secret --name mt-photos-ai --gpus all --restart="unless-stopped" mt-photos-ai:cuda-latest

cuda-latest can be replaced with latest or cpu-latest. The --gpus all flag is only needed for the CUDA image.

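Running from source

Install Python 3.10.
Choose the cuda, onnx, or openvino folder to match your hardware.
In the chosen folder, run pip install -r requirements.txt.
Copy .env.example to .env, then set API_AUTH_KEY in the .env file.
Run python server.py to start the service.

Install the paddlepaddle-gpu build that matches your CUDA version: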

https://www.paddlepaddle.org.cn/install/quick?docurl=/documentation/docs/zh/install/pip/windows-pip.html

Python installer download: https://www.python.org/downloads/release/python-3810/

API_AUTH_KEY is the value to enter in the api_key field in MT Photos.

The ./onnx/utils and ./openvino/utils directories each need two model files added by hand: vit-b-16.img.fp32.onnx and vit-b-16.txt.fp32.onnx.

Both files are available as release attachments: https://github.com/MT-Photos/mt-photos-ai/releases/tag/v1.1.0

To convert the ONNX models yourself, see:

https://github.com/OFA-Sys/Chinese-CLIP/blob/master/deployment.md#%E8%BD%AC%E6%8D%A2%E5%92%8C%E8%BF%90%E8%A1%8Connx%E6%A8%A1%E5%9E%8B

The following log output means the service has started successfully:

INFO:     Started server process [3024]
INFO:     Waiting for application startup.
INFO:     Application startup complete.
INFO:     Uvicorn running on http://0.0.0.0:8060 (Press CTRL+C to quit)

API

/check

Checks that the service is reachable and that the api-key is correct.

curl --location --request POST 'http://127.0.0.1:8060/check' \
--header 'api-key: api_key'

response:

{
  "result": "pass"
}
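
The same check can be scripted. Below is a minimal Python sketch using only the standard library. The helper names build_check_request and is_service_ok are hypothetical and not part of this project. The README does not document what /check returns for a wrong key, so the sketch treats anything other than {"result": "pass"} as a failure.

```python
import json
import urllib.request


def build_check_request(base_url: str, api_key: str) -> urllib.request.Request:
    """Build the POST request for /check, carrying the api-key header."""
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/check",
        method="POST",
        headers={"api-key": api_key},
    )


def is_service_ok(base_url: str, api_key: str, timeout: float = 5.0) -> bool:
    """Return True only if /check answers with {"result": "pass"}."""
    request = build_check_request(base_url, api_key)
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            payload = json.load(response)
    except (OSError, ValueError):
        # URLError and HTTPError are OSError subclasses (connection refused,
        # timeouts, non-2xx status); ValueError covers a non-JSON body.
        return False
    return isinstance(payload, dict) and payload.get("result") == "pass"
```

For example, is_service_ok("http://127.0.0.1:8060", "api_key") mirrors the curl command above.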

/ocr

curl --location --request POST 'http://127.0.0.1:8060/ocr' \
--header 'api-key: api_key' \
--form 'file=@"/path_to_file/test.jpg"'

response:

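result.texts: list of recognized text strings
result.scores: confidence score for each recognized text; 1 means 100%
result.boxes: position of each recognized text; x and y are the top-left corner, width and height are the size of the box

{
  "result": {
    "texts": [
      "recognized text 1",
      "recognized text 2"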
    ],
    "scores": [
      "0.98",
      "0.97"
    ],
    "boxes": [
      {
        "x": "4.0",
        "y": "7.0",
        "width": "283.0",
        "height": "21.0"
      },
      {
        "x": "7.0",
        "y": "34.0",
        "width": "157.0",
        "height": "23.0"
      }
    ]
  }
}
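
Because texts, scores, and boxes are parallel lists and the numbers arrive as strings in the sample above, a client usually needs to zip them together and convert the values. A minimal Python sketch follows. OcrLine, parse_ocr_response, and the min_score filter are illustrative names, not part of the API.

```python
import json
from dataclasses import dataclass
from typing import List


@dataclass
class OcrLine:
    text: str
    score: float
    x: float
    y: float
    width: float
    height: float


def parse_ocr_response(body: str, min_score: float = 0.0) -> List[OcrLine]:
    """Pair each text with its score and box, converting string numbers to floats."""
    result = json.loads(body)["result"]
    lines = []
    for text, score, box in zip(result["texts"], result["scores"], result["boxes"]):
        line = OcrLine(
            text=text,
            score=float(score),
            x=float(box["x"]),
            y=float(box["y"]),
            width=float(box["width"]),
            height=float(box["height"]),
        )
        # Drop low-confidence lines; min_score=0.0 keeps everything.
        if line.score >= min_score:
            lines.append(line)
    return lines
```

Calling parse_ocr_response(body, min_score=0.9) keeps only lines recognized with at least 90% confidence.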

/clip/img

curl --location --request POST 'http://127.0.0.1:8060/clip/img' \
--header 'api-key: api_key' \
--form 'file=@"/path_to_file/test.jpg"'

response:

results: feature vector of the image

{
  "results": [
    "0.3305919170379639",
    "-0.4954293668270111",
    "0.0217289477586746",
    ...
  ]
}
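
The curl --form upload used by /ocr and /clip/img can be reproduced in Python without third-party packages. The sketch below builds the multipart/form-data body by hand. The function names are hypothetical, and the sketch assumes the server only requires a form field named file, as in the curl commands above.

```python
import json
import mimetypes
import os
import urllib.request
import uuid
from typing import List


def build_upload_request(url: str, api_key: str, filename: str,
                         content: bytes) -> urllib.request.Request:
    """Build a multipart/form-data POST with a single part named "file"."""
    boundary = uuid.uuid4().hex
    mime = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    # Note: filenames containing double quotes are not escaped in this sketch.
    body = b"".join([
        b"--" + boundary.encode() + b"\r\n",
        b'Content-Disposition: form-data; name="file"; filename="'
        + filename.encode("utf-8") + b'"\r\n',
        b"Content-Type: " + mime.encode() + b"\r\n\r\n",
        content,
        b"\r\n--" + boundary.encode() + b"--\r\n",
    ])
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "api-key": api_key,
            "Content-Type": "multipart/form-data; boundary=" + boundary,
        },
    )


def extract_image_features(base_url: str, api_key: str, path: str,
                           timeout: float = 30.0) -> List[float]:
    """Upload an image to /clip/img and return its feature vector as floats."""
    with open(path, "rb") as f:
        content = f.read()
    request = build_upload_request(
        base_url.rstrip("/") + "/clip/img", api_key, os.path.basename(path), content
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    return [float(value) for value in payload["results"]]
```

Pointing build_upload_request at the /ocr URL instead gives the equivalent OCR upload.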

/clip/txt

curl --location --request POST 'http://127.0.0.1:8060/clip/txt' \
--header "Content-Type: application/json" \
--header 'api-key: api_key' \
--data '{"text":"飞机"}'

response:

results: feature vector of the text

{
  "results": [
    "0.3305919170379639",
    "-0.4954293668270111",
    "0.0217289477586746",
    ...
  ]
}
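
Feature vectors from /clip/img and /clip/txt are typically compared to rank images against a text query. The README does not say how MT Photos scores matches or whether the vectors are already normalized, so the following is only a sketch that assumes cosine similarity, the usual measure for CLIP features. The names to_vector and cosine_similarity are illustrative.

```python
import math
from typing import List, Sequence


def to_vector(results: Sequence[str]) -> List[float]:
    """Convert the string values in "results" to floats."""
    return [float(value) for value in results]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)
```

A higher value means the image and the text are closer in the shared feature space; sorting images by this value against one text vector yields a simple text-to-image search.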

/restart_v2

Restarts the process to free memory.

curl --location --request POST 'http://127.0.0.1:8060/restart_v2' \
--header 'api-key: api_key'

response:

The request is interrupted and nothing is returned, because the service restarts.
