# yoloocr

**Repository Path**: pishenlll/yoloocr

## Basic Information

- **Project Name**: yoloocr
- **Description**: No description available
- **Primary Language**: Python
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2024-11-24
- **Last Updated**: 2024-11-24

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# RUN

> PaddleOCR/chinapetro/layout.py

# 运行完成目录结构

```
{PDF_filename}
└───figure
│   │   page001.png
│   │   page002.png
│   │   ...
│   │   
│   |
└───result
│   │   page001.png
│   │   page002.png
│   │   ...
│   │   
│   │   
└───res.json
    
```

# 模型文件路径

相关所有模型文件均在 \inference 目录下

改掉 /table/table.py中的模型路径 相对路径好像不行 绝对路径一定可以

| name      | path                 |
|-----------|----------------------|
| 表格模型      | slanet_lcnetv2_infer |
| 公式模型      | 暂无                   |
| 检测模型      | YOLOV8               |
| 表格中文本识别模型 | PP-OCR               |
| 通用OCR     | RapidOCR             |

# 部署BUG

table SLANet-LCNetV2
> pip 无法安装autolog参照如下步骤
>
> - git clone https://github.com/LDOUBLEV/AutoLog
> - pip3 install -r requirements.txt
> - python3 setup.py bdist_wheel
> - pip3 install ./dist/auto_log-1.0.0-py3-none-any.whl

# 调用bash

单独进行表格检测及识别

```bash
python table/predict_table.py --det_model_dir=../inference/ch_PP-OCRv4_det_infer --rec_model_dir=../inference/ch_PP-OCRv4_rec_infer --table_model_dir=../infer
ence/ch_ppstructure_mobile_v2.0_SLANet_infer --rec_char_dict_path=../ppocr/utils/ppocr_keys_v1.txt --table_char_dict_path=../ppocr/utils/dict/table_structure_dict_ch.txt --image_dir=docs/data/picture/table.png --output=docs/output/cmd/
```

# config

- PDF缩放因子确定4
- 模型路径配置

pdf {'text':['line1','line2'],'figure':[{"figure_caption":"","figure_path":"","figure_array":"","figure_describe":""}]
,'table':[{'html':""},{'table_caption':""}]}

# gotocr

- pip install verovio
- pip install tiktoken
- pip install 'accelerate>=0.26.0

# 测试问题

1. 多线程下文件的名字不能一样（出去后缀名后前面那个名字一样格式不一样的话转成pdf多线程直接认为是一个文件）
2. 如果一份文件的不同格式最后只会返回一个结果 字典的key不允许同名