4.3. OCR

这里介绍 OCR （光学字符识别）。

Alas 使用了 cnocr 作为 OCR 库，也针对碧蓝航线内的字体训练了两个 OCR 模型。许多脚本都迈不过 OCR 这道坎，需要依赖在线 OCR，但是在 Alas 里，你可以大量地调用 OCR。

需要注意的是 OCR 是无法达到 100% 正确率的，在调用时需要注意异常处理。

预训练模型

Alas 里有 3 个 OCR 模型：

cnocr 是 cnocr 的默认模型，支持中英文

# Folder: ./bin/cnocr_models/cnocr
# Size: 9.51MB
# Model: densenet-lite-gru
# Epoch: 39
# Validation accuracy: 99.04%
# Font: Various
# Charset: Number, English character, Chinese character, symbols, <space>
# _num_classes: 6426

azur_lane 针对碧蓝航线数字和字母，仿照 cnocr 默认模型训练

# Folder: ./bin/cnocr_models/azur_lane
# Size: 3.25MB
# Model: densenet-lite-gru
# Epoch: 15
# Validation accuracy: 99.43%
# Font: Impact, AgencyFB-Regular, MStiffHeiHK-UltraBold
# Charset: 0123456789ABCDEFGHIJKLMNPQRSTUVWXYZ:/- (Letter 'O' and <space> is not included)
# _num_classes: 39

jp 针对日文，同样仿照 cnocr 默认模型训练

具体信息忘了（逃

对 cnocr 的修改

Alas 对 cnocr 进行了简单的修改，module/ocr/al_ocr.py 覆写了一些 cnocr 的方法：

_assert_and_prepare_model_files

取消了自动下载默认模型
_preprocess_img_array

图片预处理改成纯 opencv 实现，提高速度
init

模型懒加载
_gen_line_pred_chars

去除了准确率在 0.5 以下的字符输出
debug

增加了一个展示预处理后的图片的方法

Ocr

通用的 OCR 类

以关卡名称 OCR 为例：

# 创建 Ocr 对象
# 可以是个 button，也可以是一个 list 的 button
ocr = Ocr(buttons, name='campaign', letter=(255, 255, 255), threshold=128,
          alphabet='0123456789ABCDEFGHIJKLMNPQRSTUVWXYZ-')
# 获取识别结果
result = ocr.ocr(image)

Digit

识别数字。返回 int

如果识别结果不能转换为 int，返回 0。

DigitCounter

识别数字计数，例如 14/15，返回 (14, 1, 15)。

如果识别结果不符合 {x}/{y} 的格式，返回 (0, 0, 0)。

Duration

识别时长，例如 08:00:00，返回 datetime.timedelta 对象。

如果识别结果不符合 {h}:{m}:{s} 的格式，返回时长为 0 的 datetime.timedelta 对象。

Home & Home_CN

Getting Started

Development

MISC

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

4.3. OCR

4.3. OCR

预训练模型

对 cnocr 的修改

Ocr

Digit

DigitCounter

Duration

Clone this wiki locally