项目作者: ilovin

项目描述 :
Use CTC + tensorflow to OCR
高级语言: Python
项目地址: git://github.com/ilovin/lstm_ctc_ocr.git
创建时间: 2017-03-30T07:37:28Z
项目社区:https://github.com/ilovin/lstm_ctc_ocr

开源协议:

下载


  • old master:
    • harder to converge compare to the beta version
    • both standard ctc and warpCTC
    • read data at once
  • dev:
    • the pipline version of lstm_ctc_ocr, resize to same size
    • use tf.records
  • beta (current):
    • generate data on the fly
    • deal with multi-width image, padding to same width

How to use

  1. ./train.sh

Dependency

Some details

The training data:
data

Notice that,
parameters can be found in ./lstm.yml(higher priority) and lib/lstm/utils/config.y
some parameters need to be fined tune:

  • learning rate
  • decay step & decay rate
  • image_height
  • optimizer?

in ./lib/lstm/utils/gen.py, the height of the images are the same, and I pad the width
to the same for each batch, so
if you want to use your own data, the height of the image shall be the same.

Result

The accurary can be more that 95%
acc

Read this blog for more details and this blog for how to
use tf.nn.ctc_loss or warpCTC