项目作者: harshul1610

项目描述 :
Optical character recognition Using Deep Learning
高级语言: Jupyter Notebook
项目地址: git://github.com/harshul1610/OCR.git
创建时间: 2018-06-09T03:20:06Z
项目社区:https://github.com/harshul1610/OCR

开源协议:MIT License

下载


Optical Character Recognition Using DeepLearning

Er. Harshul Jain, author

Text is everywhere! It is present in PDFs, docs as well as images. There are lots of applications where text data is useful for doing analytics. Such applications include receipts recognition, number plate detection, extracting the latex formulas from the images etc. General Computer Vision can be used for such task but it lacks in accuracy. In order to solve the low accuracy and variance problem, we use the state of the art deep neural networks.

This repository includes:

  1. 1. A TensorFlow implementation of the CNN+LSTM+CTC model for OCR.
  2. 2. supporting scripts to apply the RCNN appraoch for OCR.

Architecture

Architecture

Instructions on How to run

Get the repository

  1. git clone https://github.com/harshul1610/OCR.git

Get the NIST19 dataset

  1. mkdir data
  2. wget https://s3.amazonaws.com/nist-srd/SD19/by_class.zip
  3. unzip by_class.zip
  4. mv by_class NIST19

Get the Captcha data

  1. cd OCR
  2. python2 generate_captcha.py

Run the final notebook for training and testing

  1. CNN_LSTM_CTC_OCR-captcha.ipynb

LICENSE

MIT