项目作者: zevio

项目描述 :
PDF parser component (Apache Tika) for PCU project
高级语言: Python
项目地址: git://github.com/zevio/pcu_pdf.git
创建时间: 2018-09-10T12:02:21Z
项目社区:https://github.com/zevio/pcu_pdf

开源协议:GNU General Public License v3.0

下载


pcu_pdf (Apache Tika parser for PCU project)

PDF parser component (Apache Tika) for PCU project.
From the path of a PDF file, get its textual content.

Based on Apache Tika.

pdf


Check PCU project.

Usage in another project

If you wish to import this module in another Python project, please install it :

pip install pcu-pdf

Then, add this import line at the beginning of your Python file :

from pcu_pdf import pcu_pdf

You can now use pcu_pdf’s functions, for example :

pcu_pdf.PDFParser("path/to/pdf/file")

Test

To test your installation, go to pcu_pdf/ directory and execute the Makefile with the following command line :

make test