项目作者: chris-bbrs

项目描述 :
PDF merging and scraping for nlp use
高级语言: Jupyter Notebook
项目地址: git://github.com/chris-bbrs/pdf-merging-and-scraping.git
创建时间: 2020-10-23T01:00:50Z
项目社区:https://github.com/chris-bbrs/pdf-merging-and-scraping

开源协议:

下载


PDF merging and scraping for nlp use

A jupyter notebook for the use of merging pdf files into one, scrap and clean the text from the merged file and finally use them in any way you want.
In the current example I’m using nlp to search for specific phrazes and the possible chapter they’re in.