Project author: chris-bbrs
Project description:
PDF merging and scraping for NLP use
Primary language: Jupyter Notebook
Project URL: git://github.com/chris-bbrs/pdf-merging-and-scraping.git
PDF merging and scraping for NLP use
A Jupyter notebook that merges several PDF files into one, scrapes and cleans the text of the merged file, and leaves the result ready for whatever downstream use you need.
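
The merge, scrape, and clean steps could be sketched roughly as follows. This is a minimal sketch, not the notebook's actual code: it assumes the third-party `pypdf` package is available, and the function names `merge_pdfs`, `extract_text`, and `clean_text` are hypothetical. The cleaning rules are only one plausible choice and will likely need tuning per document.

```python
import re


def merge_pdfs(paths, out_path):
    """Concatenate the given PDF files, in order, into out_path."""
    # Assumption: pypdf is installed (pip install pypdf). It is imported
    # here so that clean_text() below stays usable without it.
    from pypdf import PdfWriter

    writer = PdfWriter()
    for path in paths:
        writer.append(str(path))  # appends every page of this file
    with open(out_path, "wb") as fh:
        writer.write(fh)
    return out_path


def extract_text(pdf_path):
    """Return the text of every page, joined by newlines."""
    from pypdf import PdfReader  # same assumption as above

    reader = PdfReader(str(pdf_path))
    # extract_text() can yield nothing for image-only pages, hence `or ""`.
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def clean_text(raw):
    """Normalise whitespace and line-break hyphenation in scraped text."""
    text = re.sub(r"-\n(?=[a-z])", "", raw)  # re-join words split across lines
    text = re.sub(r"[ \t]+", " ", text)      # collapse runs of spaces and tabs
    text = re.sub(r" ?\n ?", "\n", text)     # trim spaces around line breaks
    text = re.sub(r"\n{3,}", "\n\n", text)   # keep at most one blank line
    return text.strip()
```

A typical call chain would be `clean_text(extract_text(merge_pdfs(["a.pdf", "b.pdf"], "merged.pdf")))`, where the file names are placeholders.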
In the current example, I use NLP to search for specific phrases and the chapter each one most likely appears in.
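
One way to locate phrases and guess their chapter is sketched below. It assumes chapter headings sit on their own line in the form `Chapter N` (optionally followed by a title), which may not hold for every document. `find_phrases` is a hypothetical helper, and plain case-insensitive matching stands in here for whatever NLP method the notebook actually uses.

```python
import re

# Assumption: headings look like "Chapter 3" or "Chapter 3 Methods".
CHAPTER_RE = re.compile(r"^\s*chapter\s+\d+\b.*$", re.IGNORECASE)


def find_phrases(text, phrases):
    """Return (phrase, chapter heading, line number) for every match.

    The chapter is the most recent heading seen above the matching line,
    or None if the match comes before any heading.
    """
    patterns = [(p, re.compile(re.escape(p), re.IGNORECASE)) for p in phrases]
    hits = []
    current = None
    for lineno, line in enumerate(text.splitlines(), start=1):
        if CHAPTER_RE.match(line):
            current = line.strip()  # remember the heading, skip searching it
            continue
        for phrase, pattern in patterns:
            if pattern.search(line):
                hits.append((phrase, current, lineno))
    return hits
```

Because the chapter is inferred from the nearest preceding heading, the result is only a best guess: text extracted from multi-column or heavily formatted PDFs may place headings out of order.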