项目作者: pratikpv

项目描述 :
Downloads news articles from Google news and uses pre-trained NLP models to perform sentiment analysis
高级语言: Python
项目地址: git://github.com/pratikpv/google_news_scraper_and_sentiment_analyzer.git


Google News scraper and sentiment analyzer using python

Summary:

  • Downloads news articles by searching on http://www.news.google.com with keywords of interest and specific date ranges.
  • Generates CVS files of news text
  • Uses pre-trained NLP models to perform sentiment analysis of the news text.

google_news_scraper.py

  • Is a scraper script which can search google news for a given date range using keywords of interest.
  • sample data generated by the script looks like this. ( for keywords ‘bitcoin cryptocurrency)
    alt text

google_news_sentiment_analysis.py

This framework is used in https://github.com/pratikpv/predicting_bitcoin_market

Credits:

Code from https://towardsdatascience.com/web-scraping-news-articles-in-python-9dd605799558 is referenced as base to write scraper code.