项目作者: santhoshse7en

项目描述 :
Research Project to analyse the news articles about HIV & AIDS from various newspaper regions.
高级语言: Jupyter Notebook
项目地址: git://github.com/santhoshse7en/HIV-AIDS.git
创建时间: 2019-04-29T06:18:54Z
项目社区:https://github.com/santhoshse7en/HIV-AIDS

开源协议:MIT License

下载


HIV & AIDS

Research Project to analyse the news articles about HIV & AIDS from various newspaper regions. Data Extraction of HIV and AIDS articles from targeted regions and which is mostly on daily english newspaper publications.

For publication, top 5 or top 10 newspapers from targeted regions with duration from (2000 - 2019)

Targeted Newspapers Regions are

  • Afghanistan
  • Bangladesh
  • India
  • Indonesia
  • Vietnam

    Data Extraction parameters as follows

  • Headline
  • Description
  • Author
  • Published Date
  • Publication
  • Category
  • News
  • URL
  • Keywords
  • Summary

For each code files Proper comments are added for better understanding. All the above information with appropriate column headers and store the data in csv/sheet. Code and CSV fiels are stored under the folder.

The main aim of this project is to collect and analyze the information about of HIV in society.

  • Data Extraction
  • Analyzing the Data
  • Publish the Paper

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

MIT