项目作者: ioilmio

项目描述 :
A web site data scraper built in Ruby fetching data from infamous dev.to site
高级语言: Ruby
项目地址: git://github.com/ioilmio/web-scraper.git
创建时间: 2020-05-29T14:51:00Z
项目社区:https://github.com/ioilmio/web-scraper

开源协议:MIT License

下载





Logo


Dev.to latest posts




This is the Capstone Project of the Microverse Ruby curriculum!


Explore the docs »




Report Bug
·

Request Feature


Table of Contents

About The Project

This is a scraper, a program written in Ruby.This specific scraper is built to scrape posts and authors from “Dev.to” site.Using the Nokogiri Gem to parse the HTML from the site,open URI to open the file in security, and some string manipulations to format them better for human readability.

Built With

This project was built using these technologies.

  • Ruby
  • Rubocop
  • VsCode
  • Open URI
  • Nokogiri Gem
  • Bundler Gem (for repl.it live Demo)

Live version

Run on Repl.it

Usage

If you want to install and use the scraper you need:

  • Ruby installed in your computer, version 2.6.5 recommended.
  • Go to https://github.com/ioilmio/web-scraper
  • Click the green button

    • Clone over HTTPS
  • Go into your terminal and type

    1. $ git clone https://github.com/ioilmio/web-scraper.git

    or

    1. $ git clone ctrl + shift + v
  • Type on your terminal bin/scraper.rb to get a list of articles.

You will get a lixt.txt file containing an Hash, in this Hash you can find clickable links for your article of interest.

Automated test

To run automated tests you can just type

  1. rspec
in the terminal inside the main directory.
You can also take a look at the doc for the test running
  1. rspec format doc
in the same directory.

Author

👤 Illuminato Salvatore

🤝 Contributing

Contributions, issues and feature requests are welcome!

Feel free to check the issues page.

Show your support

Give a ⭐️ if you like this project!

Acknowledgements