项目作者: mgurg

项目描述 :
Python script to parse otomoto offers
高级语言: Jupyter Notebook
项目地址: git://github.com/mgurg/py_otomoto.git
创建时间: 2019-08-30T15:03:17Z
项目社区:https://github.com/mgurg/py_otomoto

开源协议:

下载


py_otomoto

Tool for webscrapping and parsing Toyota Yaris II offers on otomoto.pl website.

scrapy_otomoto

Scrapy 1.8.0 required, usage:

  1. cd ./scarapy_otomoto/
  2. scrapy crawl otomoto

This will generate otomoto_YYMMDD.html file with all cars that are currently available and YYMMDD folder. Folder contains separate JSON files with details of each car.

carDataParser - tool for parsing otomoto_YYMMDD.html file

carFeatureParser - tool for parsing JSON files

Result is stored in SQLite database


Initial code:
https://bananovitch.github.io/blog/2018/09/19/python-car-scraper.html