Use Scrapy to crawl text reviews & images from Dianping.com and generate pretty static pages!
Use Scrapy to crawl text reviews & images from Dianping.com and generate pretty static pages!
Images
../imgs/
, sorted by /user/shop/
IMAGES_STORE
in settings.py
Text Reviews
review.json
To Be Done..
start_urls
in dianping_spider.py
to the url of the review page that you want to crawl. e.g., click here to view my dianping reviews pageUnder ../Dianping-Gallery/dianping_gallery/dianping_gallery/spiders
, run:scrapy runspider dianping_spider.py -o review.json
The downloading process will then show in the command screen