Skip to content
This repository has been archived by the owner on Nov 4, 2020. It is now read-only.
/ deephotel Public archive

scraping TripAdvisor, Booking.com with Scrapy

Notifications You must be signed in to change notification settings

gkzz/deephotel

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Only Scraping Hotel's Data @TripAdvisor.jp, Booking.com

Python Function To Scrape Hotel's data is hotel's name, address, review text, score, and so on, "Except pricing data"

Based on 'https://github.com/monkeylearn/hotel-review-analysis'

Special thanks to @monkeylearn

crawl to review-text of hotels in kyoto @TripAdvisor.jp

scrapy crawl tripadvisor_review -a start_url='https://www.tripadvisor.jp/Hotels-g298564-Kyoto_Kyoto_Prefecture_Kinki-Hotels.html' -o filename.csv

crawl to review-score of hotels in kyoto @TripAdvisor.jp

scrapy crawl tripadvisor_score -a start_url='https://www.tripadvisor.jp/Hotels-g298564-Kyoto_Kyoto_Prefecture_Kinki-Hotels.html' -o filename.csv

Thank you for @monkeylearn!!

crawl to review-text of hotels in osaka @Booking.com

scrapy crawl booking_review -a start_url='https://www.booking.com/searchresults.ja.html?city=-240905;ss=%E5%A4%A7%E9%98%AA%E5%B8%82' -o filename.csv

crawl to review-score of hotels in osaka @Booking.com

scrapy crawl booking_score -a start_url='https://www.booking.com/searchresults.ja.html?city=-240905;ss=%E5%A4%A7%E9%98%AA%E5%B8%82' -o filename.csv

As you know, I'm Python Beginner. I'm glad you to give feedback.