Skip to content

Im-Rises/hotel_scraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

hotel_scraping

pythonLogo seleniumLogo plotlyDashLogo pythonanywhereLogo

Description

Scraping project made in python using selenium.
Project main goal is to scrap data from different website for study. The python scripts create bots that navigate through the different hotels' website (hotels.com, booking, trivago and kayak). It will use the research system, and it will scrap all hotels' data on every available pages (the data gathered are stored into different csv files, one csv file for each website).
Those data can be used by the GUI app website to search for the hotel that best suit you or for analysis. There is specific section on the gui app that allow you to study the price of the hotel and locate them on a map.

Scraped websites

  • hotels.com
  • booking.com
  • trivago.fr
  • kayak.fr

Hotels' data gathered

  • Name
  • Address
  • Price
  • Number of stars
  • Coordinates
  • Number of persons (adults, children)
  • Number of chambers

GUI application

We implemented a GUI application made in dash and hosted on pythonanywhere : http://maaelle.pythonanywhere.com/

The website is in French. No english translation has been made.

GUI source code

The code of the GUI available in this repository at https://github.com/clementreiffers/HotelScraping/tree/main/HotelScraping/website isn't our final GUI, it was only a test before coding it using Dash Python.

So if you want to check the code of the GUI, you need to go there :

https://github.com/maaelle/InterfaceHotel

GUI description

On the website you can search a hotel by different characteristics. You can also summarize check the mean and variance price per month of all the hotels.
Giving you information about the most expensive month for sleeping at the hotels.

GUI Images

Main page Research page
Image1 Image2
Statistics page Map page
Image3 Image4

Other sources

The source code of the website can be found here :
https://github.com/maaelle/InterfaceHotel

Flutter GUI prototype project :
https://github.com/clementreiffers/hotel-scraper-interface

APIs

Selenium :
https://www.selenium.dev

Pythonanywhere :
http://maaelle.pythonanywhere.com

Dash :
https://plotly.com/dash/

Contributors

Quentin Morel :

Clément Reiffers :

Maëlle Marcelin :

GitHub contributors

About

Hotels' scraping and data manipulation.

Topics

Resources

Stars

Watchers

Forks

Contributors 4

  •  
  •  
  •  
  •  

Languages