Go4Explore-Website_Scrapping-Selenium-Python

Web scraping

Web scraping is the process of automatically extracting data from websites, web pages, and online documents. It involves using software or algorithms to navigate through websites, locate and extract specific data, and then store it in a structured format for further analysis or use.

Why should we use the data available on the internet?

Here are some real-world examples of why data on the internet is useful.

  1. Day-to-day use case

    • Most of us now shop online. Before buying something, we scroll through many websites and pages to compare the price, quality, and reliability of the product we want. Scraping can gather that information automatically.
  2. Organizational use cases

    • Organizations collect and analyze online data, such as customer feedback on products, to support decision-making and to identify market trends and patterns.

How to collect data from the internet

  • We can use web scraping to collect data from the internet.

Python has many packages for web scraping, such as:

  1. Beautiful Soup
  2. MechanicalSoup
  3. Selenium
  4. Scrapy

We can also collect data through APIs. Some websites load their data from APIs that are not publicly documented, and finding those endpoints lets you request the data directly instead of parsing the page.
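As a rough illustration of the API route, the sketch below requests a URL and decodes the JSON it returns, using only the standard library. The function name `fetch_json` and the `User-Agent` string are made up for this example, and no real Go4Explore endpoint is assumed; you would have to find a suitable URL yourself, for instance in the browser's network tab.

```python
import json
import urllib.request


def fetch_json(url: str, timeout: float = 10.0):
    """Request `url` and decode the response body as JSON.

    `fetch_json` is a hypothetical helper for this README, not part of
    any library.
    """
    request = urllib.request.Request(
        url,
        headers={
            "Accept": "application/json",
            # Illustrative identifier; some sites reject requests without one.
            "User-Agent": "trip-scraper-example/0.1",
        },
    )
    # The context manager closes the connection when the block ends.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A call would look like `fetch_json("https://example.com/api/trips")`, where the address is only a placeholder. If the site has no such endpoint, scraping the rendered page is the fallback.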

Here I have used Selenium to scrape trip data from the Go4Explore website.

What you need to scrape data from Go4Explore

  1. Python IDE: to write and run the code.
  2. Selenium: a Python package that automates a web browser so that data can be scraped from the pages it loads.
  3. Web driver: a tool that opens and controls the browser you choose (Chrome, Firefox, Edge, or Safari). Whichever browser you use, the driver must be compatible with your browser version and your operating system.
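The pieces listed above fit together roughly as in the sketch below, which assumes Chrome and Selenium 4. The helper names (`chrome_arguments`, `make_driver`, `print_page_title`) and the chosen browser switches are illustrative and may differ from what the notebook does.

```python
def chrome_arguments(headless: bool = True) -> list[str]:
    """Command-line switches passed to Chrome (illustrative choices)."""
    args = ["--window-size=1920,1080"]
    if headless:
        args.append("--headless=new")  # run without a visible window
    return args


def make_driver(headless: bool = True):
    """Start Chrome through Selenium and return the driver object."""
    # Imported here so chrome_arguments() works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options

    options = Options()
    for arg in chrome_arguments(headless):
        options.add_argument(arg)
    # Selenium 4.6+ can locate a matching chromedriver on its own;
    # older versions need the driver executable on PATH.
    return webdriver.Chrome(options=options)


def print_page_title(url: str) -> None:
    """Open `url` in the browser, print the page title, then clean up."""
    driver = make_driver()
    try:
        driver.get(url)
        print(driver.title)
    finally:
        driver.quit()  # always close the browser, even after an error
```

Calling `print_page_title` with the site's address is a quick way to check that the browser and driver versions match before writing any scraping logic.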

Here you can find more information about using Selenium with Python: https://selenium-python.readthedocs.io/

To see how I did the scraping, open the go4Explore.ipynb file, which contains all the code.

My goal was to find trip destinations along with their prices and discounts.

I chose to scrape the following information:

  • Trip Name
  • Prices
  • Discounts
  • Duration of the trip
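One way to turn these four fields into structured records is sketched below. It is not the notebook's code: the CSS selectors (`.trip-name` and so on) are hypothetical placeholders that must be replaced with the ones found by inspecting the page, and the `Trip` class and helper names are invented for this example.

```python
import csv
import re
from dataclasses import asdict, dataclass, fields

CSS = "css selector"  # same value as selenium's By.CSS_SELECTOR


@dataclass
class Trip:
    name: str
    price: int      # whole currency units, parsed from the price label
    discount: str   # kept as displayed text
    duration: str


def parse_price(text: str) -> int:
    """Return the first number in a price label, ignoring thousands commas."""
    match = re.search(r"\d[\d,]*", text)
    if match is None:
        raise ValueError(f"no number found in price text: {text!r}")
    return int(match.group().replace(",", ""))


def extract_trip(card) -> Trip:
    """Build a Trip from one card element.

    `card` is a Selenium WebElement, or any object offering the same
    find_element(by, value) method.
    """
    def text_of(selector: str) -> str:
        return card.find_element(CSS, selector).text.strip()

    return Trip(
        name=text_of(".trip-name"),                 # hypothetical selector
        price=parse_price(text_of(".trip-price")),  # hypothetical selector
        discount=text_of(".trip-discount"),         # hypothetical selector
        duration=text_of(".trip-duration"),         # hypothetical selector
    )


def save_trips(trips, path: str) -> None:
    """Write the trips to a CSV file with one column per field."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        columns = [field.name for field in fields(Trip)]
        writer = csv.DictWriter(handle, fieldnames=columns)
        writer.writeheader()
        writer.writerows(asdict(trip) for trip in trips)
```

With a live driver, the cards would come from a call such as `driver.find_elements(CSS, ".trip-card")`, again with a selector that has to be checked against the actual page.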
