Skip to content

Web scraping to extract data from a website and stores it in a structured format. This project aims to assess your proficiency in web scraping techniques, data handling, error management.

Notifications You must be signed in to change notification settings

HackerajOfficial/Web-Scraping-Application

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Web Scraping Application

Description:

Web scraping to extract data from a website and stores it in a structured format. This project aims to assess your proficiency in web scraping techniques, data handling, error management.

Technologies Used:

  • bs4 / BeautifulSoup: Beautiful Soup is a Python library used for web scraping purposes. It provides tools for parsing HTML and XML documents and extracting useful information from them.

  • requests: The requests library in Python is a powerful and user-friendly tool for making HTTP requests.

Usage:

  1. Clone the repository to your local machine.
     git clone https://github.com/HackerajOfficial/Web-Scraping-Application.git
    
  2. Active environment
      venv\Scripts\activate
    
  3. Install the necessary dependencies
      pip install -r requirements.txt
    
  4. Start the Django development server:
      python scraper.py
    

Limitations:

  • Website Changes: Web scraping relies on the structure of the website being consistent over time. If the website's structure changes, such as class names or HTML layout, the scraping code may break.

About

Web scraping to extract data from a website and stores it in a structured format. This project aims to assess your proficiency in web scraping techniques, data handling, error management.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages