This project contains a Python script that extracts text content from web pages specified by a list of URLs. It utilizes BeautifulSoup and requests libraries for web scraping and parsing HTML content.
Extracts text from web pages based on specified HTML elements and class attributes. Simple and straightforward script for basic web scraping tasks. Easily customizable to handle different web page structures.
Python 3.x
requests, beautifulsoup4 Installation
-
Clone the repository to your local machine:
git clone https://github.com/SUDEEP-M-SHETTY/Price-Extractor.git
-
Install dependencies:
pip install -r requirements.txt
-
Usage: Edit the url1 list in the script to include URLs you want to scrape.
-
Run the script:
python extract_price.py
- Fork the repository.
- Create a new branch (git checkout -b feature/improvement).
- Make your changes.
- Commit your changes (git commit -am 'Add new feature').
- Push to the branch (git push origin feature/improvement).
- Create a new Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
Inspired by online tutorials and community forums on web scraping with Python. Thanks to BeautifulSoup and requests library developers for their contributions.