A Python script that scrapes structured data from an HTML page and exports it to CSV and JSON files.
Part of a Data Science practice project focusing on real-world web scraping.
- Extract_Text_Data.CSV: Headings, paragraphs, and list items.
- Extract_Table_Data.CSV: Product table (Name, Price, Stock).
- Product_Information.JSON: Book cards with title, price, stock, and button text.
- Form_Details.JSON: Form fields with name, type, placeholder, and label.
- Iframe_Links.JSON: Extracted video links.
- Featured_Products.JSON: Featured products with hidden prices and colors.
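
As a rough illustration of how two of the outputs listed above might be produced, here is a minimal sketch. It is not the code from `web_scrap_p.py`: the sample markup, the function names `extract_table` and `extract_form_fields`, and the `html.parser` backend are all assumptions made for the example, since the real page structure is not shown here.

```python
import json

import pandas as pd
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for the real page.
SAMPLE_HTML = """
<table id="products">
  <tr><th>Name</th><th>Price</th><th>Stock</th></tr>
  <tr><td>Laptop</td><td>$999</td><td>In Stock</td></tr>
  <tr><td>Mouse</td><td>$25</td><td>Out of Stock</td></tr>
</table>
<form>
  <label for="email">Email</label>
  <input id="email" name="email" type="email" placeholder="you@example.com">
</form>
"""


def extract_table(soup):
    """Turn the first <table> into a DataFrame, using the <th> cells as columns."""
    rows = soup.find("table").find_all("tr")
    headers = [th.get_text(strip=True) for th in rows[0].find_all("th")]
    records = [
        [td.get_text(strip=True) for td in row.find_all("td")]
        for row in rows[1:]
    ]
    return pd.DataFrame(records, columns=headers)


def extract_form_fields(soup):
    """Collect name, type, placeholder, and label text for each <input>."""
    fields = []
    for field in soup.find_all("input"):
        field_id = field.get("id")
        # Match a label through its "for" attribute, if the input has an id.
        label = soup.find("label", attrs={"for": field_id}) if field_id else None
        fields.append({
            "name": field.get("name"),
            "type": field.get("type"),
            "placeholder": field.get("placeholder"),
            "label": label.get_text(strip=True) if label else None,
        })
    return fields


soup = BeautifulSoup(SAMPLE_HTML, "html.parser")
table_df = extract_table(soup)
form_fields = extract_form_fields(soup)

# Serialize in memory; pass a file path to to_csv, or use json.dump, to write files.
table_csv = table_df.to_csv(index=False)
form_json = json.dumps(form_fields, indent=2)
```

To write the files named in the list, the same calls could be pointed at paths, for example `table_df.to_csv("Extract_Table_Data.CSV", index=False)`, and `json.dump(form_fields, f, indent=2)` inside an `open("Form_Details.JSON", "w")` block.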
- Python 3
- requests, BeautifulSoup4, pandas, lxml (installed with pip), and json (part of the Python standard library)
pip install requests beautifulsoup4 pandas lxml
python web_scrap_p.py