Skip to content

This webscraper use BeautifullSoup and Requests librairies to scrap informations from the website books.toscrape.com/. It provides a csv file including all data scraped (texts,prices,images)

Notifications You must be signed in to change notification settings

Theosers/PriceWise-2.0.0-Your-Python-Solution-for-Extracting-Prices-on-Books-to-Scrape

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

57 Commits
 
 
 
 
 
 
 
 

Repository files navigation

PriceWise-2.0.0(Books Scraper)

Table of Contents

  1. General Info
  2. How to use this repository

General Info


This is the beta version of the price extraction program BS 2.0.0, developed in Python. The goal is to extract all kind of informations on the website Books to Scrape (http://books.toscrape.com/).

The informations are ( store in the file product.csv) :

  • Url
  • Universal product code (upc)
  • Title
  • Price including taxes
  • Price excluding taxes
  • Number of books available
  • Description
  • Catégory
  • Note of the reviews
  • Url of the picture

Then download and save the image file of each product.

How to use this repository


This program will require python 3.9.6 installed : https://www.python.org/downloads/

In a new virtual environment, install all dependency :

pip install -r requirements.txt

To execute BS :

launch the launcher.sh

sh launcher.sh

About

This webscraper use BeautifullSoup and Requests librairies to scrap informations from the website books.toscrape.com/. It provides a csv file including all data scraped (texts,prices,images)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published