Scrapy-GoodReads

Description 📝

This repository is my first attempt at web scraping with Scrapy, and GoodReads seemed like a good place to start. The spider yields the details of the books whose pages are listed in the start_urls of the /Learning/Spiders file.

The crawler retrieves three things for each book: the image URL, the title, and the description.
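As a rough illustration of what one scraped record could look like, here is a minimal sketch. The class name `BookRecord` and the field names `image_url`, `title`, and `description` are assumptions made for this example; the keys the spider actually emits may differ.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class BookRecord:
    """Hypothetical container for the three values scraped per book."""
    image_url: str
    title: str
    description: str


def to_json(record: BookRecord) -> str:
    """Serialize one record to a JSON string, keeping non-ASCII text readable."""
    return json.dumps(asdict(record), ensure_ascii=False)


if __name__ == "__main__":
    # Placeholder values, not real scraped data.
    example = BookRecord(
        image_url="https://example.com/cover.jpg",
        title="Example Title",
        description="Example description.",
    )
    print(to_json(example))
```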

To run the code 👨🏽‍💻

pip install -r requirements.txt

Change directory to Learning/spider

scrapy crawl GoodReads -o BooksData.json
(stores the output in BooksData.json; note that this appends to the file rather than overwriting it)

scrapy crawl GoodReads
(runs the spider normally and displays the output)

Future prospects

As of now, the links have to be entered manually in the scraper.py file. I would like to change this so that they are passed as a command-line argument.
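One possible way to do this is through Scrapy's spider arguments: values given with `-a name=value` on the `scrapy crawl` command line are passed to the spider's constructor as keyword arguments. The sketch below only covers the parsing step, turning a comma-separated string into a list of links. The argument name `urls` and the helper `parse_start_urls` are hypothetical and not part of this repository.

```python
from typing import List, Optional


def parse_start_urls(urls_arg: Optional[str]) -> List[str]:
    """Split a comma-separated string of links into a clean list.

    Surrounding whitespace is stripped and empty entries are dropped,
    so a trailing comma or a missing argument yields no bogus URLs.
    """
    if not urls_arg:
        return []
    return [part.strip() for part in urls_arg.split(",") if part.strip()]


if __name__ == "__main__":
    # Placeholder links, standing in for GoodReads book pages.
    print(parse_start_urls("https://example.com/book/1, https://example.com/book/2"))
```

In the spider, an `__init__(self, urls=None, *args, **kwargs)` method could then set `self.start_urls = parse_start_urls(urls)`, and the crawl would be started with something like `scrapy crawl GoodReads -a urls=<link1>,<link2>`.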

DeStRoYeR-droid/Scrapy-GoodReads
