Scraper

Description

This is a web scraper model for a larger project. The scraper has been designed using best practices for Scrapy and follows its recommended design patterns. It scrapes the books from https://books.toscrape.com. The book records are posted to a PostgreSQL database.

Getting Started

Running the Application

To run the Dockerized application:

cd prototype

docker-compose up

Installing Python3

Check if Python3 is installed on your operating system.

python3 --version

To install Python3 on Linux:

sudo apt update

sudo apt-get install python3

To install Python3 on Windows:

Visit https://www.python.org/downloads/ and download the appropriate installer for your operating system (32-bit or 64-bit).

Installing Pip

Check if pip is installed on your operating system.

pip --version

To install pip on Linux:

sudo apt update

sudo apt install python3-pip

To install pip on Windows:

curl https://bootstrap.pypa.io/get-pip.py -o get-pip.py

python3 get-pip.py

Initializing the Database

docker-compose -f db/docker-compose.yaml up -d

Running the Script

cd app

Locally:
- pip install -r book_scraper/requirements.txt
- python3 book_scraper/main.py
Python virtual environment:
- Windows (elevated command prompt):
  - run.bat
- Linux python venv:
  - chmod +x run.sh
  - run.sh

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
app		app
db		db
.gitignore		.gitignore
README.md		README.md
docker-compose.yaml		docker-compose.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

app

app

db

db

.gitignore

.gitignore

README.md

README.md

docker-compose.yaml

docker-compose.yaml

Repository files navigation

Scraper

Description

Getting Started

Running the Application

Installing Python3

Installing Pip

Initializing the Database

Running the Script

Contributors

About

Languages

zacharyrperales/Scraper

Folders and files

Latest commit

History

Repository files navigation

Scraper

Description

Getting Started

Running the Application

Installing Python3

Installing Pip

Initializing the Database

Running the Script

Contributors

About

Topics

Resources

Stars

Watchers

Forks

Languages