Simple Python Web Scraper

This is a simple command-line web scraper built with Python, requests, and BeautifulSoup.

It's designed to demonstrate the basic workflow of:

Fetching a web page's HTML.
Parsing the HTML to find specific data.
Saving that data to a structured JSON file.

This script scrapes all quotes and authors from http://quotes.toscrape.com/.

Tech Stack

Python 3
requests (for making HTTP requests)
beautifulsoup4 (for parsing HTML)

How to Run It

Clone the repo:

git clone [https://github.com/justknuth/python-web-scraper.git](https://github.com/justknuth/python-web-scraper.git)
cd python-web-scraper

Create a virtual environment:
```
python -m venv venv
```
Activate the virtual environment:
- On macOS/Linux (Bash):
```
source venv/bin/activate
```
- On Windows (Command Prompt or PowerShell):
```
.\venv\Scripts\activate
```
Install dependencies:
```
pip install -r requirements.txt
```
Run the scraper:
```
python scraper.py
```

Output

This script will print the authors it finds to the console and create a quotes.json file in the root of the project containing the full text and author for all scraped quotes.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
scraper.py		scraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Simple Python Web Scraper

Tech Stack

How to Run It

Output

About

Uh oh!

Releases

Packages

Languages

justknuth/python-web-scraper

Folders and files

Latest commit

History

Repository files navigation

Simple Python Web Scraper

Tech Stack

How to Run It

Output

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages