Python Web Scraper

A multi-source Python web scraping project that retrieves the latest headline data from major UK news platforms. This project demonstrates core concepts such as HTTP requests, HTML parsing, data extraction, and modular scraper architecture.

Features

Automated retrieval of headline data
Support for multiple news sources
- BBC News
- The Guardian UK
Modular scraping functions (scrape_bbc(), scrape_guardian(), scrape_all())
Clean and structured console output
Error-free BeautifulSoup parsing workflow

Technologies Used

Python 3
Requests
BeautifulSoup (bs4)

How It Works

The scraper consists of three main components:

`scrape_bbc()`

Fetches and extracts top headlines from BBC News.

`scrape_guardian()`

Fetches and extracts top headlines from The Guardian UK.

`scrape_all()`

Runs both scrapers together and prints aggregated results in a clear formatted structure.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitattributes		.gitattributes
README.md		README.md
scraper.py		scraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Python Web Scraper

Features

Technologies Used

How It Works

`scrape_bbc()`

`scrape_guardian()`

`scrape_all()`

Sample Output

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Python Web Scraper

Features

Technologies Used

How It Works

scrape_bbc()

scrape_guardian()

scrape_all()

Sample Output

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`scrape_bbc()`

`scrape_guardian()`

`scrape_all()`

Packages