Full Python Web Scraper

Overview

This Python web scraper fetches a webpage and outputs the entire HTML structure as formatted JSON.
It recursively extracts every element, tag, attribute, and text node, providing a faithful JSON representation of the DOM tree.

Features

Fetches any page via URL
Recursively parses all HTML elements, attributes, and text
Outputs structured JSON representing the full DOM
Handles errors gracefully

Prerequisites

Python 3.6+
Required libraries:
- requests
- beautifulsoup4

Install dependencies with:

pip3 install requests beautifulsoup4

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
basic_webscraper.py		basic_webscraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Full Python Web Scraper

Overview

Features

Prerequisites

About

Uh oh!

Languages

License

SlamSb/basic-webscraper

Folders and files

Latest commit

History

Repository files navigation

Full Python Web Scraper

Overview

Features

Prerequisites

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages