This repository has been archived by the owner on Jul 3, 2023. It is now read-only.
A python script to strip unwanted attributes and tags from html.
evanmsanders/Html-Strainer
README for html strainer

Html Strainer is my attempt at creating a script to remove unwanted attributes and tags from legacy (or just foreign) html code.

How to use it.
==============

Copy the html you want to clean into the input.html file. Then fire up a terminal and run the script by typing:

'python html_strainer.py'

The cleaned html will appear in output.html for you to do what you want with it.

Notes
=====

1. It's currently only compatible with python 2.
2. Future versions will allow you to specify input and output files via commandline arguments. I just didn't have the time or knowledge to write this into the first version.
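
The README does not show the script's code, so the following is only a rough sketch of how the straining step and the planned commandline arguments might look. It is not the code in html_strainer.py. It assumes Python 3 and the standard library only (the original is Python 2 only). The allow-lists (ALLOWED_TAGS, ALLOWED_ATTRS) and the names Strainer, strain and main are hypothetical, chosen purely for illustration.

```python
import argparse
from html import escape
from html.parser import HTMLParser

# Hypothetical allow-lists: the README does not say which tags and
# attributes the real script keeps, so these are assumptions.
ALLOWED_TAGS = {"p", "a", "ul", "ol", "li", "em", "strong", "h1", "h2", "h3", "br"}
ALLOWED_ATTRS = {"href"}
VOID_TAGS = {"br"}  # allowed tags that never get a closing tag


class Strainer(HTMLParser):
    """Collects cleaned markup, dropping unwanted tags and attributes."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag not in ALLOWED_TAGS:
            return  # drop the tag itself; its text content is still kept
        kept = "".join(
            ' {}="{}"'.format(name, escape(value or "", quote=True))
            for name, value in attrs
            if name in ALLOWED_ATTRS
        )
        self.parts.append("<{}{}>".format(tag, kept))

    def handle_endtag(self, tag):
        if tag in ALLOWED_TAGS and tag not in VOID_TAGS:
            self.parts.append("</{}>".format(tag))

    def handle_data(self, data):
        # Text arrives unescaped (convert_charrefs=True), so escape it again.
        self.parts.append(escape(data, quote=False))


def strain(html_text):
    """Return html_text with disallowed tags and attributes removed."""
    parser = Strainer()
    parser.feed(html_text)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.parts)


def main(argv=None):
    """Read the input file, strain it, and write the output file."""
    ap = argparse.ArgumentParser(description="Strip unwanted attributes and tags from html.")
    # Defaults match the file names used in the README.
    ap.add_argument("input", nargs="?", default="input.html")
    ap.add_argument("output", nargs="?", default="output.html")
    args = ap.parse_args(argv)
    with open(args.input, encoding="utf-8") as src:
        cleaned = strain(src.read())
    with open(args.output, "w", encoding="utf-8") as dst:
        dst.write(cleaned)
```

To use the sketch as a script, call main() at the bottom of the file. With no arguments it reads input.html and writes output.html as described above. Passing two file names overrides both. Note that this sketch drops a disallowed tag but keeps the text inside it, so the contents of script or style elements would survive as plain text. A real cleaner would probably want to skip those entirely.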