A Python script that strips all lines between, and including, the <article> and </article> tags inside an HTML file.


This Python script is simple. When run in a directory, it recursively searches that directory and all of its subdirectories for files named "index.html" and extracts the <article> elements, along with their contents, into a file called "".

The original "index.html" files will remain unmodified and intact.
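The behavior described above can be sketched roughly as follows. This is a minimal illustration, not the script's actual source: the function names `extract_article_lines` and `extract_articles`, the output file name `extracted.html`, and the choice to write one output file next to each "index.html" are all assumptions made for the example. It also assumes the opening and closing tags can be found by simple line matching rather than with a full HTML parser.

```python
import os


def extract_article_lines(lines):
    """Return every line from an opening <article> tag through its
    closing </article> tag, inclusive. Hypothetical helper."""
    kept = []
    inside = False
    for line in lines:
        # An opening tag may carry attributes, so match on the "<article" prefix.
        if not inside and "<article" in line:
            inside = True
        if inside:
            kept.append(line)
            if "</article>" in line:
                inside = False
    return kept


def extract_articles(root, output_name="extracted.html"):
    """Walk `root` recursively and, for each index.html found, write its
    <article> lines to `output_name` in the same directory.

    `output_name` is a placeholder chosen for this sketch. The source
    files are only opened for reading, so they are never modified.
    Returns the list of output paths that were written.
    """
    written = []
    for dirpath, _dirnames, filenames in os.walk(root):
        if "index.html" not in filenames:
            continue
        source = os.path.join(dirpath, "index.html")
        with open(source, encoding="utf-8") as handle:
            kept = extract_article_lines(handle.readlines())
        if kept:
            target = os.path.join(dirpath, output_name)
            with open(target, "w", encoding="utf-8") as handle:
                handle.writelines(kept)
            written.append(target)
    return written
```

Under these assumptions, calling `extract_articles(".")` from the directory you want to scan would process every "index.html" beneath it and return the paths of the files it created.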


  • Can I use this in my XYZ program, script, etc.?

    • Sure. All I ask is that you credit my original work; otherwise, this is distributed under the MIT license.
  • Can I use this to steal copyrighted work and claim it as my own?

    • No. Don't be a douchebag; always credit the original author for their work.