Skip to content

A Perl script for cleaning the Wikipedia specific detritus from the HTML code of article pages.

Notifications You must be signed in to change notification settings

fvlasie/WikiHTMLcleaner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

WikiHTMLcleaner

A Perl script for cleaning the Wikipedia specific detritus from the HTML code of article pages.

Test with:

curl "https://en.wikipedia.org/w/index.php?title=Hello&action=render" | perl WikiHTMLcleaner.txt | pbcopy

Paste into Safari's Develop>Snippet Editor to see the results.

This works with the output of Wikipedia's &action=render URL option which returns only the HTML of the article requested by the title= argument.

About

A Perl script for cleaning the Wikipedia specific detritus from the HTML code of article pages.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages