Skip to content

Xwarli/Archive-Sitemap-Monolith

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Sitemap_Monolith

Archives an entire website using monolith and sitemap-generator (and neatly puts it in a nice directory)

USAGE: Download and execute main.py as a superuser (sudo python3 main.py)


DEPENDENCIES

Uses two other repositories (please install before running Sitemap_Monolith):

Monolith (for the archiving) --> https://github.com/Y2Z/monolith

sitemap-generator (for the sitemap) --> https://github.com/Haikson/sitemap-generator


HOW IT WORKS

Essentially it takes a start url, and then generates an XML sitemap with sitemap-generator

Sitemap-Monolith then takes this XML file and strips out tags to generate a txt file of urls.

These are then passed one-by-one into monolith to create archive pages.

About

Using Monolith and Sitemap Generator to Archive an entire website in html format

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages