GitHub - ramaseshan/webpage_saver: When given a list of urls , download the htmlpage and the assets

This Code is supposed to take a list of urls as input and download all the webpages along with their javascripts and css and store in seperate folders also replacing the code in the html pages.

Tech Stack : Python 2.7 urllib2 beautifulsoup

This for now works only with urls ending with .com and .org. Also static files with absolute and ../ are not considered. Only assets with urls like /static/css/somefile.css are considered. Example url : https://wiki.python.org/moin/Python2orPython3

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
index.py		index.py
replace_file.py		replace_file.py
urllist.txt		urllist.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

index.py

index.py

replace_file.py

replace_file.py

urllist.txt

urllist.txt

Repository files navigation

About

Releases

Packages

Languages

ramaseshan/webpage_saver

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages