Skip to content

infoculture/preserved-russia

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Idea

Preserved government is digital preservation project dedicated to preserve all digital public records made by governments.

For now it includes following activities:

  • government websites archiving
  • archiving of official social accounts - Twitter, Facebook, Flickr, Livejournal and others
  • government open data archiving
  • historical data archiving

More info - https://github.com/infoculture/preserved-russia/wiki

Archival procedure

To save website as WARC

Run python archive_wget.py [domain_name] for example "python archive_httrack.py www.gov.ru"

Use "getnohub.sh [domain_name]" to do it as background process

To save website using Httrack

Run python archive_httrack.py [domain_name] for example "python archive_httrack.py www.gov.ru"

Archive result as 7z archive. using "7z a [domain_name].7z [directory_name]"

Current documents

About

Russia data and documents digital preservation project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published