
TODO: Investigate archivebox and other options for mirroring reference content #139

Open
kurtseifried opened this issue Mar 2, 2023 · 2 comments
Labels: needs refining (This ticket requires some more details before it can be actioned), tooling (Any code / tooling related issues)

Comments

@kurtseifried
Contributor

TODO: Investigate archivebox and other options for mirroring reference content

I've reached out to several archival services, but they either don't reply or cost far too much. We should look at a self-hosted solution. Even if we run it non-publicly for now, we would at least capture the data.

https://github.com/ArchiveBox/ArchiveBox

Are there any other good open-source solutions?

@joshbuker added the "needs refining" and "tooling" labels and removed the "discussion" label on Mar 29, 2023
@kurtseifried
Contributor Author

https://github.com/ArchiveBox/ArchiveBox

TODO: make a VM with ... 10? 100? GB of disk space.
TODO: set up a VPN on the VM
TODO: set up ArchiveBox on the VM
TODO: extract ALL the URLs from the GSD and run them through ArchiveBox, monitoring how much disk space the archive uses
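
A rough sketch of the last TODO, in case it helps scope the work. It assumes the GSD is checked out locally as a tree of `.json` files and that reference URLs appear somewhere inside string values; the function names (`collect_urls`, `write_urls`, `dir_size_bytes`) and any paths passed to them are made up for illustration and are not part of the GSD tooling or of ArchiveBox.

```python
import json
import re
from pathlib import Path
from typing import Any, Iterator

# Loose pattern for http(s) URLs embedded in JSON string values.
URL_RE = re.compile(r"https?://[^\s\"'<>]+")


def iter_urls(node: Any) -> Iterator[str]:
    """Yield every http(s) URL found in any string inside a parsed JSON value."""
    if isinstance(node, str):
        yield from URL_RE.findall(node)
    elif isinstance(node, dict):
        for value in node.values():
            yield from iter_urls(value)
    elif isinstance(node, list):
        for item in node:
            yield from iter_urls(item)


def collect_urls(root: Path) -> list[str]:
    """Return the sorted, de-duplicated URLs from all *.json files under root."""
    urls: set[str] = set()
    for path in root.rglob("*.json"):
        try:
            data = json.loads(path.read_text(encoding="utf-8"))
        except (OSError, ValueError):
            # Skip unreadable or malformed files rather than aborting the run.
            continue
        for url in iter_urls(data):
            # Trim punctuation that often trails a URL inside prose.
            urls.add(url.rstrip(".,;)"))
    return sorted(urls)


def write_urls(urls: list[str], out_file: Path) -> None:
    """Write one URL per line, ready to be fed to an archiver."""
    out_file.write_text("".join(f"{u}\n" for u in urls), encoding="utf-8")


def dir_size_bytes(root: Path) -> int:
    """Total size of all regular files under root, for monitoring disk usage."""
    return sum(p.stat().st_size for p in root.rglob("*") if p.is_file())
```

Calling `write_urls(collect_urls(Path("gsd-database")), Path("urls.txt"))` (the checkout path is a placeholder) would produce a list that can be piped in with `archivebox add < urls.txt`, and `dir_size_bytes` can be pointed at the ArchiveBox data directory between batches to see how fast it grows. Feeding the URLs in small batches first would give a size estimate before committing to 10 or 100 GB.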

@kurtseifried
Contributor Author

ArchiveBox doesn't install cleanly on Ubuntu 20.10: the apt install fails, and there are lots of Node problems. Maybe try Docker instead?


2 participants