Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

[2h馃憖馃摝] Auto-archive external links #1697

Open
nmorduch opened this issue Jul 13, 2021 · 2 comments
Open

[2h馃憖馃摝] Auto-archive external links #1697

nmorduch opened this issue Jul 13, 2021 · 2 comments
Assignees
Labels
enhancement tool/python python, django, wagtail

Comments

@nmorduch
Copy link
Member

Often publications reference links that eventually stop working, sometimes because the page has been taken down. In order to make references continue to be useful, I think it would be nice to archive them.

We could do something like Wikipedia does, e.g. in the NA article:

Hogan, Clara (April 3, 2013). "ANNE-MARIE SLAUGHTER NAMED NEXT PRESIDENT OF NEW AMERICA FOUNDATION". NEW AMERICA FOUNDATION. Archived from the original on April 7, 2013. Retrieved April 3, 2013.

@nmorduch nmorduch added this to Triage / Needs Estimate in Overall Kanban via automation Jul 13, 2021
@nmorduch nmorduch added 3 low enhancement tool/python python, django, wagtail labels Jul 13, 2021
@nmorduch nmorduch changed the title Auto-archive external links [2h馃憖馃摝] Auto-archive external links Jul 16, 2021
@nmorduch nmorduch moved this from Triage / Needs Estimate to Ready in Overall Kanban Jul 16, 2021
@nmorduch nmorduch added this to the 馃弮 July 2021 milestone Jul 16, 2021
@chigby
Copy link
Collaborator

chigby commented Jul 30, 2021

The Wayback Machine does offer this system for saving single web pages:

https://web.archive.org/save/

There is also their professional/paid service for larger-scale archiving: https://archive-it.org/blog/products-and-services/

There are also a few different read-only search options: https://archive.org/help/wayback_api.php

@chigby chigby moved this from Ready to In progress in Overall Kanban Jul 30, 2021
@nmorduch nmorduch moved this from In progress to Needs review in Overall Kanban Jul 30, 2021
@nmorduch nmorduch moved this from Needs review to In progress in Overall Kanban Jul 30, 2021
@nmorduch
Copy link
Member Author

The next question is what this would look like this on our end.

Estimating

  • [2h] Find all the links
  • [?] Set up archive service, check whether we would do it enough that we have to pay for it
  • [?] Save the archive link
  • [?] Allow overriding the archive link
  • [2h] Frontend display

Overall assessment is that this would take a fair amount of time and is not a high priority.

@nmorduch nmorduch moved this from In progress to Not Now / Blocked in Overall Kanban Aug 12, 2021
@nmorduch nmorduch removed this from the 馃弮 July 2021 milestone Aug 12, 2021
@nmorduch nmorduch removed the 3 low label Mar 31, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement tool/python python, django, wagtail
Projects
Overall Kanban
Not Now / Blocked
Development

No branches or pull requests

2 participants