Feature Request: Allowing the import of already archived URLs. #457
Labels
status: idea-phase
Work is tentatively approved and is being planned / laid out, but is not ready to be implemented yet
why: functionality
Intended to improve ArchiveBox functionality or features
Type
What is the problem that your feature request solves
Hi, thanks a lot for ArchiveBox! In the documentation, it's stated that "[..] ArchiveBox will never re-download sites that have already succeeded previously."
But what if I wanted to periodically export a website whose content's change are interesting, for example the frontpage of a newspaper website?
Having a way to export the same website several times could provide several snapshots over months or years that could be very interesting.
Describe the ideal specific solution you'd want, and whether it fits into any broader scope of changes
I think an optional argument to the
archivebox add
command would fit ideally. It could be named--force-add
or--add-if-present
or any similar name (I'm bad at naming things). A general configuration settings could also be considered, but I think the optional argument is more fitting.I don't know the underlying changes required specially to the database models, so I can't really estimate the difficulty of my request.
Thanks in advance!
How badly do you want this new feature?
The text was updated successfully, but these errors were encountered: