-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Site Archive Inconsistencies #8
Comments
But in general, yeah, the webpage mirroring feature is very barebones - the intended use case was to scrape just the front page and screenshots attached there, as that often includes instructions not included with games themselves. Getting image links correct, all the devlogs/comments, etc would require a lot more postprocessing. I'll try to find some time to fix up dates and at least partial site parsing in the coming weeks, but I've got a lot on my plate right now until end of February, so can't say when :/ |
Thanks for your careful consideration. Conceptually, I'm think I'm so keen on accurate mirroring because I can imagine a future where itch doesn't exist anymore, and this tool was used to back up a bunch of games and post them on archive.org. It would be a shame if anything was lost. |
Hey, I think a very good way to save webpages could be Monolith https://github.com/Y2Z/monolith |
I'm backing up games on Itch and I've noticed multiple inconsistencies with the archived pages generated by itch-dl.
--devlog
.I tried with and without using
--mirror-web
but there was not much of a difference. Screenshots are saved when specified but I did not note any additional benefit.The text was updated successfully, but these errors were encountered: