You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
so far, so good. however, after crawling/scraping these with wayback, only the src url is scraped and rewritten, leading to the image on wayback'ed page still being served from the original server:
this is very obvious because the original site doesn't use https, so it leads to a broken image on the wayback machine view:
Obviously, the correct behavior here is that all of the images should be scraped (in this case they're just resizings, but in theory they could be completely different images—nothing prevents that) and rewritten.
Thanks! let me know if you need more information, or want me to whip up a more minimal test case
The text was updated successfully, but these errors were encountered:
Let me know if this isn't the right repo, but ran into an issue when testing archival features on http://www.goodbyetohalos.com/
Like many webcomics using wordpress nowadays, Goodbye to Halos uses html5 srcset attribute to displays different image sizes to different devices:
so far, so good. however, after crawling/scraping these with wayback, only the src url is scraped and rewritten, leading to the image on wayback'ed page still being served from the original server:
this is very obvious because the original site doesn't use https, so it leads to a broken image on the wayback machine view:
Obviously, the correct behavior here is that all of the images should be scraped (in this case they're just resizings, but in theory they could be completely different images—nothing prevents that) and rewritten.
Thanks! let me know if you need more information, or want me to whip up a more minimal test case
The text was updated successfully, but these errors were encountered: