You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I come from Wallabag, which is using this project.
I have problem retrieving content from a webpage, because graby automatically adds an ?_escaped_fragment_= at the end of the URL, for crawling AJAX purpose.
That's a problem because the website in question gives a 404 error when detecting this escaped fragment. Probably to avoid being fetched by robots ?
Still, the content seems to be accessible without the fragment.
A solution would be to try to fetch again the URL without this escaped fragment if a 404 error is answered ?
Hello !
I come from Wallabag, which is using this project.
I have problem retrieving content from a webpage, because graby automatically adds an
?_escaped_fragment_=
at the end of the URL, for crawling AJAX purpose.That's a problem because the website in question gives a
404
error when detecting this escaped fragment. Probably to avoid being fetched by robots ?Still, the content seems to be accessible without the fragment.
A solution would be to try to fetch again the URL without this escaped fragment if a
404
error is answered ?Here is the website, you can test with or without the escaped fragment:
https://dzone.com/
https://dzone.com/?_escaped_fragment_=
Thank you !
Antonin
The text was updated successfully, but these errors were encountered: