-
-
Notifications
You must be signed in to change notification settings - Fork 686
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Scaper can not recognise anything on Allerhande.nl (ah.nl) anymore while it did in the past #2888
Comments
The scraper debugger returns: recipe_scrapers was unable to scrape this URL |
So it seems (as already suggested in the Discord by other members of the team) that the scraper is being blocked by the website. All i can get out of the scraper is the domain and via html mode the following "Access Denied" Message.
I'll be closing this, as there is not much mealie can do against that. |
I ran into the same issue. Doing some additional research gave me the following insights: A regular request with insomnia gave the same results as reported here. Thus this issue may easily be resolved if the request is altered in such a way that this information is send with the scrape request. tls spoofing can be directly build into mealie perhaps to be used optionally or by allowing the use of a proxy additionally the "Accept" header should probably be set next to the user-agent https://github.com/mealie-recipes/mealie/blob/mealie-next/mealie/services/scraper/scraper_strategies.py |
First Check
I used the GitHub search to find a similar issue and didn't find it.
I have verified that this issue is not related to the underlying library
hhyrsev/recipe-scrapers by 1) checking
the debugger and data is returned, 2)
verifying that there are errors in the log related to application level code, or
3) verified that the site provides recipe data, or is otherwise supported by
hhyrsev/recipe-scrapers
This issue can be replicated on the demo site (https://demo.mealie.io/)
Please provide 1-5 example URLs that are having errors
https://www.ah.nl/allerhande/recept/R-R1199309/courgettelasagne-met-3-kazen-en-gehakt
https://www.ah.nl/allerhande/recept/R-R1199239/vegan-groenterollade-met-saliestuffing-van-sanne-vogel
Please provide your logs for the Mealie container
docker logs <container-id> > mealie.logs
mealie-log.zip
Deployment
Docker (Synology)
The text was updated successfully, but these errors were encountered: