Wallabag can't retrieve contents of the article (edn.com) #5745

Open
2 tasks done
nataloko opened this issue Apr 23, 2022 · 2 comments

Comments

@nataloko

Before submitting the issue, please read:
If wallabag can't parse / extract content for a given link, please first read the documentation about it:
http://doc.wallabag.org/en/user/errors_during_fetching.html#how-can-i-help-to-fix-that

We get a lot of requests about fetching config issues. It will help us A LOT if you try to fix it on your own by following the doc.
If you failed to fix it yourself, tick the following boxes:

  • I've tried myself without success
  • I've replaced HOST in the issue title with the host of the URL that can't be fetched (ie: nytimes.com, 20minutes.fr, bbc.com, etc.)

Content related:

Describe what's wrong:

Wallabag can't retrieve content from edn.com at all. I've tried several articles, and for every one of them I get a "wallabag can't retrieve contents for this article" message.

I'm using the wallabag.it service, so I don't have access to logs. I've tried the f43.me test and I get a curl timeout error, but I don't know what to do with that information.

@HolgerAusB

HolgerAusB commented Jul 6, 2023

I tried to solve this issue, but my config only works with FulltextRSS. :-(
fivefilters/ftr-site-config#1138

Wallabag/f43.me still throws a timeout error:
Request throw exception (with no response): cURL error 28: Operation timed out after 10001 milliseconds with 0 bytes received

I think this could be caused by Cloudflare or something similar.
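One way to test that guess outside of wallabag is to repeat the request with different client headers and see whether the outcome changes. The following is only a minimal sketch using Python's standard library, not graby's or wallabag's own code. The helper names `build_request` and `classify_fetch` are hypothetical, and the 10 second timeout is an assumption taken from the "10001 milliseconds" in the error message above.

```python
import socket
import urllib.error
import urllib.request

# Assumption: mirrors the roughly 10 second limit seen in the cURL error.
TIMEOUT_SECONDS = 10


def build_request(url, user_agent, referer=None):
    """Build a GET request with explicit client headers (hypothetical helper)."""
    headers = {"User-Agent": user_agent}
    if referer:
        headers["Referer"] = referer
    return urllib.request.Request(url, headers=headers, method="GET")


def classify_fetch(request, opener=urllib.request.urlopen, timeout=TIMEOUT_SECONDS):
    """Run the request and return a short label describing how it ended.

    `opener` is injectable so the function can be exercised without network access.
    """
    try:
        with opener(request, timeout=timeout) as response:
            body = response.read()
            return f"ok status={response.status} bytes={len(body)}"
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status (for example 403).
        return f"http-error status={exc.code}"
    except (socket.timeout, TimeoutError):
        # No answer within the time limit, comparable to cURL error 28.
        return "timeout"
    except urllib.error.URLError as exc:
        # urllib sometimes wraps a timeout inside URLError.
        if isinstance(exc.reason, (socket.timeout, TimeoutError)):
            return "timeout"
        return f"network-error {exc.reason}"
```

Calling `classify_fetch(build_request(url, user_agent))` once with a browser-like user agent and once with a plain one, from both a home connection and the server, may show whether the timeout depends on the client or on the network the request comes from. An "ok" result from one place and a "timeout" from another would be consistent with some kind of bot protection, but would not prove it.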

@HolgerAusB

From graby.log:

[2023-07-07 08:11:11] graby.INFO: Graby is ready to fetch [] []
[2023-07-07 08:11:11] graby.INFO: . looking for site config for edn.com in primary folder {"host":"edn.com"} []
[2023-07-07 08:11:11] graby.INFO: ... found site config edn.com.txt {"host":"edn.com.txt"} []
[2023-07-07 08:11:11] graby.INFO: Appending site config settings from global.txt [] []
[2023-07-07 08:11:11] graby.INFO: . looking for site config for global in primary folder {"host":"global"} []
[2023-07-07 08:11:11] graby.INFO: ... found site config global.txt {"host":"global.txt"} []
[2023-07-07 08:11:11] graby.INFO: Cached site config with key: edn.com {"key":"edn.com"} []
[2023-07-07 08:11:11] graby.INFO: . looking for site config for global in primary folder {"host":"global"} []
[2023-07-07 08:11:11] graby.INFO: ... found site config global.txt {"host":"global.txt"} []
[2023-07-07 08:11:11] graby.INFO: Appending site config settings from global.txt [] []
[2023-07-07 08:11:11] graby.INFO: Cached site config with key: global {"key":"global"} []
[2023-07-07 08:11:11] graby.INFO: Cached site config with key: edn.com.merged {"key":"edn.com.merged"} []
[2023-07-07 08:11:11] graby.INFO: Fetching url: https://www.edn.com/bootstrapping-your-op-amp-yields-wide-voltage-swings/ {"url":"https://www.edn.com/bootstrapping-your-op-amp-yields-wide-voltage-swings/"} []
[2023-07-07 08:11:11] graby.INFO: Trying using method "get" on url "https://www.edn.com/bootstrapping-your-op-amp-yields-wide-voltage-swings/" {"method":"get","url":"https://www.edn.com/bootstrapping-your-op-amp-yields-wide-voltage-swings/"} []
[2023-07-07 08:11:11] graby.INFO: Found user-agent "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:109.0) Gecko/20100101 Firefox/115.0" for url "https://www.edn.com/bootstrapping-your-op-amp-yields-wide-voltage-swings/" from site config {"user-agent":"Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:109.0) Gecko/20100101 Firefox/115.0","url":"https://www.edn.com/bootstrapping-your-op-amp-yields-wide-voltage-swings/"} []
[2023-07-07 08:11:11] graby.INFO: Use default referer "http://www.google.co.uk/url?sa=t&source=web&cd=1" for url "https://www.edn.com/bootstrapping-your-op-amp-yields-wide-voltage-swings/" {"referer":"http://www.google.co.uk/url?sa=t&source=web&cd=1","url":"https://www.edn.com/bootstrapping-your-op-amp-yields-wide-voltage-swings/"} []
[2023-07-07 08:11:21] graby.WARNING: Request throw exception (with no response): cURL error 28: Operation timed out after 10001 milliseconds with 0 bytes received {"error_message":"cURL error 28: Operation timed out after 10001 milliseconds with 0 bytes received"} []
[2023-07-07 08:11:21] graby.INFO: Data fetched: array{"effective_url":"https://www.edn.com/bootstrapping-your-op-amp-yields-wide-voltage-swings/","body":"","headers":[],"status":500} {"data":{"effective_url":"https://www.edn.com/bootstrapping-your-op-amp-yields-wide-voltage-swings/","body":"","headers":[],"status":500}} []

Setting a referer didn't help.
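The timestamps in the graby.log excerpt above already show where the time goes: the request starts at 08:11:11 and the warning arrives at 08:11:21. For longer logs, a small script can pull that gap out automatically. This is a rough sketch that assumes every entry follows the `[timestamp] channel.LEVEL: message` layout seen in the excerpt; the function names `parse_line` and `seconds_until_first_warning` are made up for illustration.

```python
import re
from datetime import datetime

# Assumption: entries look like "[2023-07-07 08:11:11] graby.INFO: message ...".
LINE_RE = re.compile(
    r"^\[(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\] "
    r"(?P<channel>\w+)\.(?P<level>[A-Z]+): (?P<message>.*)$"
)


def parse_line(line):
    """Return (timestamp, level, message) for a log entry, or None if it does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    timestamp = datetime.strptime(match["ts"], "%Y-%m-%d %H:%M:%S")
    return timestamp, match["level"], match["message"]


def seconds_until_first_warning(lines):
    """Seconds between the first 'Trying using method' entry and the next WARNING.

    Returns None when either entry is missing.
    """
    start = None
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        timestamp, level, message = parsed
        if start is None and message.startswith("Trying using method"):
            start = timestamp
        elif start is not None and level == "WARNING":
            return (timestamp - start).total_seconds()
    return None
```

Run against the excerpt above, the gap is 10 seconds, which lines up with the "10001 milliseconds" in the cURL message. That suggests the request ran into the configured time limit without receiving any bytes, rather than being rejected with an error response.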
