-
-
Notifications
You must be signed in to change notification settings - Fork 756
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to retrieve readable content http://habrahabr.ru/ #1541
Comments
Could you give us real link instead of Yep I don't speak russian |
I did a few tests and concluded that the part of the site can provide such an error because they think client is a bot and give ban. Maybe for advanced users make the settings so they could choose to use a browser? |
Thank you, everything works fine! |
How to add (how posible?) support of content in charset windows-1251 (or different) ? |
Find problem with url like http://habrahabr.ru/post/$someNumber/ and http://habrahabr.ru/company/$someCompany/blog/$someNumber/, but if use rss link part of content was grab, example link - http://habrahabr.ru/rss/post/$someNumber/.
At first time, i think this was problem with ru locale, but i was wrong, i think problem in -> 'default_parser' => 'libxml'.
The text was updated successfully, but these errors were encountered: