-
Notifications
You must be signed in to change notification settings - Fork 977
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HTML parsing abnormal #469
Comments
I had the same problem. It is strange since I saw in youtube running similar code but with expected results but, it is not my experience.
text1 text2 text3 text4
text1
text2
text3
text4
----------------------------------------------------------------------------------------------------------------------
text2 text3 text4
text2
text3
text4
----------------------------------------------------------------------------------------------------------------------
text3 text4
text3
text4
----------------------------------------------------------------------------------------------------------------------
text4
text4
----------------------------------------------------------------------------------------------------------------------
div {'class': 'class1'} text1 div {'class': 'class2'} text2 div {'class': 'class3'} text3 div {'class': 'class4'} text4
text1
text1
--------------------------------------------------------------------------------
text2
text2
--------------------------------------------------------------------------------
text3
text3
--------------------------------------------------------------------------------
text4
text4
--------------------------------------------------------------------------------
|
previous results error pasting. The last results should be as follows:
|
I recently do a "conda update --all" and then find that the HTML parsing of requests-html begins to work abnormally. In particular, the objection gotten from html.find() still contains all content of the html, e.g. if a = html.find("something", first=True), then a.text still shows all text of the page.
I then create a clean environment with only requests-html and it works well. So I guess the cause might be some recent updated version of some other package in my main environment has conflict with HTML parsing in requests-html. But I have no idea how this would happen and what would be the potential problematic package.
Any suggestion will be appreciated.
The text was updated successfully, but these errors were encountered: