Google results not working everytime #1596

unixfox · 2019-05-22T13:46:16Z

I just experienced a recent bug where Google sometimes is trying to load his new UI even if JavaScript is not activated. This make Searx not finding any results because it can't parse the new UI:

Here is the saved HTML page generated by Google when it tries reply with his new UI:

And the old UI that Searx can parse without any issue:

To resolve this bug I had the idea to force the user agent to Internet Explorer 12 by adding:

params['headers']['user-agent'] = "Mozilla / 5.0(MSIE 12.0; Trident / 7.0; rv: 11.0) like Gecko"

I tried that trick and it worked everytime because Google know by default that it can't load his new UI on IE.

rachmadaniHaryono · 2019-05-22T21:19:38Z

is there a way to check if they use a new one? can you share the html file?

unixfox · 2019-05-22T21:42:01Z

I had to write the HTML code into a file with python like this:

file = open('google.html', 'w')
file.write(resp.text)
file.close()

To check if Searx received the new one or the older one.

Here is the .zip of the new UI and old UI HTML files. Please disable Javascript on your browser before because Google automatically redirect to a page.
GoogleUI.zip

rachmadaniHaryono · 2019-05-22T23:50:33Z

 <div class="ZINbbc xpd O9g5cc uUPGi">
  <div>
   <div class="jfp3ef">
    <a href="/url?q=https://www.eonline.com/&amp;sa=U&amp;ved=...&amp;usg=...">
     <div class="BNeawe vvjwJb AP7Wnd"> E! News </div>
     <div class="BNeawe UPmit AP7Wnd"> https://www.eonline.com </div>
    </a>
   </div>
   <!-- ... -->
  </div>
 </div>

it seem the content is actually still exist on new google ui, but the class name is obfuscated

i will try if it is possible to parse starting from the url /url?q=https://www.eonline.com/&sa=U&ved=...&usg=... instead of class name

immanuelfodor · 2019-05-25T07:07:36Z

Although I tried to rebuild the docker image from latest master, I can't get any results from Google, it just doesn't work, the blue warning is displayed. Had to turn on alternative engines to get back search results.

unixfox · 2019-05-25T09:39:54Z

My PR isn't merged yet you will need to apply my PR as a patch.

immanuelfodor · 2019-05-25T10:55:23Z

Hmm, okay, I'll give #1597 a try :)

immanuelfodor · 2019-05-25T11:23:39Z

It didn't work for me :( #1597 (review)

immanuelfodor · 2019-05-25T12:49:12Z

Update: @unixfox has just pushed a change, and #1597 is now working as expected! :)

unixfox · 2019-05-29T19:23:15Z

@immanuelfodor @rachmadaniHaryono
What do you think guys should I close this issue now that #1597 is merged?

immanuelfodor · 2019-05-30T02:50:31Z

It seems it was a quick fix for a bigger problem, so yes, but maybe a new issue should be created to handle the scrambled classes. I think we should be able to parse the results based on the html structure and the links, it's not the end of the world, but the google.py needs a lot of changes to make it work.

unixfox · 2019-05-30T09:19:16Z

I'm closing this issue, I opened a new one for that specific problem with the new UI: #1609

unixfox mentioned this issue May 22, 2019

[fix] Force Google old UI #1597

Merged

rachmadaniHaryono mentioned this issue May 25, 2019

Feature/parse obfuscated google #1603

Closed

unixfox mentioned this issue May 27, 2019

Search's are failing often with no results #1607

Closed

unixfox mentioned this issue May 30, 2019

Handle parsing of the new Google UI #1609

Closed

unixfox closed this as completed May 30, 2019

rachmadaniHaryono mentioned this issue Jun 26, 2019

[fix] Update xpaths for new Google results page #1628

Merged

unixfox mentioned this issue Oct 9, 2019

No results with google search #1704

Closed

unixfox mentioned this issue Nov 22, 2019

Google never returns results: Sorry! we didn't find any results. Please use another query or search in more categories. #1748

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Google results not working everytime #1596

Google results not working everytime #1596

unixfox commented May 22, 2019 •

edited

rachmadaniHaryono commented May 22, 2019

unixfox commented May 22, 2019

rachmadaniHaryono commented May 22, 2019

immanuelfodor commented May 25, 2019

unixfox commented May 25, 2019

immanuelfodor commented May 25, 2019

immanuelfodor commented May 25, 2019

immanuelfodor commented May 25, 2019 •

edited

unixfox commented May 29, 2019

immanuelfodor commented May 30, 2019

unixfox commented May 30, 2019

Google results not working everytime #1596

Google results not working everytime #1596

Comments

unixfox commented May 22, 2019 • edited

rachmadaniHaryono commented May 22, 2019

unixfox commented May 22, 2019

rachmadaniHaryono commented May 22, 2019

immanuelfodor commented May 25, 2019

unixfox commented May 25, 2019

immanuelfodor commented May 25, 2019

immanuelfodor commented May 25, 2019

immanuelfodor commented May 25, 2019 • edited

unixfox commented May 29, 2019

immanuelfodor commented May 30, 2019

unixfox commented May 30, 2019

unixfox commented May 22, 2019 •

edited

immanuelfodor commented May 25, 2019 •

edited