goop

Google Search Scraper


Introduction

goop performs Google searches without being blocked by CAPTCHAs or hitting rate limits.

How it works

Facebook provides a debugger tool for its scraper. Interestingly, Google does not appear to limit requests made by this debugger (possibly because it is whitelisted), so it can be used to scrape Google search results without being blocked by the CAPTCHA.
Since Facebook is involved, a Facebook session cookie must be supplied to the library with each request.
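
To make the idea concrete, the request that passes through the debugger is presumably an ordinary Google results URL. The sketch below only builds such a URL; it is not taken from goop's source. The helper name `build_google_url` and the page size of 10 are assumptions, and its `page` and `full` parameters merely mirror the arguments of `goop.search` shown in the Example section. `q`, `start`, and `filter` are standard Google search query parameters.

```python
from urllib.parse import urlencode

RESULTS_PER_PAGE = 10  # assumption: Google's default page size


def build_google_url(query, page=0, full=False):
    """Hypothetical helper: build a Google results URL for a zero-based page."""
    # "start" is the offset of the first result on the requested page
    params = {"q": query, "start": int(page) * RESULTS_PER_PAGE}
    if full:
        # filter=0 asks Google to include results it would otherwise omit
        params["filter"] = 0
    return "https://www.google.com/search?" + urlencode(params)


print(build_google_url("red shoes", page=1))
```

How goop hands this URL to the Facebook debugger and attaches the session cookie is not covered here.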

Usage

Installation

pip install goop

Example

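from goop import goop

# First page of results (pages are numbered from 0)
page_1 = goop.search('red shoes', '<your facebook cookie>')
# Second page of the same query
page_2 = goop.search('red shoes', '<your facebook cookie>', page='1')
# full=True also returns the results Google would normally omit
include_omitted_results = goop.search('red shoes', '<your facebook cookie>', page='8', full=True)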

The return value is a dict with the following structure:

{
    "0": {
        "url": "https://example.com",
        "text": "Example webpage",
        "summary": "This is an example webpage whose aim is to demonstrate the usage of ..."
    },
    "1": {
...
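
Because the keys of the returned dict are strings, iterating in ranking order needs a numeric sort (otherwise "10" would sort before "2"). A minimal sketch of that, using a hypothetical helper `iter_results` and made-up sample data in the shape shown above:

```python
def iter_results(results):
    """Yield (rank, url, text, summary) tuples in numeric key order."""
    for key in sorted(results, key=int):
        entry = results[key]
        # summary is read defensively in case an entry lacks one (assumption)
        yield int(key), entry["url"], entry["text"], entry.get("summary", "")


# Sample data for illustration only; real data would come from goop.search
sample = {
    "1": {"url": "https://example.org", "text": "Another page", "summary": "..."},
    "0": {"url": "https://example.com", "text": "Example webpage", "summary": "..."},
}

for rank, url, text, summary in iter_results(sample):
    print(f"{rank}: {text} <{url}>")
```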

cli.py demonstrates usage by performing Google searches from the terminal with the following command:

python cli.py <query> <number_of_pages>
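
For readers who want to write a similar wrapper, the sketch below shows one way a script could accept the same two arguments and fetch several pages. It is not the contents of cli.py; `parse_args` and `collect_pages` are hypothetical names, and the search function is passed in as a parameter so the loop can be tried without a network connection. The page number is passed as a string because that is what the example above does.

```python
import argparse


def parse_args(argv=None):
    """Parse <query> and <number_of_pages> from the command line."""
    parser = argparse.ArgumentParser(description="Search Google from the terminal")
    parser.add_argument("query")
    parser.add_argument("number_of_pages", type=int)
    return parser.parse_args(argv)


def collect_pages(search, query, cookie, number_of_pages):
    """Call search once per zero-based page and return the result dicts in order."""
    return [
        search(query, cookie, page=str(page))
        for page in range(number_of_pages)
    ]
```

A script could then call `collect_pages(goop.search, args.query, cookie, args.number_of_pages)`, with the cookie read from wherever the user chooses to store it.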


Legal & Disclaimer

Scraping Google search results is illegal. This library is merely a proof of concept of the bypass. The author is not responsible for the actions of end users.
