
Feature Request: Add support for 'Allow Large Results' to BigQuery connector #15

Closed
jreback opened this issue Feb 26, 2017 · 7 comments
Labels
type: feature request ‘Nice-to-have’ improvement, new feature or different behavior or design.

Comments

@jreback
Contributor

jreback commented Feb 26, 2017

xref pandas-dev/pandas#10474

gbq.py currently returns an error if the result of a query is what Google considers to be 'large'. The Google API allows jobs to be submitted with a flag that allows large results. It would be very beneficial to expose this as an option in the BigQuery connector.
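
For reference, the flag lives on the query job configuration in the BigQuery REST API. A minimal sketch of that resource is below; the project, dataset, and table names are placeholders:

# Sketch of a BigQuery query-job configuration that permits large results.
# With legacy SQL, allowLargeResults must be paired with a destination table.
# The project/dataset/table names are placeholders, not part of pandas-gbq.
job_configuration = {
    'query': {
        'query': 'SELECT * FROM [mydataset.mytable]',
        'allowLargeResults': True,
        'destinationTable': {
            'projectId': 'my-project',
            'datasetId': 'mydataset',
            'tableId': 'query_results'
        }
    }
}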

@jreback jreback added the type: feature request label Feb 26, 2017
@parthea parthea self-assigned this Mar 6, 2017
@jreback jreback modified the milestone: 0.3.0 Mar 11, 2017
@tswast
Collaborator

tswast commented Jun 12, 2017

People have posted some pretty elaborate workarounds on Stack Overflow: https://stackoverflow.com/questions/34201923/python-bigquery-allowlargeresults-with-pandas-io-gbq/34203369

@jasonqng
Contributor

jasonqng commented Oct 6, 2017

This can now be done in the open PR #25 by passing it via a configuration setting:

read_gbq(sql, configuration={"allow_large_results": True})

This uses the new google-cloud-python API.

@jreback jreback removed this from the 0.3.0 milestone Nov 25, 2017
@yahyamortassim

@jasonqng you also have to add destinationTable.

@tswast
Collaborator

tswast commented Nov 30, 2017

Actually, I think the current API does support this, even without #25. The read_gbq function accepts a configuration keyword argument, which is a BigQuery job configuration resource, so to allow large results one would do either of the following.

Standard SQL:

pd.read_gbq(
    query,
    'my-project',
    dialect='standard',
    configuration={
        'query': {
            'destinationTable': {
                'projectId': 'my-project',
                'datasetId': 'mydataset',
                'tableId': 'mytable'
            }
        }
    })

Legacy SQL:

pd.read_gbq(
    query,
    'my-project',
    dialect='legacy',
    configuration={
        'query': {
            'allowLargeResults': True,
            'destinationTable': {
                'projectId': 'my-project',
                'datasetId': 'mydataset',
                'tableId': 'mytable'
            }
        }
    })

Admittedly this is a bit onerous to do. We may wish to provide a friendlier interface for options such as these.
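
One possible shape for such an interface, sketched here as a hypothetical wrapper around read_gbq (the helper name and parameters are illustrative, not part of pandas-gbq):

import pandas as pd

def read_gbq_large(query, project_id, dataset_id, table_id, dialect='legacy'):
    # Hypothetical convenience wrapper: route results through a destination
    # table so a query with a large result set does not fail.
    configuration = {
        'query': {
            'allowLargeResults': True,  # ignored for standard SQL
            'destinationTable': {
                'projectId': project_id,
                'datasetId': dataset_id,
                'tableId': table_id
            }
        }
    }
    return pd.read_gbq(query, project_id, dialect=dialect,
                       configuration=configuration)

# df = read_gbq_large(query, 'my-project', 'mydataset', 'mytable')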

@Gitman-code

The updated answer on Stack Overflow suggests just using dialect='standard', as tswast did, but more simply as
pd.read_gbq(query, 'my-super-project', dialect='standard')

and notes: "allowLargeResults: For standard SQL queries, this flag is ignored and large results are always allowed." This worked for me, but maybe it is not generic.

@tswast
Collaborator

tswast commented Nov 30, 2017

I'm glad that worked for you. I believe there may be some size threshold where a destination table is required, even with standard SQL, but perhaps the threshold is larger than it was for legacy SQL.

@tswast tswast closed this as completed Feb 12, 2018
@tswast
Collaborator

tswast commented Feb 12, 2018

Closing, as this can be passed in via the configuration argument to read_gbq.
