Simple Python client for interacting with Google BigQuery
Install pip and virtualenv if you do not already have them.
pip install --upgrade google-cloud-bigquery
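To verify the installation, you can print the library version from Python (recent releases of google-cloud-bigquery expose __version__):
from google.cloud import bigquery
print(bigquery.__version__)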
- Go to the Create service account key page in the GCP Console: https://console.cloud.google.com/apis/credentials/serviceaccountkey
- From the Service account drop-down list, select New service account.
- Enter a name into the Service account name field.
- From the Role drop-down list, select Project > Owner.
- Click Create. A JSON file that contains your key downloads to your computer.
- Set the environment variable GOOGLE_APPLICATION_CREDENTIALS to the file path of the JSON file that contains your service account key, as shown below.
export GOOGLE_APPLICATION_CREDENTIALS="[PATH]"
Replace [PATH] with the file path of the JSON file that contains your service account key.
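Alternatively, the client can load the key file directly instead of reading the environment variable; a minimal sketch, with [PATH] again standing in for your key file path:
from google.cloud import bigquery
# Build a client from the service account key file directly
bigquery_client = bigquery.Client.from_service_account_json('[PATH]')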
# Imports the Google Cloud client library
from google.cloud import bigquery
# Instantiates a client
bigquery_client = bigquery.Client()
# The name for the new dataset
dataset_id = 'my_new_dataset'
# Submit a query. client.query() starts the job asynchronously
# and uses standard SQL by default. `abc` is a placeholder table.
query_job = bigquery_client.query('SELECT * FROM `my_new_dataset.abc`')
# Wait for the query to finish and fetch the rows.
rows = query_job.result()
# Print rows.
for row in rows:
    print(row)
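Values can also be bound with query parameters instead of being spliced into the SQL string; a minimal sketch, assuming the placeholder table `my_new_dataset.abc` has an integer column named age:
# Run a parameterized query; @min_age is bound via job_config
job_config = bigquery.QueryJobConfig()
job_config.query_parameters = [
    bigquery.ScalarQueryParameter('min_age', 'INT64', 18),
]
query_job = bigquery_client.query(
    'SELECT * FROM `my_new_dataset.abc` WHERE age >= @min_age',
    job_config=job_config)
for row in query_job.result():
    print(row)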
In BigQuery you can manage the tables in a dataset: create and delete them, check whether they exist, and fetch their metadata (a metadata example follows the creation snippets below).
# Listing tables
dataset_ref = bigquery_client.dataset('dataset_name')
tables = list(bigquery_client.list_tables(dataset_ref))
for table in tables:
    print(table.table_id)
# Fetch a table reference
dataset = bigquery_client.dataset('dataset_name')
table = dataset.table('table_name')
# Check if a table exists: fetch it and catch NotFound
from google.cloud.exceptions import NotFound
try:
    bigquery_client.get_table(table)
    print('Table exists')
except NotFound:
    print('Table not found')
# Delete an existing table
bigquery_client.delete_table(table)  # raises NotFound if the table does not exist
# Create an empty table without a schema definition
table_created = bigquery_client.create_table(bigquery.Table(table))
print(table_created.table_id)
# Create an empty table with a schema definition
schema = [
    bigquery.SchemaField('full_name', 'STRING', mode='REQUIRED'),
    bigquery.SchemaField('age', 'INTEGER', mode='REQUIRED'),
]
table_with_schema = bigquery.Table(dataset.table('table_name'), schema=schema)
table_with_schema = bigquery_client.create_table(table_with_schema)
print(table_with_schema.table_id)
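The table metadata mentioned above can be read with get_table; a short sketch reusing the table reference from the snippets above:
# Fetch table metadata
table_info = bigquery_client.get_table(table)
print(table_info.num_rows)    # row count
print(table_info.schema)      # list of SchemaField objects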
# Fetch a dataset reference
dataset_ref = bigquery_client.dataset(dataset_id)
dataset = bigquery.Dataset(dataset_ref)
# Creating a dataset
dataset.location = 'US' # Specify the geographic location where the dataset should reside.
dataset = bigquery_client.create_dataset(dataset)
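Dataset properties can be changed after creation with update_dataset; a minimal sketch that sets a description (the description text is just an example):
# Update the dataset's description
dataset.description = 'Example dataset managed from Python'
dataset = bigquery_client.update_dataset(dataset, ['description'])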
# Listing datasets
datasets = list(bigquery_client.list_datasets())
for dataset in datasets:
    print('\t{}'.format(dataset.dataset_id))
# Delete a dataset that does not contain any tables
bigquery_client.delete_dataset(dataset_ref)
# Use the delete_contents parameter to delete a dataset and its tables
bigquery_client.delete_dataset(dataset_ref, delete_contents=True)
# Load a CSV file from Google Cloud Storage into the table.
# bucket_name and csv_file_path are placeholders for your bucket and object path.
job_config = bigquery.LoadJobConfig()
job_config.source_format = bigquery.SourceFormat.CSV
load_job = bigquery_client.load_table_from_uri(
    'gs://' + bucket_name + '/' + csv_file_path, table, job_config=job_config)
# result() blocks until the job finishes and raises if the job failed.
load_job.result()
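The same table can also be loaded from a local file rather than Cloud Storage; a minimal sketch, where data.csv is a placeholder path and job_config is reused from above:
# Load rows into the table from a local CSV file (data.csv is a placeholder)
with open('data.csv', 'rb') as source_file:
    load_job = bigquery_client.load_table_from_file(
        source_file, table, job_config=job_config)
load_job.result()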
# Export the table to Google Cloud Storage.
# bucket_name, project, and table_id are placeholders for your own values.
destination_uri = 'gs://{}/{}'.format(bucket_name, 'shakespeare.csv')
dataset_ref = bigquery_client.dataset(dataset_id, project=project)
table_ref = dataset_ref.table(table_id)
extract_job = bigquery_client.extract_table(
    table_ref,
    destination_uri,
    location='US')  # Location must match that of the source table.
extract_job.result()  # Waits for the job to complete.
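extract_table also accepts an ExtractJobConfig; a sketch that gzip-compresses the export, reusing the same placeholders:
# Export the table as a gzip-compressed CSV
job_config = bigquery.ExtractJobConfig()
job_config.compression = bigquery.Compression.GZIP
extract_job = bigquery_client.extract_table(
    table_ref,
    'gs://{}/{}'.format(bucket_name, 'shakespeare.csv.gz'),
    job_config=job_config,
    location='US')
extract_job.result()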