target_gcs

Read in stdin and write out to Google Cloud Storage.

Example usage

Install

python3 -m venv ./venv
source ./venv/bin/activate

Then

pip install https://github.com/anelendata/target_gcs/tarball/master

Or

git clone git@github.com:anelendata/target_gcs.git
pip install -e target_gcs

Configure

Sample configuration file

Note: As in the sample, you can use the following parameters in the blob name:

etl_datetime (ISO 8601 format)
etl_tstamp (unix time stamp)

Set the path to Google Cloud API's application credential JSON file:

export GOOGLE_APPLICATION_CREDENTIALS=./path_to/your_cred_file.json

Test

Make sure your service account associated with the crendential file has sufficient GCS permissions. If the bucket specified in the config does not exist, target_gcs tries to create one. In this case, the account needs Storage Admin. Otherwise, Object Createor at minimum.

echo -e '{"line": 1, "value": "hello"}\n{"line": 2, "value": "world"}' | target_gcs -c ./your-config.json

Here is the example to get USGS earthquake events data:

curl "https://earthquake.usgs.gov/fdsnws/event/1/query?format=geojson&starttime=2020-06-24&endtime=2020-06-25" | target_gcs -c ./your-config.json

Extra: Creating a schemaless, externally partitioned BigQuery table from GCS files

git clone git@github.com:anelendata/target_gcs.git
cd target_gcs
pip install google-cloud-bigquery

python create_schemaless_table.py -p your-project-id -g gs://your-bucket/your-dataset -d your-dataset-name -t your-table-name

Note: dataset must exist.

About this project

This project is developed by ANELEN and friends. Please check out the ANELEN's open innovation philosophy and other projects

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
target_gcs		target_gcs
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
create_schemaless_table.py		create_schemaless_table.py
sample_config.json		sample_config.json
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

target_gcs

Example usage

Install

Configure

Test

Extra: Creating a schemaless, externally partitioned BigQuery table from GCS files

About this project

About

Releases

Packages

Languages

License

anelendata/target-gcs

Folders and files

Latest commit

History

Repository files navigation

target_gcs

Example usage

Install

Configure

Test

Extra: Creating a schemaless, externally partitioned BigQuery table from GCS files

About this project

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages