sf-query-benchmarks

Code demonstrating how to benchmark Snowflake queries.

This repo accompanies my Medium article.

I have generated some test data that can be loaded into Snowflake; the benchmarking queries can then be run against it.

Obtain the data:

Download the data from Google Drive.
1. The data is a gzip-compressed parquet file.
2. Set up your Snowflake credentials in the file snowflake_connection.py.
3. Run the script file_to_tbl.py to load the data into your Snowflake account.
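
For the credentials step above, here is a minimal sketch of one way to keep credentials out of the source file by reading them from environment variables. The variable names (SNOWFLAKE_USER and so on) and the helper functions are assumptions made for illustration; the repo's snowflake_connection.py may be organised differently.

```python
import os

# Hypothetical environment variable names, chosen for this sketch only.
PREFIX = "SNOWFLAKE_"
REQUIRED_VARS = ("SNOWFLAKE_USER", "SNOWFLAKE_PASSWORD", "SNOWFLAKE_ACCOUNT")
OPTIONAL_VARS = ("SNOWFLAKE_WAREHOUSE", "SNOWFLAKE_DATABASE", "SNOWFLAKE_SCHEMA")


def connection_params(env=os.environ):
    """Build keyword arguments for snowflake.connector.connect from env vars."""
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing Snowflake settings: " + ", ".join(missing))
    params = {}
    for name in REQUIRED_VARS + OPTIONAL_VARS:
        value = env.get(name)
        if value:
            # SNOWFLAKE_USER -> user, SNOWFLAKE_ACCOUNT -> account, ...
            params[name[len(PREFIX):].lower()] = value
    return params


def get_connection():
    """Open a connection; needs the snowflake-connector-python package."""
    # Imported lazily so connection_params() works without the connector installed.
    import snowflake.connector

    return snowflake.connector.connect(**connection_params())
```

With the variables exported in your shell, get_connection() returns a connection object that a loader script could use.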
You can also generate your own data. The script generate_data.py is provided for this, and you can modify it to generate data of your choice.
1. Install the requirements by running pip install -r requirements.txt in the repo root. Use python@3.10.
2. Go to the folder data_generation.
3. Create an empty folder pq.
4. Run python generate_data.py.
5. This generates a parquet file merge.parquet in the same folder. Move it up one level.
6. Run the script file_to_tbl.py to load the data into your Snowflake account.
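
If you want to adapt the generation step, the rough shape of such a script might look like the sketch below. It is not the contents of generate_data.py: the column names (id, category, value) and both function names are made up for illustration, and writing the file assumes pandas and pyarrow are installed.

```python
import random


def generate_rows(n_rows, n_categories=5, seed=0):
    """Return n_rows dicts with hypothetical columns: id, category, value."""
    rng = random.Random(seed)  # seeded so repeated runs produce the same data
    return [
        {
            "id": i,
            "category": f"cat_{rng.randrange(n_categories)}",
            "value": round(rng.uniform(0, 1000), 2),
        }
        for i in range(n_rows)
    ]


def write_parquet(rows, path="merge.parquet"):
    """Write rows to a gzip-compressed parquet file (needs pandas and pyarrow)."""
    import pandas as pd  # imported lazily; only required when writing

    pd.DataFrame(rows).to_parquet(path, compression="gzip")
```

For example, write_parquet(generate_rows(100_000)) would produce a merge.parquet in the current folder. A low-cardinality column such as category is the kind of column a pivot query would typically spread into separate output columns.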

Run the benchmarking queries:

Go to the folder benchmarking.
Run the script pivot_query.py to obtain the benchmarking results for the pivot query.
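
To give an idea of what benchmarking a query involves, here is a minimal timing harness. It is a sketch and not the code in pivot_query.py; the function name benchmark and its parameters are assumptions for illustration. It times any callable, so it can wrap a Snowflake query or anything else.

```python
import statistics
import time


def benchmark(run_query, repeats=5):
    """Call run_query() `repeats` times and summarise wall-clock time in seconds."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        run_query()
        timings.append(time.perf_counter() - start)
    return {
        "runs": repeats,
        "min": min(timings),
        "median": statistics.median(timings),
        "max": max(timings),
    }
```

With an open Snowflake cursor, usage could look like benchmark(lambda: cursor.execute(sql).fetchall()). When comparing queries, it may be worth running ALTER SESSION SET USE_CACHED_RESULT = FALSE first, so that repeated runs are not served from Snowflake's result cache.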

prabodh1194/sf-query-benchmarks
