Publish Redshift benchmark results #806

pankajkoti · 2022-09-08T16:16:25Z

Publish Redshift benchmark results for the native and default approaches.

The existing dataset failed with a schema mismatch error while inserting rows.
The pandas auto-detection created a schema with columns typed as varchar(256), which limits
values to 256 bytes. However, at least one row contained a value larger than 256
bytes, so the insert failed with a "value too long" error.
Hence, we have created a new fake dataset, stored in S3, with files
of the various sizes required for benchmarking, and have also updated
datasets.md to describe these new files.
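
To illustrate the failure described above, here is a minimal sketch of how one might check whether a dataset would overflow a varchar(256) column before loading it. It is not part of the SDK or of this PR: the helper names `max_utf8_bytes_per_column` and `columns_exceeding` and the sample rows are hypothetical, and it assumes the column length is counted in bytes of UTF-8 text, as the error in the description suggests.

```python
from typing import Dict, Iterable, List, Mapping

# Column width that the auto-detected schema used, per the description above.
VARCHAR_LIMIT_BYTES = 256


def max_utf8_bytes_per_column(rows: Iterable[Mapping[str, object]]) -> Dict[str, int]:
    """Return the largest UTF-8 encoded size seen for each string-valued column."""
    widths: Dict[str, int] = {}
    for row in rows:
        for column, value in row.items():
            if isinstance(value, str):
                size = len(value.encode("utf-8"))
                widths[column] = max(widths.get(column, 0), size)
    return widths


def columns_exceeding(
    rows: Iterable[Mapping[str, object]], limit: int = VARCHAR_LIMIT_BYTES
) -> List[str]:
    """List the columns whose widest value would not fit in `limit` bytes."""
    widths = max_utf8_bytes_per_column(rows)
    return sorted(column for column, width in widths.items() if width > limit)


# Hypothetical sample data: the last row has only 200 characters,
# but each character takes 2 bytes in UTF-8, so it still exceeds 256 bytes.
sample_rows = [
    {"id": "1", "comment": "short"},
    {"id": "2", "comment": "x" * 300},
    {"id": "3", "comment": "\u00e9" * 200},
]

print(columns_exceeding(sample_rows))
```

Measuring encoded bytes rather than characters matters here: a value can be under 256 characters long and still be rejected if it contains multi-byte characters.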

blocked by: #805
closes: #748

python-sdk/tests/benchmark/results.md

codecov · 2022-09-08T16:36:57Z

Codecov Report

Base: 93.25% // Head: 93.30% // Increases project coverage by +0.04% 🎉

Coverage data is based on head (52ff732) compared to base (8b79c39).
Patch has no changes to coverable lines.

❗ Current head 52ff732 differs from pull request most recent head 9b6c74f. Consider uploading reports for the commit 9b6c74f to get more accurate results

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #806      +/-   ##
==========================================
+ Coverage   93.25%   93.30%   +0.04%     
==========================================
  Files          47       44       -3     
  Lines        2046     1911     -135     
  Branches      256      237      -19     
==========================================
- Hits         1908     1783     -125     
+ Misses        107      100       -7     
+ Partials       31       28       -3

Impacted Files	Coverage Δ
python-sdk/src/astro/airflow/datasets.py	`83.33% <0.00%> (-11.12%)`	⬇️
python-sdk/src/astro/sql/operators/raw_sql.py	`86.36% <0.00%> (-3.30%)`	⬇️
python-sdk/src/astro/sql/operators/export_file.py	`93.93% <0.00%> (-0.80%)`	⬇️
python-sdk/src/astro/databases/snowflake.py	`95.95% <0.00%> (-0.66%)`	⬇️
python-sdk/src/astro/sql/operators/transform.py	`88.00% <0.00%> (-0.47%)`	⬇️
python-sdk/src/astro/sql/operators/dataframe.py	`93.05% <0.00%> (-0.37%)`	⬇️
...thon-sdk/src/astro/sql/operators/base_decorator.py	`94.05% <0.00%> (-0.29%)`	⬇️
python-sdk/src/astro/databases/base.py	`96.10% <0.00%> (-0.26%)`	⬇️
python-sdk/src/astro/databases/aws/redshift.py	`93.04% <0.00%> (-0.18%)`	⬇️
python-sdk/src/astro/sql/operators/load_file.py	`97.14% <0.00%> (-0.08%)`	⬇️
... and 26 more

dimberman · 2022-09-08T17:53:49Z

@pankajkoti why did we have to use a fake dataset for redshift? Can you please add this to the description?

pankajkoti · 2022-09-08T19:11:03Z

> @pankajkoti why did we have to use a fake dataset for redshift? Can you please add this to the description?

@dimberman I have updated the description. Please check.

utkarsharma2 · 2022-09-08T22:05:43Z

@pankajkoti Let's not merge this PR until #805 is resolved and merged.

Publish Redshift benchmark results

44bf6ba

pankajkoti requested review from dimberman, tatiana, utkarsharma2, sunank200, pankajastro and feluelle as code owners September 8, 2022 16:16

feluelle reviewed Sep 8, 2022

python-sdk/tests/benchmark/results.md (resolved)

sunank200 approved these changes Sep 8, 2022

Fix codespell check

52ff732

pankajkoti marked this pull request as draft September 8, 2022 22:36

utkarsharma2 marked this pull request as ready for review September 14, 2022 18:13

Merge branch 'main' into redshift-benchmark-results

9b6c74f

utkarsharma2 approved these changes Sep 14, 2022

utkarsharma2 merged commit f91a84a into main Sep 14, 2022

utkarsharma2 deleted the redshift-benchmark-results branch September 14, 2022 18:14
