S3 Asset e2e annotated with TODOs #9876

Draft: wants to merge 4 commits into base: develop

jcampbell (Member)

  • Description of PR changes above includes a link to an existing GitHub issue
  • PR title is prefixed with one of: [BUGFIX], [FEATURE], [DOCS], [MAINTENANCE], [CONTRIB]
  • Code is linted - run invoke lint (uses ruff format + ruff check)
  • Appropriate tests and docs have been updated

For more information about contributing, see Contribute.

After you submit your PR, keep the page open and monitor the statuses of the various checks made by our continuous integration process at the bottom of the page. Please fix any issues that come up and reach out on Slack if you need help. Thanks for contributing!

jcampbell requested a review from cdkini on May 3, 2024 16:56

netlify bot commented May 3, 2024

Deploy Preview for niobium-lead-7998 ready!

Name Link
🔨 Latest commit 7cae6eb
🔍 Latest deploy log https://app.netlify.com/sites/niobium-lead-7998/deploys/663905e009af190008c08427
😎 Deploy Preview https://deploy-preview-9876.docs.greatexpectations.io

codecov bot commented May 3, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 64.21%. Comparing base (9b525ca) to head (7cae6eb).
Report is 8 commits behind head on develop.

Additional details and impacted files
@@             Coverage Diff              @@
##           develop    #9876       +/-   ##
============================================
- Coverage    78.61%   64.21%   -14.41%     
============================================
  Files          484      493        +9     
  Lines        42394    42537      +143     
============================================
- Hits         33330    27316     -6014     
- Misses        9064    15221     +6157     
| Flag | Coverage Δ |
|---|---|
| 3.10 | 64.21% <ø> (-0.26%) ⬇️ |
| 3.11 | 64.21% <ø> (-0.26%) ⬇️ |
| 3.11 athena or clickhouse or openpyxl or pyarrow or project or sqlite or aws_creds | ? |
| 3.11 aws_deps | ? |
| 3.11 big | ? |
| 3.11 databricks | ? |
| 3.11 filesystem | ? |
| 3.11 mssql | ? |
| 3.11 mysql | ? |
| 3.11 postgresql | ? |
| 3.11 snowflake | ? |
| 3.11 spark | ? |
| 3.11 trino | ? |
| 3.8 | 64.23% <ø> (-0.26%) ⬇️ |
| 3.8 athena or clickhouse or openpyxl or pyarrow or project or sqlite or aws_creds | ? |
| 3.8 aws_deps | ? |
| 3.8 big | ? |
| 3.8 databricks | ? |
| 3.8 filesystem | ? |
| 3.8 mssql | ? |
| 3.8 mysql | ? |
| 3.8 postgresql | ? |
| 3.8 snowflake | ? |
| 3.8 spark | ? |
| 3.8 trino | ? |
| 3.9 | 64.23% <ø> (-0.26%) ⬇️ |
| cloud | ? |
| docs-basic | ? |
| docs-creds-needed | ? |
| docs-spark | ? |

Flags with carried forward coverage won't be shown.


try:
suite = context.suites.get("project_name")
# TODO: error will change to ResourceNotFoundError
except DataContextError:
# TODO: will change to:
# suite = context.suites.add(name="project_name")
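
The try/except around `context.suites.get` above is a get-or-add pattern: look the suite up by name and create it only when the lookup fails. A minimal, self-contained sketch of that pattern follows. `SuiteStore`, `get_or_add_suite`, and the local `ResourceNotFoundError` class are hypothetical stand-ins written for illustration; they are not the Great Expectations API, and the exception name only mirrors what the TODO says the error will become.

```python
class ResourceNotFoundError(KeyError):
    """Hypothetical stand-in for the error the TODO says `get` will raise."""


class SuiteStore:
    """Hypothetical in-memory stand-in for `context.suites` (not the GX API)."""

    def __init__(self):
        self._suites = {}

    def get(self, name):
        # Raise a dedicated error instead of leaking the internal KeyError.
        try:
            return self._suites[name]
        except KeyError:
            raise ResourceNotFoundError(name) from None

    def add(self, name):
        # A plain dict stands in for a suite object.
        suite = {"name": name, "expectations": []}
        self._suites[name] = suite
        return suite


def get_or_add_suite(store, name):
    """Return the existing suite, creating it only when the lookup fails."""
    try:
        return store.get(name)
    except ResourceNotFoundError:
        return store.add(name=name)


store = SuiteStore()
first = get_or_add_suite(store, "project_name")   # lookup fails, suite is added
second = get_or_add_suite(store, "project_name")  # lookup succeeds, suite is reused
```

Catching one narrow exception keeps the "create on miss" branch from hiding unrelated failures, which appears to be the motivation for moving from `DataContextError` to a more specific error.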

"s3://nyc-tlc/trip data/yellow_tripdata_2019-01.parquet"
)

# TODO: column_index will not be required

Contributor commented:

completed in #9860

try:
data_source = context.datasources["project_name"]
# TODO: this will be updated to become
# data_source = context.data_sources.get("project_name")

import great_expectations.expectations as gxe
from great_expectations import get_context

# TODO: will become from great_expectations import get_context, ExpectationSuite

Contributor commented:

Should be fixed as a followup to https://greatexpectations.atlassian.net/browse/V1-67

# TODO: will become except ResourceNotFoundError:
except DataContextError:
# TODO: will become
# validation_definition = context.validation_definitions.add(

batch_parameters={"year": "2020", "month": "04"}
)

# TODO: This should only run on the latest batch, or it should fail entirely with an error that

Contributor commented:

This should be running on the latest batch already, right? Is this just about how we load data from all the assets? https://greatexpectations.atlassian.net/browse/V1-292 should tackle that.

context = get_context(project_root_dir="./")
# NOTE: It is critical to pass the batch_parameters to the run method, otherwise the validation will stall
# by trying to read all the data. We will have a fix in place before the final release.
# TODO: Implement fix for above issue
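
To illustrate why the NOTE above matters, here is a small sketch of how batch parameters can narrow a run to a single batch, and how omitting them matches everything. `select_batches` and the sample batch list are hypothetical and exist only for this example; they are not how Great Expectations resolves batches internally.

```python
def select_batches(batches, batch_parameters=None):
    """Return the batches whose metadata matches every given parameter.

    With no parameters, every batch matches, which mirrors the
    "read all the data" behavior the NOTE warns about.
    """
    if not batch_parameters:
        return list(batches)
    return [
        batch
        for batch in batches
        if all(batch.get(key) == value for key, value in batch_parameters.items())
    ]


# Hypothetical batch metadata: one entry per monthly file.
batches = [
    {"year": "2019", "month": "04"},
    {"year": "2020", "month": "03"},
    {"year": "2020", "month": "04"},
]

selected = select_batches(batches, {"year": "2020", "month": "04"})
everything = select_batches(batches)
```

With the parameters from the snippet above, only one batch is selected; without them, the caller would have to process all three.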

@@ -0,0 +1,31 @@
from great_expectations import get_context

# TODO: will become from great_expectations import Checkpoint

Contributor commented:

should be handled in followup to https://greatexpectations.atlassian.net/browse/V1-67

validation_definitions=[context.validation_definitions.get("my_project")],
actions=[
SlackNotificationAction(
# TODO: config variable substitution not working