[MAINTENANCE] Remove some references to block style datasource #9868

tyler-hoffman · 2024-05-02T13:49:06Z

Removes all references to the block-style Datasource except those in new_datasource.py, abstract_data_context.py, and an __init__.py file. A large number of tests likely still depend on the existence of the class, though.

Description of PR changes above includes a link to an existing GitHub issue
PR title is prefixed with one of: [BUGFIX], [FEATURE], [DOCS], [MAINTENANCE], [CONTRIB]
Code is linted - run invoke lint (uses ruff format + ruff check)
Appropriate tests and docs have been updated



netlify · 2024-05-02T13:49:23Z

✅ Deploy Preview for niobium-lead-7998 canceled.

Name	Link
🔨 Latest commit	`04c2d0b`
🔍 Latest deploy log	https://app.netlify.com/sites/niobium-lead-7998/deploys/6635458b5c8e810008717680

codecov · 2024-05-02T14:07:09Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 78.07%. Comparing base (bd52c5f) to head (04c2d0b).

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #9868      +/-   ##
===========================================
- Coverage    78.25%   78.07%   -0.18%     
===========================================
  Files          484      484              
  Lines        42394    42390       -4     
===========================================
- Hits         33174    33096      -78     
- Misses        9220     9294      +74
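The percentages in the diff above can be reproduced from the hit and line counts. The sketch below assumes the percentages are truncated to two decimals rather than rounded to nearest; that assumption is chosen because it matches the reported figures, and it is not a statement about how Codecov computes them. The helper name `coverage_percent` is hypothetical.

```python
import math


def coverage_percent(hits: int, lines: int) -> float:
    """Coverage as a percentage, truncated (not rounded) to two decimals.

    Truncation is an assumption made to match the figures in the report.
    """
    return math.floor(hits / lines * 10_000) / 100


# Counts taken from the coverage diff: hits + misses == lines on both sides.
assert 33174 + 9220 == 42394  # base (develop)
assert 33096 + 9294 == 42390  # head (this PR)

base = coverage_percent(33174, 42394)
head = coverage_percent(33096, 42390)
delta = round(head - base, 2)

print(f"base={base}% head={head}% delta={delta}%")
```

Under this assumption, the drop comes almost entirely from 78 fewer hits; the line count only shrinks by 4.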

Flag	Coverage Δ
3.10	`64.26% <ø> (-0.14%)`	⬇️
3.11	`64.26% <ø> (-0.14%)`	⬇️
3.11 athena or clickhouse or openpyxl or pyarrow or project or sqlite or aws_creds	`53.87% <ø> (-0.14%)`	⬇️
3.11 aws_deps	`44.77% <ø> (-0.18%)`	⬇️
3.11 big	`55.76% <ø> (+<0.01%)`	⬆️
3.11 databricks	`45.95% <ø> (-0.15%)`	⬇️
3.11 filesystem	`61.21% <ø> (-0.07%)`	⬇️
3.11 mssql	`48.77% <ø> (-0.21%)`	⬇️
3.11 mysql	`48.83% <ø> (-0.22%)`	⬇️
3.11 postgresql	`52.76% <ø> (-0.15%)`	⬇️
3.11 snowflake	`46.56% <ø> (-0.18%)`	⬇️
3.11 spark	`57.18% <ø> (-0.01%)`	⬇️
3.11 trino	`50.75% <ø> (-0.15%)`	⬇️
3.8	`64.28% <ø> (-0.13%)`	⬇️
3.8 athena or clickhouse or openpyxl or pyarrow or project or sqlite or aws_creds	`53.88% <ø> (-0.14%)`	⬇️
3.8 aws_deps	`44.78% <ø> (-0.18%)`	⬇️
3.8 big	`55.77% <ø> (+<0.01%)`	⬆️
3.8 databricks	`45.96% <ø> (-0.15%)`	⬇️
3.8 filesystem	`61.22% <ø> (-0.07%)`	⬇️
3.8 mssql	`48.75% <ø> (-0.21%)`	⬇️
3.8 mysql	`48.81% <ø> (-0.22%)`	⬇️
3.8 postgresql	`52.74% <ø> (-0.15%)`	⬇️
3.8 snowflake	`46.57% <ø> (-0.18%)`	⬇️
3.8 spark	`57.14% <ø> (-0.01%)`	⬇️
3.8 trino	`50.73% <ø> (-0.15%)`	⬇️
3.9	`64.28% <ø> (-0.14%)`	⬇️
cloud	`0.00% <ø> (ø)`
docs-basic	`49.10% <ø> (-0.01%)`	⬇️
docs-creds-needed	`50.23% <ø> (-0.01%)`	⬇️
docs-spark	`48.31% <ø> (-0.01%)`	⬇️



tyler-hoffman · 2024-05-03T16:00:57Z

great_expectations/self_check/util.py

@@ -913,28 +913,6 @@ def build_sa_validator_with_data(  # noqa: C901, PLR0912, PLR0913, PLR0915
    if context is None:
        context = build_in_memory_runtime_context()

-    assert context is not None, 'Instance of any child of "AbstractDataContext" class is required.'


why was this stuff here? IDK, but things pass without it 🤷


tyler-hoffman · 2024-05-03T18:47:27Z

tests/data_context/test_data_context_state_management.py

@@ -226,246 +218,6 @@ def test_update_project_config(
    assert context.progress_bars["globally"] is True


-@pytest.mark.unit
-def test_add_datasource_with_existing_datasource(


we're going to remove datasource mgmt from context, so no use in reworking these tests.


joshua-stauffer · 2024-05-03T20:17:35Z

great_expectations/self_check/util.py

-            },
-        },
-    )
-    # Updating "execution_engine" to insure peculiarities, incorporated herein, propagate to "ExecutionEngine" itself.  # noqa: E501


joshua-stauffer · 2024-05-03T20:20:22Z

tests/data_context/test_data_context_datasources.py

-
-@pytest.mark.unit
-def test_get_datasource_cache_miss(in_memory_runtime_context) -> None:
-    """
-    What does this test and why?
-
-    For all non-Cloud contexts, we should leverage the underlying store in the case
-    of a cache miss.
-    """
-    context = in_memory_runtime_context
-
-    name = "my_fake_datasource_name"
-
-    # Initial GET will miss the cache, necessitating store retrieval
-    with mock.patch(
-        "great_expectations.datasource.datasource_dict.DatasourceDict.__getitem__"
-    ) as mock_get:
-        context.get_datasource(name)
-
-    assert mock_get.called
-
-    # Subsequent GET will retrieve from the cache
-    with mock.patch("great_expectations.data_context.store.DatasourceStore.get") as mock_get:
-        context.get_datasource(name)
-
-    assert not mock_get.called
-
-
-@pytest.mark.unit
-def test_BaseDataContext_add_datasource_updates_cache(
-    in_memory_runtime_context: EphemeralDataContext,
-    pandas_enabled_datasource_config: dict,
-) -> None:
-    """
-    What does this test and why?
-
-    For persistence-disabled contexts, we should only update the cache upon adding a
-    datasource.
-    """
-    context = in_memory_runtime_context
-
-    name = pandas_enabled_datasource_config["name"]
-
-    assert name not in context.datasources
-
-    context.add_datasource(**pandas_enabled_datasource_config)
-
-    assert name in context.datasources
-
-
-@pytest.mark.unit
-def test_BaseDataContext_update_datasource_updates_existing_data_source(
-    in_memory_runtime_context: EphemeralDataContext,
-    pandas_enabled_datasource_config: dict,
-) -> None:
-    """
-    What does this test and why?
-
-    Updating a Data Source should update a Data Source
-    """
-    context = in_memory_runtime_context
-
-    name = context.list_datasources()[0]["name"]
-    pandas_enabled_datasource_config["name"] = name
-    data_connectors = pandas_enabled_datasource_config["data_connectors"]
-    pandas_enabled_datasource_config.pop("class_name")
-    datasource = Datasource(**pandas_enabled_datasource_config)
-
-    assert name in context.datasources
-    cached_datasource = context.datasources[name]
-    assert cached_datasource.data_connectors.keys() != data_connectors.keys()  # type: ignore[union-attr]
-
-    context.update_datasource(datasource)
-
-    retrieved_datasource = context.get_datasource(datasource_name=name)
-    assert retrieved_datasource.data_connectors.keys() == data_connectors.keys()  # type: ignore[union-attr]
-
-
-@pytest.mark.unit
-def test_BaseDataContext_update_datasource_fails_when_datsource_does_not_exist(
-    in_memory_runtime_context: EphemeralDataContext,
-    pandas_enabled_datasource_config: dict,
-) -> None:
-    """
-    What does this test and why?
-
-    Updating a data source that does not exist should create a new data source.
-    """
-    context = in_memory_runtime_context
-
-    name = pandas_enabled_datasource_config["name"]
-    pandas_enabled_datasource_config.pop("class_name")
-    datasource = Datasource(**pandas_enabled_datasource_config)
-
-    assert name not in context.datasources
-
-    with pytest.raises(DatasourceNotFoundError):
-        context.update_datasource(datasource)
-
-


just confirming that these are being removed because we don't support these methods on the data context.

Yup, I think there were 2 files that tested all this crud on context, but since we'll remove these methods in https://greatexpectations.atlassian.net/browse/V1-321, I don't think there's sense in taking the time to port them over to FDS

joshua-stauffer · 2024-05-03T20:27:56Z

tests/validator/test_validator.py

-    except AssertionError as e:
-        result = e
+    result = Validator(
+        execution_engine=PandasExecutionEngine(),


interesting that this works. it's preferable to accessing a private attr on the datasource, but makes me wonder what our ideal pattern would be.

joshua-stauffer · 2024-05-03T20:30:00Z

tests/validator/test_validator.py

+    # TODO: Convert this to actually mock an exception being thrown
+    # graph = ValidationGraph(execution_engine=execution_engine)
+    # graph.build_metric_dependency_graph = mock_error  # type: ignore[method-assign]


is this going to be follow up work? is it scoped somewhere?

Thanks for keeping me honest! https://greatexpectations.atlassian.net/browse/V1-322
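For the TODO above, one common way to "actually mock an exception being thrown" is `unittest.mock` with `side_effect`. This is a minimal sketch only: `FakeGraph` and `resolve` are hypothetical stand-ins invented for illustration and are not Great Expectations classes or the code in this PR.

```python
from unittest import mock


class FakeGraph:
    """Hypothetical stand-in for a dependency graph (not a real GX class)."""

    def build(self) -> None:
        return None


def resolve(graph: FakeGraph) -> str:
    """Hypothetical caller that turns a build failure into a status string."""
    try:
        graph.build()
    except RuntimeError as exc:
        return f"failed: {exc}"
    return "ok"


# Without patching, the build succeeds.
normal_result = resolve(FakeGraph())

# side_effect makes the patched method raise when it is called,
# so the error-handling path in resolve() is exercised for real.
with mock.patch.object(
    FakeGraph, "build", side_effect=RuntimeError("boom")
) as mocked_build:
    failed_result = resolve(FakeGraph())
    mocked_build.assert_called_once()

print(normal_result, "|", failed_result)
```

Patching on the class rather than assigning to an instance attribute avoids the `# type: ignore[method-assign]` that the commented-out lines would need, and the patch is undone automatically when the `with` block exits.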

joshua-stauffer

looks good! 🦖

tyler-hoffman and others added 8 commits May 1, 2024 15:57

[MAINTENANCE] Remove test_yaml_config and all integration tests that used it

33b2aee

Delete references to removed files

c3f1ef9

Remove more references to deleted files

e4998d9

Remove more references to deleted files

dffa8da

Merge branch 'develop' into m/v1-25/remove-test-yaml-config

3de3c4a

Remove more references to deleted files

2782bf6

Remove unneeded test files

b6bc253

Remove references to block style in a test file

e5499c1

Remove another test

78f14ca

tyler-hoffman and others added 3 commits May 2, 2024 10:21

Convert test from block style to fds

a259928

Remove unused subclass

660e2d7

Merge branch 'develop' into m/v1-25/remove-block-style-datasource

7d9517f

tyler-hoffman changed the title ~~M/v1 25/remove block style datasource~~ [MAINTENANCE] Remove block style datasource May 2, 2024

Remove more block style

e621322

tyler-hoffman changed the title ~~[MAINTENANCE] Remove block style datasource~~ [MAINTENANCE] Remove some references to block style datasource May 2, 2024

tyler-hoffman and others added 13 commits May 2, 2024 11:26

Back out some logging

ccf3b14

Merge branch 'develop' into m/v1-25/remove-block-style-datasource

65b01c2

Remove reference in another test

80d21f8

Convert util to use fds

279faf1

Account for sqlite

70f4dfd

Fix name

46a2ca1

Merge branch 'develop' into m/v1-25/remove-block-style-datasource

0545e61

Remove v3 test file

323faa4

Revert "Remove v3 test file"

10e2dae

This reverts commit 323faa4.

Catch exceptions in setup

84c5b2e

Skip adding asset

92ea953

Actually don't even make the datasource

bfed853

Remoev that v3 test file again

9d651d1

Remove commented out code

f57b26c

tyler-hoffman commented May 3, 2024


tyler-hoffman added 4 commits May 3, 2024 12:18

Revert "Remoev that v3 test file again"

4c730ec

This reverts commit 9d651d1.

Convert more from block style to fluent

c28d846

Remove unused fixture

352962a

Remove tests around datasource management on context since they used block style datasources

ed6137a

tyler-hoffman commented May 3, 2024


tyler-hoffman and others added 5 commits May 3, 2024 14:50

Remove tests around datasource management on context since they used block style datasources

19394ff

Remove fixture

d205f93

Merge branch 'develop' into m/v1-25/remove-block-style-datasource

a4624ac

Make mypy happy

c684af0

Merge branch 'develop' into m/v1-25/remove-block-style-datasource

04c2d0b

joshua-stauffer reviewed May 3, 2024


tyler-hoffman enabled auto-merge May 3, 2024 20:26

joshua-stauffer reviewed May 3, 2024


joshua-stauffer approved these changes May 3, 2024


tyler-hoffman added this pull request to the merge queue May 3, 2024

Merged via the queue into develop with commit 93ee4a2 May 3, 2024
70 checks passed

tyler-hoffman deleted the m/v1-25/remove-block-style-datasource branch May 3, 2024 20:58
