193 append test refactor #217
Conversation
As a step towards "maturing" the astro DAG authoring project, we must rewrite our tests to ensure that every integration test runs against every database. This step will simultaneously reduce the number of tests we need to maintain, make testing much simpler as we add new databases, and make a future refactor much simpler, since we can ensure proper coverage. To do this, we will take advantage of two pytest features: fixtures and parametrization. For this ticket, we will update the `append` function.

**Acceptance criteria**

Have a single test file validating `append` across all databases.

1. Integration tests should be marked with `pytest.mark.integration`:
   * Each test should work across all databases
   * Validate both `Table` and `TempTable` as inputs
   * Use `test_utils.run_dag`
   * Use pytest `fixtures` and `parametrize` to have a single main test that validates `append` across multiple databases

The tests should validate these scenarios:

1. appending two tables against a single column
2. appending two tables against multiple columns
3. appending against all fields by not specifying fields
4. appending with casting
5. appending with some casted fields and some uncasted fields
6. appending tables from two different databases (should fail)
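The fixture-plus-parametrize layout the acceptance criteria ask for can be sketched roughly as below. This is a minimal sketch, not the PR's actual code: `sql_server` appears in this PR's diff, but the database list, mode list, and test body here are assumptions for illustration.

```python
import pytest

# Assumed axis 1: every connection id the suite supports (illustrative).
SUPPORTED_DBS = ["postgres_conn", "sqlite_conn", "snowflake_conn"]

# Assumed axis 2: the append scenarios from the acceptance criteria.
APPEND_MODES = [
    "single_column",
    "multiple_columns",
    "all_fields",
    "with_cast",
    "mixed_cast",
]


@pytest.fixture
def sql_server(request):
    # With indirect=True below, each parametrized value is routed
    # through this fixture via request.param.
    yield request.param


@pytest.mark.integration
@pytest.mark.parametrize("sql_server", SUPPORTED_DBS, indirect=True)
@pytest.mark.parametrize("mode", APPEND_MODES)
def test_append(sql_server, mode):
    # One test body now covers len(SUPPORTED_DBS) * len(APPEND_MODES)
    # collected cases; the real body would load files, call append,
    # and run the mode-specific validator.
    assert sql_server in SUPPORTED_DBS and mode in APPEND_MODES
```

Stacking two `parametrize` decorators yields the cross product of both axes, which is what makes a single main test sufficient.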
Codecov Report
@@ Coverage Diff @@
## main #217 +/- ##
==========================================
- Coverage 88.33% 87.96% -0.38%
==========================================
Files 61 60 -1
Lines 3583 3381 -202
Branches 317 317
==========================================
- Hits 3165 2974 -191
+ Misses 376 365 -11
Partials 42 42
Continue to review full report at Codecov.
@pytest.fixture
def append_params(request):
I think this should be expanded to test cases instead of fixtures?
I think this will make it much easier to add more test cases with less boilerplate. We can see if it becomes a problem, but this ultimately makes it easier to create test grids.
@dimberman I agree with @utkarsharma2 on this one. I don't think we should aim to have a single test per operator. We already have a few dimensions along which we are using parametrization:
- multiple databases, when relevant
- multiple files, when relevant
- multiple file locations, when relevant
- multiple file types, when relevant
I strongly recommend we do not use parametrization for groups of parameters sent to our tasks/operators.
For many operators, I don't think we need to test all the possible configurations of parameters with all the databases.
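A quick back-of-envelope calculation illustrates the concern: parametrization axes multiply, so crossing groups of operator parameters with every database blows up the collected test count. All numbers below are illustrative, not taken from the actual suite.

```python
# Illustrative axis sizes (assumed, not measured from this repo).
databases = 4        # e.g. postgres, sqlite, snowflake, bigquery
file_types = 4
file_locations = 3
param_groups = 6     # groups of kwargs sent to the task/operator

# Stacked parametrize decorators collect the full cross product.
per_operator_grid = databases * file_types * file_locations * param_groups
print(per_operator_grid)  # 288 collected cases for a single operator
```

Restricting parameter-group variations to one representative database keeps the grid proportional to the sum of the axes rather than their product.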
app_param, validate_append = append_params

with sample_dag:
    load_main = aql.load_file(
Can we move loading files into `tmp_table`? Is there any use case with just `tmp_table` without loading it into the DB?
For now, using `load_file` ensures that the actual function is being passed info from a previous task instead of a table object.
    ],
    indirect=True,
)
def test_append_on_tables_on_different_db(sample_dag, sql_server):
Should this test case be renamed, or should we add more DBs?
Why would it need to be renamed? It's testing a piece of code that's not DB dependent
@dimberman +116 −586, nicely done!
    }, validate_basic
if mode == "all_fields":
    return {}, validate_append_all
if mode == "with_caste":
If this test runs against all the databases, we probably do not need to run "basic" and "cast_only" against all the other databases.
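The mode dispatch in the diff above can be sketched as a plain function. This is a hedged reconstruction: `validate_basic`, `validate_append_all`, and the `"all_fields"`/`"with_caste"` mode strings appear in this PR's diff, but the kwargs, the remaining mode names, and the stub validators are assumptions for illustration.

```python
# Stub validators standing in for the real post-append checks.
def validate_basic(table):
    return True


def validate_append_all(table):
    return True


def validate_append_cast(table):
    return True


def append_params_for(mode):
    """Map a test mode to the kwargs passed to append plus the
    validation callback run afterwards (illustrative kwargs)."""
    if mode == "basic":
        return {"columns": ["id"]}, validate_basic
    if mode == "all_fields":
        return {}, validate_append_all
    if mode == "with_caste":
        return {"casted_columns": {"age": "int"}}, validate_append_cast
    raise ValueError(f"unknown append mode: {mode!r}")
```

Keeping the dispatch in one function means adding a scenario is one new branch plus one validator, rather than a new test.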
tmp_table_1 = TempTable(conn_id="postgres_conn")
tmp_table_2 = TempTable(conn_id="sqlite_conn")
@dimberman Are we tearing down these tables?
* merged append tests
* fix invalid test
* fix different db test