TEST-#6996: Update tests in `test_io.py` #6997

anmyachev · 2024-03-04T12:50:42Z

What do these changes do?

first commit message and PR title follow format outlined here

NOTE: If you edit the PR title to match this format, you need to add another commit (even if it's empty) or amend your last commit for the CI job that checks the PR title to pick up the new PR title.
passes flake8 modin/ asv_bench/benchmarks scripts/doc_checker.py
passes black --check modin/ asv_bench/benchmarks scripts/doc_checker.py
signed commit with git commit -s
Resolves #6996 (Update tests in test_io.py)
tests added and passing
module layout described at docs/development/architecture.rst is up-to-date

anmyachev · 2024-03-04T12:52:13Z

modin/pandas/test/test_io.py

-    @pytest.mark.parametrize("sep", [None, "_", ",", ".", "\n"])
-    @pytest.mark.parametrize("delimiter", ["_", ",", ".", "\n"])


`sep` and `delimiter` mean the same thing (`delimiter` is an alias for `sep`). Passing both at the same time raises an exception, so I moved that case into a separate test.
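For reference, a minimal sketch of the behavior in plain pandas (not the Modin test itself). The CSV text and variable names are made up for illustration; only `pd.read_csv` and its `sep`/`delimiter` arguments come from the diff.

```python
import io

import pandas as pd

# Hypothetical two-column CSV used only for this illustration.
csv_text = "a,b\n1,2\n"

# Either argument alone works and gives the same frame.
df_sep = pd.read_csv(io.StringIO(csv_text), sep=",")
df_delim = pd.read_csv(io.StringIO(csv_text), delimiter=",")
same_result = df_sep.equals(df_delim)

# Passing both at once is rejected by pandas with a ValueError.
try:
    pd.read_csv(io.StringIO(csv_text), sep=",", delimiter=",")
    both_error = None
except ValueError as err:
    both_error = str(err)
```

This is why parametrizing over both arguments in one test mostly exercises the error path rather than the parsing itself.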

anmyachev · 2024-03-04T12:54:42Z

modin/pandas/test/test_io.py

@@ -303,7 +311,7 @@ def comparator(df1, df2):
    @pytest.mark.parametrize("header", ["infer", None, 0])
    @pytest.mark.parametrize("index_col", [None, "col1"])
    @pytest.mark.parametrize(
-        "names", [lib.no_default, ["col1"], ["c1", "c2", "c3", "c4", "c5", "c6", "c7"]]


The last column is missing from this `names` list: the case is there to reproduce the same error as in pandas. Testing it against every combination of the other parameters in this test is redundant.
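A rough sketch of the cost involved: stacked `pytest.mark.parametrize` decorators generate the Cartesian product of their values. The count below covers only the three parameters visible in this hunk (the real test may have more), uses a string as a stand-in for `lib.no_default`, and assumes the seven-name list is the case being taken out.

```python
from itertools import product

# Values copied from the hunk; "<no_default>" stands in for lib.no_default.
header = ["infer", None, 0]
index_col = [None, "col1"]
names = ["<no_default>", ["col1"], ["c1", "c2", "c3", "c4", "c5", "c6", "c7"]]

# Number of generated cases with and without the seven-name list.
before = len(list(product(header, index_col, names)))
after = len(list(product(header, index_col, names[:2])))
```

Each extra value in one decorator multiplies the number of cases contributed by all the others.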

anmyachev · 2024-03-04T12:56:01Z

modin/pandas/test/test_io.py

    # Internal parameters tests
-    @pytest.mark.parametrize("use_str_data", [True, False])
-    @pytest.mark.parametrize("engine", [None, "python", "c"])


Many parameters are not implemented for the python engine (exceptions are thrown), so I moved those cases to the separate tests above.
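A minimal sketch of what those exceptions look like in plain pandas, using the two options named in the new test functions (`float_precision` and `low_memory`). The helper `read_error` and the CSV text are hypothetical, added only for this illustration.

```python
import io

import pandas as pd

# Hypothetical CSV used only for this illustration.
csv_text = "a,b\n1.5,2.5\n"


def read_error(**kwargs):
    """Return the ValueError message raised by read_csv, or None if it succeeds."""
    try:
        pd.read_csv(io.StringIO(csv_text), **kwargs)
    except ValueError as err:
        return str(err)
    return None


# Non-default values of these options are rejected by the python engine.
python_float_precision = read_error(engine="python", float_precision="high")
python_low_memory = read_error(engine="python", low_memory=False)

# The same options are accepted by the C engine.
c_float_precision = read_error(engine="c", float_precision="high")
c_low_memory = read_error(engine="c", low_memory=False)
```

Keeping these cases in their own tests means the main parametrized test does not have to branch on the engine to decide whether an exception is expected.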

anmyachev · 2024-03-04T12:57:11Z

modin/pandas/test/test_io.py

@@ -2756,20 +2767,19 @@ def test_fwf_file_usecols(self, make_fwf_file, usecols):
        "dtype_backend", [lib.no_default, "numpy_nullable", "pyarrow"]
    )
    def test_read_fwf_dtype_backend(self, make_fwf_file, dtype_backend):
-        with ensure_clean(".fwf") as unique_filename:
-            make_fwf_file(filename=unique_filename)


The make_fwf_file fixture doesn't actually take a filename as a parameter.

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

YarShev · 2024-03-04T21:46:48Z

modin/pandas/test/test_io.py

-    def test_read_csv_delimiters(
-        self, make_csv_file, sep, delimiter, decimal, thousands
-    ):
+    def test_read_csv_delimiters(self, make_csv_file, sep, decimal, thousands):


Suggested change

-    def test_read_csv_delimiters(self, make_csv_file, sep, decimal, thousands):
+    def test_read_csv_seps(self, make_csv_file, sep, decimal, thousands):

Should we rename this test for consistency with the parameter? Or just rename the parameter `sep` to `delimiter`?

YarShev · 2024-03-04T21:49:57Z

modin/pandas/test/test_io.py

@@ -794,75 +802,64 @@ def test_read_csv_error_handling(self, on_bad_lines):
            on_bad_lines=on_bad_lines,
        )

+    @pytest.mark.parametrize("float_precision", [None, "high", "legacy", "round_trip"])
+    def test_python_engine_float_precision_except(self, float_precision):


Can you elaborate on why python engine only?

The python engine was already tested in test_read_csv_internal; I just moved it to a separate test. Reason: #6997 (comment)

YarShev · 2024-03-04T21:50:07Z

modin/pandas/test/test_io.py

+        )
+
+    @pytest.mark.parametrize("low_memory", [False, True])
+    def test_python_engine_low_memory_except(self, low_memory):


The python engine was already tested in test_read_csv_internal; I just moved it to a separate test. Reason: #6997 (comment)

YarShev · 2024-03-04T21:52:26Z

modin/pandas/test/test_io.py

@@ -3095,6 +3097,12 @@ def test_read_xml(self):
   <degrees>360</degrees>
   <sides/>
 </row>
+ <row>


Why is this added?

The example was outdated, so I took the updated one from https://pandas.pydata.org/docs/reference/api/pandas.read_xml.html#pandas-read-xml.

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

anmyachev commented Mar 4, 2024

TEST-modin-project#6996: Update tests in 'test_io.py'

0467ee9

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

anmyachev force-pushed the issue6996 branch from 3a750d6 to 0467ee9 Compare March 4, 2024 13:01

fixes

8364995

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

anmyachev marked this pull request as ready for review March 4, 2024 14:52

anmyachev requested review from devin-petersohn, mvashishtha, RehanSD, YarShev, vnlitvinov, dchigarev and a team as code owners March 4, 2024 14:52

YarShev reviewed Mar 4, 2024

address review comments

804b798

Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>

YarShev approved these changes Mar 5, 2024

YarShev merged commit 0b6c8e4 into modin-project:master Mar 5, 2024
37 checks passed

anmyachev deleted the issue6996 branch March 5, 2024 08:59

This pull request was closed.
