Fix type annotations for `fill_nan()` #4445

lorenzwalthert · 2022-08-16T16:10:53Z

I added the types as suggested in #3066 (comment) for series, data frame and expression. tested it but for some reason, when comparing series and expressions with their reference, I could not get them to be equal. I had to convert to a tuple to pass the equality check. E.g. check this:

import polars as pl
lazy = pl.DataFrame({"a": [1.0, np.nan, 3.0]}).lazy().fill_nan(None).collect()["a"]
assert lazy.series_equal(pl.Series("a", [1.0, None, 3.0]))

They have the same printed output but they are not identical.
For eager, it works:

lazy = pl.DataFrame({"a": [1.0, np.nan, 3.0]}).fill_nan(None)
assert lazy.frame_equal(pl.DataFrame({"a": [1.0, None, 3.0]}))

Not sure the comparison method needs to be adapted too, but I considered that out of scope for this PR.

matteosantama · 2022-08-16T17:47:01Z

py-polars/tests/test_lazy.py

@@ -589,6 +589,7 @@ def test_fill_nan() -> None:
        .collect()["a"]
        .series_equal(pl.Series("a", [1.0, 2.0, 3.0]))
    )
+    assert tuple(df.lazy().fill_nan(None).collect()["a"]) == (1.0, None, 3.0)


If you don't convert to tuple what is the mismatch? Is it a datatype problem?

The print methods are identical, but comparision with frame_assert_equal() yields not identical. So I don't really understand. You can also c/p the example I posted in my initial PR description to see the problem and maybe you'll find a hint...

Ah, I see. This is a problem with Series.series_equal.

In [25]: s = pl.Series("a", [1.0, None, 3.0]) In [26]: s.series_equal(s) Out[26]: False

I will open a separate issue.

@lorenzwalthert Series.series_equal has a null_equal parameter that defaults to False. Can you change the assert statement to

assert ( df.lazy() .fill_nan(None) .collect()["a"] .series_equal(pl.Series("a", [1.0, None, 3.0]), null_equal=True) )

Ups of course. Did not know that, thanks.

So if null_equal defaults to True for data frames in frame_equal(), shouldn't it also have the same default value for polars.series?

Yes, it should. That's more consistent.

…rame and series and expression

ritchie46 · 2022-08-18T06:56:54Z

Thanks @lorenzwalthert 👍

github-actions bot added the python Related to Python Polars label Aug 16, 2022

lorenzwalthert force-pushed the type-annotation-fill-nan branch from b63f76a to cd4365c Compare August 16, 2022 16:12

lorenzwalthert changed the title ~~Fix type annotation for fill_nan()~~ Fix type annotations for fill_nan() Aug 16, 2022

matteosantama reviewed Aug 16, 2022

View reviewed changes

doc[python]: add allowed type None to method .fill_nan() for data f…

21720e2

…rame and series and expression

lorenzwalthert force-pushed the type-annotation-fill-nan branch from cd4365c to 21720e2 Compare August 17, 2022 12:40

ritchie46 merged commit fca8433 into pola-rs:master Aug 18, 2022

lorenzwalthert deleted the type-annotation-fill-nan branch August 18, 2022 07:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix type annotations for `fill_nan()` #4445

Fix type annotations for `fill_nan()` #4445

lorenzwalthert commented Aug 16, 2022

matteosantama Aug 16, 2022

lorenzwalthert Aug 16, 2022 •

edited

matteosantama Aug 16, 2022

matteosantama Aug 16, 2022

lorenzwalthert Aug 17, 2022

lorenzwalthert Aug 17, 2022

ritchie46 Aug 18, 2022

ritchie46 commented Aug 18, 2022

Fix type annotations for fill_nan() #4445

Fix type annotations for fill_nan() #4445

Conversation

lorenzwalthert commented Aug 16, 2022

matteosantama Aug 16, 2022

Choose a reason for hiding this comment

lorenzwalthert Aug 16, 2022 • edited

Choose a reason for hiding this comment

matteosantama Aug 16, 2022

Choose a reason for hiding this comment

matteosantama Aug 16, 2022

Choose a reason for hiding this comment

lorenzwalthert Aug 17, 2022

Choose a reason for hiding this comment

lorenzwalthert Aug 17, 2022

Choose a reason for hiding this comment

ritchie46 Aug 18, 2022

Choose a reason for hiding this comment

ritchie46 commented Aug 18, 2022

Fix type annotations for `fill_nan()` #4445

Fix type annotations for `fill_nan()` #4445

lorenzwalthert Aug 16, 2022 •

edited