New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FEAT-#6906: Update to pandas 2.2.* #6907
Conversation
d906fdf
to
a67f209
Compare
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
This reverts commit 65879e4.
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
fuzzy / test-fuzzydata (3.9, ray) failed |
Yes, I mentioned it in the description. Generally speaking, I'm waiting for a response from fuzzydata owner. |
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Looks good to me (besides fuzzy tests, we may want to disable them though if we won't get a response from them in reasonable time) |
modin/experimental/core/execution/native/implementations/hdk_on_native/io/io.py
Outdated
Show resolved
Hide resolved
@@ -185,7 +192,7 @@ def get_col_names(): | |||
usecols_md = cls._prepare_pyarrow_usecols(kwargs) | |||
|
|||
po = ParseOptions( | |||
delimiter="\\s+" if kwargs["delim_whitespace"] else delimiter, | |||
delimiter="\\s+" if kwargs["delim_whitespace"] is True else delimiter, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why can't we leave the previous code?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lib.no_default
is equivalent for False
value
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You probably mean the opposite. I see, we can leave the new change as is.
from pandas._libs import lib
a = lib.no_default
def f(a):
if a:
print("A")
else:
print("B")
f(a)
# A
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Although, maybe we should make this condition (and all related ones) more explicit?
delimiter="\\s+" if kwargs["delim_whitespace"] and kwargs["delim_whitespace"] is not lib.no_default else delimiter
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You probably mean the opposite
Let me clarify. lib.no_default
is equivalent for True value in if condition, but in pandas, for particular method it's equivalent for False value.
Although, maybe we should make this condition (and all related ones) more explicit?
It seems to me that this is an additional burden on the reader of the code.
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
This reverts commit 2b0fef5.
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @anmyachev , left a couple of structural comments
@@ -31,6 +31,83 @@ | |||
from pandas._typing import npt | |||
|
|||
|
|||
@_inherit_docstrings(pandas.core.arrays.arrow.ListAccessor) | |||
class ListAccessor(ClassLogger): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Similarly here, is it possible to move the _default_to_pandas
into the query compiler?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Moved
Signed-off-by: Anatoly Myachev <anatoly.myachev@intel.com>
What do these changes do?
Main changes (what's new in Pandas):
list
andstruct
(and simple tests for them).case_when
.TODO: add a simple test.TODO: fix fuzzydata tests. These tests are disabled due tosqlalchemy
package incompatibility. Update SQLAlchemy pin from >=1.4.31,<2.0.0 -> >=2.0.0 suhailrehman/fuzzydata#11flake8 modin/ asv_bench/benchmarks scripts/doc_checker.py
black --check modin/ asv_bench/benchmarks scripts/doc_checker.py
git commit -s
docs/development/architecture.rst
is up-to-date