CLN: avoid values_from_object in Series #32426

jbrockmendel · 2020-03-04T00:59:24Z

jreback

looks good, some comments

jreback · 2020-03-04T14:15:06Z

pandas/core/frame.py

@@ -7803,11 +7804,11 @@ def _reduce(
        self, op, name, axis=0, skipna=True, numeric_only=None, filter_type=None, **kwds
    ):

-        dtype_is_dt = self.dtypes.apply(lambda x: x.kind == "M")
+        dtype_is_dt = self.dtypes.apply(lambda x: x.kind == "M" or is_period_dtype(x))


Can you use needs_i8_conversion here instead (like you do below)? This is a non-standard usage.

The trouble is that we don't want to include td64 (timedelta64), which needs_i8_conversion would also match.

jreback · 2020-03-08T15:46:37Z

pandas/core/frame.py

@@ -7803,11 +7804,11 @@ def _reduce(
        self, op, name, axis=0, skipna=True, numeric_only=None, filter_type=None, **kwds
    ):

-        dtype_is_dt = self.dtypes.apply(lambda x: x.kind == "M")
+        dtype_is_dt = self.dtypes.apply(lambda x: x.kind == "M" or is_period_dtype(x))


OK, how about using is_datetime64_any_dtype then? This is a non-obvious pattern.
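To make the trade-off in this thread concrete, here is a minimal sketch comparing the checks under discussion on a handful of dtypes. The helper names `kind_is_M`, `kind_is_datetimelike`, and `is_period` are hypothetical and exist only for this illustration; `is_datetime64_any_dtype` is the public function from `pandas.api.types`. This is not the code that was merged.

```python
import numpy as np
import pandas as pd
from pandas.api.types import is_datetime64_any_dtype

# A few dtypes a DataFrame column might have.
dtypes = {
    "datetime64": np.dtype("M8[ns]"),
    "datetime64_tz": pd.DatetimeTZDtype(tz="UTC"),
    "timedelta64": np.dtype("m8[ns]"),
    "period": pd.PeriodDtype("D"),
    "float64": np.dtype("float64"),
}


def kind_is_M(dtype):
    # The original check in _reduce: datetime64 only (naive or tz-aware).
    return dtype.kind == "M"


def kind_is_datetimelike(dtype):
    # Hypothetical broader check: also matches timedelta64 (kind "m").
    return dtype.kind in "mM"


def is_period(dtype):
    # Period dtypes report kind "O", so a kind-based check cannot see them.
    return isinstance(dtype, pd.PeriodDtype)


results = {
    name: {
        "kind_is_M": kind_is_M(dtype),
        "kind_is_datetimelike": kind_is_datetimelike(dtype),
        "is_datetime64_any_dtype": is_datetime64_any_dtype(dtype),
        "is_period": is_period(dtype),
    }
    for name, dtype in dtypes.items()
}

for name, row in results.items():
    print(name, row)
```

In this sketch, `is_datetime64_any_dtype` agrees with the `kind == "M"` check on all five dtypes, so it does not match the period dtype on its own.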


jreback · 2020-03-08T15:49:05Z

pandas/core/nanops.py

        # changing timedelta64/datetime64 to int64 needs to happen after
        #  finding `mask` above
-        values = getattr(values, "asi8", values)
-        values = values.view(np.int64)
+        if isinstance(values, np.ndarray):


Why is this case still here? When is this actually an ndarray (or, conversely, when is it a DTI/TDI)? It is non-obvious how we get to this point.
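For readers unfamiliar with the two cases the question refers to, the following sketch shows both ways of getting int64 values out of datetime data: viewing a datetime64 ndarray as int64, and reading `asi8` from a DatetimeIndex. The helper `to_i8` is hypothetical and only mirrors the shape of the `isinstance` branch in the hunk; it is not the nanops implementation.

```python
import numpy as np
import pandas as pd

arr = np.array(["2020-03-04", "NaT"], dtype="M8[ns]")
idx = pd.DatetimeIndex(arr)


def to_i8(values):
    """Return the int64 representation of datetime-like values."""
    if isinstance(values, np.ndarray):
        # A datetime64 ndarray can be reinterpreted as int64 without a copy.
        return values.view(np.int64)
    # DatetimeIndex/TimedeltaIndex expose the same integers via .asi8.
    return values.asi8


from_ndarray = to_i8(arr)
from_index = to_i8(idx)

# NaT is stored as the minimum int64 value in both representations.
nat_as_int = np.iinfo(np.int64).min
print(from_ndarray, from_index, nat_as_int)
```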


jreback · 2020-03-11T02:29:19Z

pandas/tests/frame/test_analytics.py

@@ -875,11 +875,6 @@ def test_mean_datetimelike(self):
        expected = pd.Series({"A": 1.0, "C": df.loc[1, "C"]})
        tm.assert_series_equal(result, expected)

-    @pytest.mark.xfail(


I guess you should technically have a whatsnew note, as this 'bug' is fixed (do it in a follow-on).

jreback · 2020-03-11T02:29:37Z

thanks

jorisvandenbossche · 2020-03-16T13:23:16Z

pandas/core/series.py

@@ -2055,7 +2055,7 @@ def idxmax(self, axis=0, skipna=True, *args, **kwargs):
        nan
        """
        skipna = nv.validate_argmax_with_skipna(skipna, args, kwargs)
-        i = nanops.nanargmax(com.values_from_object(self), skipna=skipna)
+        i = nanops.nanargmax(self._values, skipna=skipna)


nanargmax/nanargmin expect an ndarray, but with this change the input is no longer guaranteed to be one. I reported this as #32749.

So those lines should either be reverted, or another "convert to ndarray" function should be used (or nanargmax/nanargmin could be rewritten to support EAs, but personally I think it is much cleaner to keep those algos based on numpy arrays)
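A small illustration of the concern, assuming a reasonably recent pandas: `Series._values` is a NumPy array for NumPy-backed dtypes, but an extension array for extension dtypes such as the nullable `Int64`. The explicit `to_numpy` call stands in for the "convert to ndarray" step mentioned above; it is one possible conversion, not the one the maintainers chose.

```python
import numpy as np
import pandas as pd

plain = pd.Series([1.0, 3.0, 2.0])
nullable = pd.Series([1, 3, None], dtype="Int64")

# NumPy-backed Series: _values is already an ndarray.
plain_is_ndarray = isinstance(plain._values, np.ndarray)

# Extension-dtype Series: _values is an ExtensionArray, not an ndarray.
nullable_is_ndarray = isinstance(nullable._values, np.ndarray)

# Explicit conversion before handing the data to an ndarray-only algorithm.
as_ndarray = nullable.to_numpy(dtype="float64", na_value=np.nan)
position = int(np.nanargmax(as_ndarray))

print(plain_is_ndarray, nullable_is_ndarray, as_ndarray, position)
```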

@jbrockmendel can you respond to this?

> (or nanargmax/nanargmin could be rewritten to support EAs, but personally I think it is much cleaner to keep those algos based on numpy arrays)

I'd be fine with either of these options. I'd probably prefer both, actually: an EA-supporting public method and an ndarray-only private method for each of the relevant nanops funcs.

jorisvandenbossche · 2020-04-24T08:16:03Z

@jbrockmendel another problem with this PR is that it enabled "mean" for Period dtype (but only for DataFrames), even though we had long discussions before (when initially adding mean support for datetimelikes) that ended in not supporting mean for period (see #24757).
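As a quick way to check the behaviour being discussed on a given pandas version, the sketch below takes the mean of a datetime Series and then attempts the same on a period Series. It uses Series rather than DataFrame for simplicity, so it does not reproduce the DataFrame-only regression itself, and it records the period outcome instead of assuming it, since that may differ between versions.

```python
import pandas as pd

dates = pd.Series(pd.to_datetime(["2020-01-01", "2020-01-03"]))
periods = pd.Series(pd.period_range("2020-01-01", periods=2, freq="D"))

# mean is supported for datetime64 data and returns a Timestamp.
date_mean = dates.mean()

# For period data, record whether mean is allowed instead of assuming.
try:
    periods.mean()
    period_mean_supported = True
except TypeError:
    period_mean_supported = False

print(date_mean, period_mean_supported)
```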

jorisvandenbossche · 2020-04-24T09:45:45Z

Opened a PR for that in the meantime: #33758

jbrockmendel added 3 commits March 3, 2020 16:58

CLN: avoid values_from_object in Series

2d8a274

Merge branch 'master' of https://github.com/pandas-dev/pandas into se…

ec7d005

…ries-values_from_object

Fix test failures

cf6466b

jreback requested changes Mar 4, 2020

Merge branch 'master' of https://github.com/pandas-dev/pandas into se…

29c785b

…ries-values_from_object

jbrockmendel mentioned this pull request Mar 5, 2020

CLN: use _values_for_argsort for join_non_unique, join_monotonic #32467

Merged

jbrockmendel added 3 commits March 5, 2020 11:02

Merge branch 'master' of https://github.com/pandas-dev/pandas into se…

45a278f

…ries-values_from_object

Flip condition

fb6c6ff

mypy fixup

200ac68

jreback added the Clean label Mar 8, 2020

jreback added this to the 1.1 milestone Mar 8, 2020

jreback requested changes Mar 8, 2020

jbrockmendel added 3 commits March 8, 2020 12:29

Merge branch 'master' of https://github.com/pandas-dev/pandas into se…

09c7354

…ries-values_from_object

update per comments

da451a1

Merge branch 'master' of https://github.com/pandas-dev/pandas into se…

1017b08

…ries-values_from_object

jreback approved these changes Mar 11, 2020

jreback reviewed Mar 11, 2020

jreback merged commit 8c38283 into pandas-dev:master Mar 11, 2020

jbrockmendel deleted the series-values_from_object branch March 11, 2020 02:51

jorisvandenbossche reviewed Mar 16, 2020

SeeminSyed pushed a commit to CSCD01-team01/pandas that referenced this pull request Mar 22, 2020

CLN: avoid values_from_object in Series (pandas-dev#32426)

261c925

jorisvandenbossche mentioned this pull request Apr 24, 2020

REGR: disallow mean of period column again #33758

Merged

simonjayhawkins mentioned this pull request Sep 7, 2020

BUG: regression in error raised by idxmin/idxmax for extension dtypes #32749

Closed
