ENH: EA._cast_pointwise_result #62105

jbrockmendel · 2025-08-14T15:38:38Z

closes REF: make _cast_pointwise_result an EA method #59895 (Replace xxxx with the GitHub issue number)
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

Still have a bunch of pyarrow tests involving duration/timestamp dtypes failing. Also need to update/remove the test files' _cast_pointwise_result methods.

xref #56430, could close that with a little effort.

~~I suspect a bunch of "pyarrow dtype retention" tests are solved by this, will update as I check.~~ Nope!

jbrockmendel · 2025-08-14T16:12:55Z

The pyarrow duration stuff is caused by an upstream issue apache/arrow#40620

jbrockmendel · 2025-08-15T21:21:53Z

@rhshadrach i think you had a recent issue/pr involving retaining nullable dtypes in a .map?

rhshadrach

Really great improvement here. It seems to me there are two potential situations we might find ourselves in: (a) take the dtype of self into account or (b) don't take the dtype of self into account. I highlighted one specific example below where I think this might go awry. The base implementation is doing (b) whereas the subclasses are doing various degrees of (a). Understand that isn't being introduced here, but I think the long term goal is to make this more consistent?

~~For now, do we want to setup the framework to separate these two cases out somehow - perhaps an argument to _cast_pointwise_result?~~

I see now that this is just preserving the dtype when possible. I think we don't need two different cases as I first imagined.

pandas/core/arrays/arrow/array.py

pandas/core/arrays/categorical.py

pandas/tests/extension/decimal/test_decimal.py

pandas/core/arrays/arrow/array.py

rhshadrach

lgtm

mroeschke · 2025-08-20T16:00:25Z

Thanks @jbrockmendel

jorisvandenbossche · 2025-09-10T16:10:45Z

@jbrockmendel is there a reason you removed the _from_scalars EA interface method (which I think was added after quite some discussions, #33254 / #38315 / #53089). Was it no longer used / needed?
And what's the difference exactly with _cast_pointwise_result? That one is not strict? (the base EA implementation of it does not even return an EA, so not entirely sure I understand the purpose of that base method, and it is not being documented as somethin to override for external EA authors?)

jbrockmendel · 2025-09-10T16:22:19Z

is there a reason you removed the _from_scalars

Because it is no longer used anywhere.

And what's the difference exactly with _cast_pointwise_result? That one is not strict?

_from_scalars either returned the same dtype as the original or raised. _cast_pointwise_result does inference while attempting to retain the dtype_backend of* the** original.

So e.g. with a timestamp[pyarrow] Index***, if you did idx.map(lambda x: x - pd.Timestamp(0)), if Index.map uses _from_scalars, it would try to cast back to timestamp[pyarrow], fail and raise. Then we'd fall back to lib.maybe_convert_objects and get a non-pyarrow timedelta64. With _cast_pointwise_result, we get duration[pyarrow], which is what we generally want in these cases.

* and itemsize where relevant
** for this purpose "dtype_backend" is a little fuzzy about whether it includes categorical-ness or sparse-ness.
*** don't hold me to this exact example, as im not sure off the top of my head if we yet use _cast_pointwise_result consistently

jorisvandenbossche · 2025-09-10T17:53:28Z

_from_scalars either returned the same dtype as the original or raised. _cast_pointwise_result does inference while attempting to retain the dtype_backend of* the** original.

That might happen for our own arrays, but I don't think the base class version of _cast_pointwise_result is trying to do any inference with attempt to retain the dtype?
(it just calls maybe_convert_objects, which will typically return a numpy array, or one of our period/datetime/timedelta types)

So as far as I can see, there is no way that the base implementation can ever work correctly for an external EA, which means they will always have to override this method.
While the current implementation of maybe_cast_pointwise_result using _from_scalars can work fine for external EAs.

jorisvandenbossche · 2025-09-10T17:55:17Z

I see that in the issue you wrote:

This will effectively replace _from_scalars, which was a mis-feature.

Can you explain why you think that is the case?

jbrockmendel · 2025-09-10T18:01:09Z

Can you explain why you think that is the case?

Because it only handled same-dtype casting, and even then required lots of overriding for cases when _from_sequence is more aggressive than we'd want. The actual method we need was dtype_backend-preserving inference.

So as far as I can see, there is no way that the base implementation can ever work correctly for an external EA, which means they will always have to override this method.

Fair point. I'll take a look at how we can update the base class method to prevent geopandas from having to override.

ENH: EA._cast_pointwise_result

48f6a8b

jbrockmendel requested a review from rhshadrach as a code owner August 14, 2025 15:38

jbrockmendel changed the title ~~ENH: EA._cast_pointwise_result~~ WIP/ENH: EA._cast_pointwise_result Aug 14, 2025

fix remaining tests

2b55311

jbrockmendel changed the title ~~WIP/ENH: EA._cast_pointwise_result~~ ENH: EA._cast_pointwise_result Aug 15, 2025

jbrockmendel added 4 commits August 15, 2025 07:17

Merge branch 'main' into api-cast_pointwise_result

059f41d

mypy fixup

4905bd4

32bit builds

08a135f

Merge branch 'main' into api-cast_pointwise_result

b107913

jbrockmendel added 2 commits August 15, 2025 15:46

no-infer-string build

08d9ede

simplify override

13fb411

rhshadrach reviewed Aug 18, 2025

View reviewed changes

pandas/core/arrays/arrow/array.py Show resolved Hide resolved

rhshadrach reviewed Aug 18, 2025

View reviewed changes

pandas/core/arrays/categorical.py Show resolved Hide resolved

pandas/tests/extension/decimal/test_decimal.py Show resolved Hide resolved

mroeschke reviewed Aug 19, 2025

View reviewed changes

pandas/core/arrays/arrow/array.py Show resolved Hide resolved

rhshadrach approved these changes Aug 20, 2025

View reviewed changes

mroeschke approved these changes Aug 20, 2025

View reviewed changes

mroeschke added this to the 3.0 milestone Aug 20, 2025

mroeschke added the ExtensionArray Extending pandas with custom dtypes or arrays. label Aug 20, 2025

mroeschke merged commit cb7b334 into pandas-dev:main Aug 20, 2025
41 checks passed

jbrockmendel deleted the api-cast_pointwise_result branch August 20, 2025 16:20

m-richards mentioned this pull request Sep 9, 2025

COMPAT: define _cast_pointwise_result for pandas compat geopandas/geopandas#3646

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH: EA._cast_pointwise_result #62105

ENH: EA._cast_pointwise_result #62105

Uh oh!

jbrockmendel commented Aug 14, 2025 •

edited

Loading

Uh oh!

jbrockmendel commented Aug 14, 2025

Uh oh!

jbrockmendel commented Aug 15, 2025

Uh oh!

rhshadrach left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rhshadrach left a comment

Uh oh!

Uh oh!

mroeschke commented Aug 20, 2025

Uh oh!

jorisvandenbossche commented Sep 10, 2025

Uh oh!

jbrockmendel commented Sep 10, 2025

Uh oh!

jorisvandenbossche commented Sep 10, 2025

Uh oh!

jorisvandenbossche commented Sep 10, 2025

Uh oh!

jbrockmendel commented Sep 10, 2025

Uh oh!

Uh oh!

Uh oh!

ENH: EA._cast_pointwise_result #62105

ENH: EA._cast_pointwise_result #62105

Uh oh!

Conversation

jbrockmendel commented Aug 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jbrockmendel commented Aug 14, 2025

Uh oh!

jbrockmendel commented Aug 15, 2025

Uh oh!

rhshadrach left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rhshadrach left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mroeschke commented Aug 20, 2025

Uh oh!

jorisvandenbossche commented Sep 10, 2025

Uh oh!

jbrockmendel commented Sep 10, 2025

Uh oh!

jorisvandenbossche commented Sep 10, 2025

Uh oh!

jorisvandenbossche commented Sep 10, 2025

Uh oh!

jbrockmendel commented Sep 10, 2025

Uh oh!

Uh oh!

jbrockmendel commented Aug 14, 2025 •

edited

Loading

rhshadrach left a comment •

edited

Loading