Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CI: drop python 3.9, numpy 1.21 #266

Open
wants to merge 6 commits into
base: main
Choose a base branch
from
Open

CI: drop python 3.9, numpy 1.21 #266

wants to merge 6 commits into from

Conversation

ev-br
Copy link
Member

@ev-br ev-br commented Mar 3, 2025

closes gh-230

This likely affects scikit-learn, scikit-learn/scikit-learn#30895 (comment)

@ev-br ev-br added this to the 1.12 milestone Mar 3, 2025
ev-br added 2 commits March 3, 2025 13:10
Drop python 3.9 and 3.10, drop numpy 1.21
Not adding python 3.13 because ndonnx only supports 3.12 at the moment.
@ev-br ev-br modified the milestones: 1.11.1, 1.12 Mar 4, 2025
@ev-br ev-br mentioned this pull request Mar 4, 2025
3 tasks
@lucascolley
Copy link
Member

lucascolley commented Mar 19, 2025

scikit-learn have now dropped 3.9 and 1.21

@ev-br ev-br modified the milestones: 1.11.2, 1.12 Mar 20, 2025
@ev-br ev-br mentioned this pull request Mar 20, 2025
Copy link
Member

@lucascolley lucascolley left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we test with NuPy 1.22?

# Min version of dask we need dropped support for Python 3.9
# There is no numpy git tip for Python 3.9 or 3.10
python-version: ${{ (inputs.package-name == 'dask' && fromJson('[''3.10'', ''3.11'', ''3.12'']')) || (inputs.package-name == 'numpy' && inputs.xfails-file-extra == '-dev' && fromJson('[''3.11'', ''3.12'']')) || fromJson('[''3.9'', ''3.10'', ''3.11'', ''3.12'']') }}
python-version: ['3.11', '3.12', '3.13']
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No 3.10?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, as an attempt to trim the CI size a bit. Experience over the last couple of months is that there were no issues which were python version dependent.

@ev-br
Copy link
Member Author

ev-br commented Mar 21, 2025

Am planning to merge this next week if this looks reasonable to people. A possible contention point is the size of the CI matrix: this PR proposes to drop python 3.9 and 3.10 and numpy 1.21; this way we'll test on python 3.11, 3.12 and 3.13 x numpy 1.26, "latest released", and numpy-dev.

Jax, ndonnx, dask and pytorch: no changes, we only test with the latest released version.

@rgommers
Copy link
Member

The linked scikit-learn issue has a clear plan, which is nice. It just dropped Python 3.9, so dropping Python 3.10 here seems quite aggressive - and probably not necessary? If you want to reduce the CI matrix, just remove 3.11 instead while keeping the lowest-supported version.

@lucascolley
Copy link
Member

Similarly, I think we should be testing NumPy 1.22 as the min. supported by sklearn.

@lucascolley
Copy link
Member

I spoke to Olivier and Tim yesterday and suggested that there shouldn't be any problems with array-api-compat and array-api-extra dropping more aggressively than scikit-learn, now that they have written down some reasonable rules. This matters to them as we are very close to them vendoring both libraries: scikit-learn/scikit-learn#30340

@lucascolley
Copy link
Member

For comparison, in array-api-extra we define the following test envs

tests-numpy1 = ["py310", "tests", "numpy1"]
tests-py310 = ["py310", "tests"]
tests-py313 = ["py313", "tests"]
tests-backends = ["py310", "tests", "backends"]
tests-cuda = ["py310", "tests", "backends", "cuda-backends"]

and use the non-CUDA envs in the CI matrix

matrix:
    environment: [tests-py310, tests-py313, tests-numpy1, tests-backends]

Our numpy1 env is currently using oldest SPEC 0 NumPy (1.25), but I think we should change that to the oldest supported by sklearn (1.22) (data-apis/array-api-extra#169 (comment))

Copy link
Member

@lucascolley lucascolley left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

to clarify, I would like to see NumPy 1.22 tested before merging

@ev-br
Copy link
Member Author

ev-br commented Mar 31, 2025

Well, from past experience there was not much useful signal from keeping both numpy 1.21 and 1.26. All it did was a bit was a bit more churn to duplicate xfails for the two numpy versions. If we are to add 1.22, we can keep 1.21 just as well.

@lucascolley
Copy link
Member

How about 1.22 instead of 1.26?

@ev-br
Copy link
Member Author

ev-br commented Mar 31, 2025

Honestly, I don't see much point.
Esp given that we're planning to make parts of the testing matrix only compatible with python 3.12+ for _buffer__,
https://hackmd.io/zn5bvdZTQIeJmb3RW1B-8g#Downstream-libraries-update, and numpy 1.26 seems to be the first one to even support python 3.12.
Granted, it's mainly for array-api-strict, not array-api-compat. So technically, we can keep -compat very very conservative. Why though.

@lucascolley
Copy link
Member

lucascolley commented Mar 31, 2025

Well, I suppose we should ask the scikit-learn devs, as they are the 'users' of the NumPy 1.22 support. Does you foresee it being useful to catch things in array-api-compat CI before they propagate to array-api-extra or scikit-learn, @betatim?

@crusaderky
Copy link
Contributor

crusaderky commented Apr 3, 2025

My 2 cents on NumPy 1.22 is that

  1. it's out of SPEC0 so it's reasonable to ask scikit-learn developers to drop it. I think it would also be reasonable to say that they won't be able to upgrade array-api-compat beyond 1.11 until they do so.
  2. however, if it is supported, it must be tested.

FYI SPEC0 dropped NumPy 1.25 two days ago.

For the record, I do not advocate being this much aggressive when SPEC0 will drop 1.26 three months from now. But asking final users to upgrade to the latest available 1.x should be painless for most.

I just verified that NumPy 1.22 does have _CopyMode and would work without this special casing, whereas 1.21 doesn't:

if hasattr(np, '_CopyMode'):
if copy is None:
copy = np._CopyMode.IF_NEEDED
elif copy is False:
copy = np._CopyMode.NEVER
elif copy is True:
copy = np._CopyMode.ALWAYS
else:
# Not present in older NumPys. In this case, we cannot really support
# copy=False.
if copy is False:
raise NotImplementedError("asarray(copy=False) requires a newer version of NumPy.")

that said, I don't know (without running the unit tests) if anything else would still require a special code path.

@@ -5,13 +5,11 @@ jobs:
runs-on: ubuntu-latest
strategy:
matrix:
python-version: ['3.9', '3.10', '3.11', '3.12']
numpy-version: ['1.21', '1.26', '2.0', 'dev']
python-version: ['3.10', '3.12']
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
python-version: ['3.10', '3.12']
python-version: ['3.10', '3.12', '3.13']

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

3.12 testing can be dropped from CI?

Copy link
Contributor

@crusaderky crusaderky Apr 3, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not if you want to test numpy 1.26 on the most recent python version it supports.
But yes, we could run on 3.12 for numpy 1.x and on 3.13 for everything else.

Comment on lines +66 to +67
# NumPy 1.26 doesn't support Python 3.13. There doesn't seem to be a way
# to put this in the numpy 1.26 config file.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can't we just use an exclude: directive like in tests.yml below?

@lucascolley
Copy link
Member

lucascolley commented Apr 3, 2025

it's out of SPEC0 so it's reasonable to ask scikit-learn developers to drop it. I think it would also be reasonable to say that they won't be able to upgrade array-api-compat beyond 1.11 until they do so.

We have asked and the clear answer was that scikit-learn won't be following SPEC 0 in its current form. They have written down concrete rules for dropping versions at https://scikit-learn.org/dev/developers/maintainer.html#guideline-for-bumping-minimum-versions-of-our-dependencies. I hope that SPEC 0 can be adjusted so that scikit-learn is compliant (perhaps at the Scientific Python summit this year), but I think preventing scikit-learn from using the latest versions of these libraries has the potential to be very problematic, for insignificant practical gain.

@crusaderky
Copy link
Contributor

https://scikit-learn.org/dev/developers/maintainer.html#guideline-for-bumping-minimum-versions-of-our-dependencies

Forget about 1.22 vs. 1.26. This policy implies that they commit to support NumPy 1.x until October 2027, when Python 3.12 will fall out of their 4 years window. By then, SPEC0 will recommend NumPy >=2.3.
This is a lot of luggage that everyone else who maintains their dependencies is asked to carry too.

NEP29 first and SPEC0 then were written for a reason. I find it a bit baffling - not that one library can decide to diverge (to each the problems they choose for themselves) - but that other libraries, whose developers likely disagree with their decision, are asked to accommodate them.

@lucascolley
Copy link
Member

I hope that SPEC 0 can be adjusted so that scikit-learn is compliant

support NumPy 1.x until October 2027

perhaps I am hoping for more of a compromise then :)

@betatim
Copy link
Member

betatim commented Apr 3, 2025

I'm not sure I can add much here (sorry for being a few days late Lucas). scikit-learn will follow its policy for the next while. Both because it has been its inofficial policy for a long time and because getting agreement on making it official policy took some effort. Of course in the future things will get reconsidered and we left an escape hatch to drop things earlier if there is a good reason. Right now I think it will be hard to convince the rest of scikit-learn to accelerate the Numpy schedule because of a compat library that is used by an experimental feature of scikit-learn (at least I'm hard to convince).

I think having a CI job that tests the oldest set of dependencies is worth having, and skipping intermediate versions is a good way to reduce CI matrix size.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Drop Python 3.9
5 participants