
FEA ridge benchmarks #18

Merged — 43 commits merged into main on Feb 21, 2024

Conversation

@fcharras (Collaborator) commented Jan 11, 2024

  • run scikit-learn vanilla benchmark for svd
  • have consolidation script work
  • push vanilla scikit-learn to the worksheet
  • run scikit-learn vanilla benchmark for other solvers
  • add cuml benchmark
  • add scikit-learn-intelex benchmark
  • add scikit-learn + array-api benchmark

Targeted devices will be a 32-core CPU and a CUDA GPU. Maybe an Intel Max Series GPU too, if access is not too complicated.

Probably a follow-up PR:

  • factorize the consolidation script (with this third example I have some clearer ideas)

@fcharras force-pushed the FEA/ridge_benchmarks branch from d0720da to 8f06185 on January 11, 2024 10:02
@fcharras (Collaborator, Author) commented Jan 12, 2024

I'm not quite confident I wrote the objective evaluation correctly. Is this right?

        if self.sample_weight_ is not None:
            # Fold sample weights into X and y (out-of-place rescaling).
            X, y, _ = _rescale_data(X, y, self.sample_weight_, inplace=False)

        # Shape to (n_samples, n_targets) and (n_targets, n_features, 1)
        # so that X @ weights broadcasts over targets.
        y = y.reshape((y.shape[0], -1))
        weights = weights.reshape((-1, X.shape[1], 1))

        # Sum of squared residuals plus the L2 penalty, averaged over
        # n_samples * n_targets.
        value = (
            (((X @ weights).squeeze(2) + (intercept - y).T) ** 2).sum()
            + (self.alpha * (weights**2).sum())
        ) / (X.shape[0] * len(y.T))

Or is it better to just report r2_score?
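
For reference, here is a self-contained sketch of the quantity the snippet above is meant to compute, with illustrative names (`coef`, `intercept`) and dense NumPy inputs assumed; the normalization by n_samples * n_targets follows the snippet's convention, not necessarily scikit-learn's internal one:

    import numpy as np

    def ridge_objective(X, y, coef, intercept, alpha):
        # y: (n_samples,) or (n_samples, n_targets); coef: (n_targets, n_features).
        y = y.reshape(y.shape[0], -1)           # (n_samples, n_targets)
        coef = coef.reshape(-1, X.shape[1])     # (n_targets, n_features)
        residuals = X @ coef.T + intercept - y  # (n_samples, n_targets)
        penalty = alpha * (coef ** 2).sum()
        return ((residuals ** 2).sum() + penalty) / (y.shape[0] * y.shape[1])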

@ogrisel (Contributor) left a comment:

Some feedback.

(Resolved review threads on benchmarks/ridge/objective.py and benchmarks/ridge/datasets/simulated_blobs.py, now outdated.)
@ogrisel (Contributor) commented Jan 12, 2024

> or is it better to just report r2_score?

r2_score is not the training objective (in particular, it does not reflect the bias induced by the regularizer), so it's harder to interpret as a metric for checking that models converge to comparable solutions. Unfortunately, scikit-learn does not yet expose the value of its internal training objective, nor a method to compute it from a fitted model.

What you wrote seems sensible at first glance, but it would require detailed code inspection to check that this is actually what is optimized internally in scikit-learn. Note that I started work to improve the convergence tests for Ridge models in scikit-learn/scikit-learn#25948, but I could not yet find time to get back to it.
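
To make the distinction concrete, here is a minimal sketch comparing r2_score with a penalized objective for a fitted Ridge model (the objective convention follows the snippet above; it is an assumption, not scikit-learn's internal formula):

    import numpy as np
    from sklearn.datasets import make_regression
    from sklearn.linear_model import Ridge
    from sklearn.metrics import r2_score

    X, y = make_regression(n_samples=1000, n_features=20, random_state=0)
    model = Ridge(alpha=1.0, solver="svd").fit(X, y)

    # R^2 only measures the quality of the predictions...
    print("r2_score:", r2_score(y, model.predict(X)))

    # ...while the training objective also includes the alpha * ||w||^2 term,
    # so models with identical R^2 can sit at different objective values.
    residuals = model.predict(X) - y
    objective = (
        (residuals ** 2).sum() + model.alpha * (model.coef_ ** 2).sum()
    ) / X.shape[0]
    print("penalized objective:", objective)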

@fcharras marked this pull request as ready for review on January 12, 2024 19:30
@fcharras (Collaborator, Author) commented:
So I added some solvers other than svd. I'm not sure it's appropriate though, since the initial purpose is rather to compare backends of the same solver. If we want different solvers, then maybe we should consider having one folder and sheet per solver. I think it's still interesting to look at, but it's more difficult to set up and read (mostly because of the max_iter and tol parameters).

@fcharras changed the title from "WIP FEA ridge benchmarks" to "FEA ridge benchmarks" on Jan 16, 2024
@ogrisel (Contributor) left a comment:

I browsed the results a bit and it's quite interesting.

Based on the results, here are some suggestions for the benchmark settings to make this even more interesting (in my opinion).

(Resolved review threads on benchmarks/ridge/objective.py and benchmarks/ridge/datasets/simulated_blobs.py, now outdated.)
Comment on lines 27 to 30
("lsqr", 25, 0),
("sparse_cg", 25, 0),
("sag", 50, 0),
("saga", 25, 0),
@ogrisel (Contributor) commented on Jan 16, 2024:

max_iter below 100 seems small to me. Based on the results, the maximum is reached quite often for lsqr, possibly with a very bad value of the objective.

@fcharras (Collaborator, Author) replied:

I reverted max_iter to None and set tol to 1e-4 (the default parameters), so iteration-based solvers may now run different numbers of iterations. For multiple targets, lsqr reports a different number of iterations for each target, but it's inconvenient to report them all... I settled for reporting the maximum over the targets, after observing that it doesn't seem to differ much from one target to another.
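
A sketch of that reporting logic, assuming a multi-target y (for the lsqr and sag solvers, scikit-learn's Ridge stores one iteration count per target in n_iter_):

    import numpy as np
    from sklearn.datasets import make_regression
    from sklearn.linear_model import Ridge

    X, Y = make_regression(n_samples=500, n_features=10, n_targets=4, random_state=0)
    model = Ridge(solver="lsqr", max_iter=None, tol=1e-4).fit(X, Y)

    # n_iter_ holds one entry per target for iterative solvers;
    # report the maximum across targets.
    n_iter_reported = int(np.max(model.n_iter_))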

@fcharras (Collaborator, Author) commented Jan 17, 2024

I have updated following your suggestions and synced the spreadsheet, but for some reason I can't run the cuml solver anymore:

RuntimeError: cuSOLVER error encountered at: file=/[...]/ridge_benchmark/include/raft/linalg/detail/svd.cuh line=78: 

edit: so apparently the cuml svd solver uses a lot of VRAM, and this is a VRAM issue. I added some more dataset dimensions that show what cuml supports or not. (No issue with torch+CUDA svd, on the other hand.)

edit2: that does not explain all occurrences of the error.

edit3: the cuml svd solver apparently does not accept n_features > n_samples, and uses up a lot of memory; it will crash on the biggest datasets.
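
For the benchmark script, a hypothetical guard along these lines could skip configurations that the cuml svd solver cannot handle; both the shape restriction and the rough VRAM estimate below only reflect the failures observed above, not documented cuml behaviour:

    def can_run_cuml_svd(n_samples, n_features, dtype_bytes=4, vram_budget_bytes=16e9):
        # Observed: cuml's svd solver rejects n_features > n_samples.
        if n_features > n_samples:
            return False
        # Observed: it exhausts VRAM on the largest datasets; skip when a
        # crude working-set estimate (a few copies of X plus the SVD
        # workspace) exceeds an assumed budget.
        estimated_bytes = 4 * n_samples * n_features * dtype_bytes
        return estimated_bytes < vram_budget_bytes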

@fcharras (Collaborator, Author) commented:
Some fixes done, and all results pushed and synchronized to the sheet. Let's merge?

@fcharras merged commit bd405f0 into main on Feb 21, 2024
5 checks passed
@fcharras deleted the FEA/ridge_benchmarks branch on February 21, 2024 11:35