Shape-Constrained Regression (SCR) Benchmarks

This repository serves as a benchmark suite for shape-constrained regression algorithms, which can integrate prior knowledge about the expected shape of the prediction function.

Our benchmark suite builds upon established benchmarks for symbolic regression and extends their functionality. We provide Python code for generating training data for experiments, a set of validation datasets for comparing results, and the ability to check whether a function adheres to the specified constraints.

Additionally, we provide the code and a platform for comparing shape-constrained regression algorithms: we host up-to-date comparison charts and link to other algorithms and their publications, where available.

Usage

Examples

See our examples folder for more code snippets and usage examples. There we detail the data sampling methodology and show how to use our constraint checking interface. This README only highlights the key features.

Installation

pip install git+https://github.com/florianBachinger/SCR-Benchmarks.git

(You may need to run pip with root permissions: sudo pip install git+https://github.com/florianBachinger/SCR-Benchmarks.git)

Then import the package:

import SCRBenchmark

Generate Benchmark Data

See the examples folder for more examples.

from SCRBenchmark import FEYNMAN_SRSD_HARD, HARD_NOISE_LEVELS, HARD_SAMPLE_SIZES
from SCRBenchmark import BenchmarkSuite

# Creates one folder per equation under the target folder.
# Each equation folder contains the info file (JSON)
# and the data files for each configuration (CSV).
BenchmarkSuite.create_hard_instances(target_folder='./data',
                                     Equations=FEYNMAN_SRSD_HARD,
                                     sample_sizes=HARD_SAMPLE_SIZES,
                                     noise_levels=HARD_NOISE_LEVELS)
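
Once the folders exist, the generated files can be read back with the standard library alone. The following is a minimal sketch, not part of SCRBenchmark: the helper names `load_equation_folder` and `load_benchmark_root` are hypothetical, no particular file names are assumed (files are collected by extension only), and each CSV file is assumed to start with a header row.

```python
import csv
import json
from pathlib import Path


def load_equation_folder(folder):
    """Read every JSON and CSV file found in one equation folder.

    Hypothetical helper: file names are not assumed, only the extensions.
    """
    folder = Path(folder)
    # Info files: parsed JSON, keyed by file name.
    info = {p.name: json.loads(p.read_text()) for p in sorted(folder.glob("*.json"))}
    # Data files: list of row dictionaries, keyed by file name.
    # Assumption: the first CSV row is a header.
    data = {}
    for csv_path in sorted(folder.glob("*.csv")):
        with csv_path.open(newline="") as handle:
            data[csv_path.name] = list(csv.DictReader(handle))
    return info, data


def load_benchmark_root(root="./data"):
    """Map each equation folder name to its (info, data) pair."""
    return {
        sub.name: load_equation_folder(sub)
        for sub in sorted(Path(root).iterdir())
        if sub.is_dir()
    }
```

Values read through `csv.DictReader` are strings, so numeric columns need to be converted before use.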

Data for individual equations

See the examples folder for more examples.

import SCRBenchmark.SRSDFeynman as srsdf
from SCRBenchmark import Benchmark

ICh6Eq20 = Benchmark(srsdf.FeynmanICh6Eq20)
training, test = ICh6Eq20.create_dataset(sample_size=1000, patience=10, noise_level=0)

Check if functions adhere to constraints

See the examples folder for more detail.

import SCRBenchmark.SRSDFeynman as srsdf
from SCRBenchmark import Benchmark

ICh6Eq20 = Benchmark(srsdf.FeynmanICh6Eq20)
# Raw formula: exp(-(theta / sigma) ** 2 / 2) / (sqrt(2 * pi) * sigma)
# Constraints:
#   the function image is positive
#   monotonically increasing in theta for -inf <= theta <= 0
#   monotonically decreasing in theta for 0 <= theta <= inf

# No constraint is violated in the tested input domain.
ICh6Eq20.check_constraints("-(theta*theta)+100", use_display_names=True)

# Violation: the image of the function is not positive.
ICh6Eq20.check_constraints("-(theta*theta)", use_display_names=True)

# Violation: both monotonicity constraints for x0 (theta) are violated.
ICh6Eq20.check_constraints("(theta*theta)", use_display_names=True)
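
To make the three constraints concrete, they can be approximated numerically on a grid. The sketch below is an illustration only and does not reproduce how `check_constraints` works internally: the helpers `linspace`, `is_positive`, `is_monotonic` and `check_candidate` are hypothetical, and the interval [-3, 3] for theta is an arbitrary choice rather than the input domain used by the library.

```python
import math


def gaussian(theta, sigma):
    """Raw formula quoted above: exp(-(theta/sigma)**2 / 2) / (sqrt(2*pi) * sigma)."""
    return math.exp(-((theta / sigma) ** 2) / 2) / (math.sqrt(2 * math.pi) * sigma)


def linspace(lo, hi, n):
    """Return n evenly spaced points from lo to hi."""
    step = (hi - lo) / (n - 1)
    return [lo + i * step for i in range(n)]


def is_positive(f, lo, hi, n=201):
    """True if f is strictly positive at every grid point in [lo, hi]."""
    return all(f(x) > 0 for x in linspace(lo, hi, n))


def is_monotonic(f, lo, hi, increasing, n=201, tol=1e-12):
    """Compare neighbouring grid values of f on [lo, hi]."""
    values = [f(x) for x in linspace(lo, hi, n)]
    diffs = [b - a for a, b in zip(values, values[1:])]
    if increasing:
        return all(d >= -tol for d in diffs)
    return all(d <= tol for d in diffs)


def check_candidate(f, lo=-3.0, hi=3.0):
    """Evaluate the three constraints on a hypothetical domain [lo, hi]."""
    return {
        "positive": is_positive(f, lo, hi),
        "increasing_left": is_monotonic(f, lo, 0.0, increasing=True),
        "decreasing_right": is_monotonic(f, 0.0, hi, increasing=False),
    }


# The three candidate expressions from the example above, as functions of theta.
candidates = {
    "-(theta*theta)+100": lambda t: -(t * t) + 100,
    "-(theta*theta)": lambda t: -(t * t),
    "(theta*theta)": lambda t: t * t,
}
for name, f in candidates.items():
    print(name, check_candidate(f))
```

A grid check like this can only detect violations at the sampled points; it cannot prove that a constraint holds everywhere in the domain.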

Benchmarked Methods

The following methods are currently benchmarked on the data:

Approach	Description	Reference	Links
SCPR	shape-constrained polynomial regression creating a single global polynomial model	SCRBenchmark: A Benchmarking library for Shape-constrained Regression (Bachinger et al. 2025)	code, paper
SCSR	shape-constrained symbolic regression single objective	SCRBenchmark: A Benchmarking library for Shape-constrained Regression (Bachinger et al. 2025)	code, paper

We will report all results obtained on this benchmark set. To have your method listed in the table of benchmarked methods, please send a pull request containing your execution results and a link to your result repository.

References

Cite this work:

A citation will be added in the future.

Credit

We enhance the work of Matsubara et al. by determining the shape constraints that describe the expected function shapes of the benchmark formulas. This knowledge can be used to enforce the behavior of trained prediction models and, for example, to improve their extrapolation behavior.

Credit to Matsubara et al. for reviewing and adapting the formulas and variable ranges of Udrescu et al. to reasonable sampling values:

@article{Matsubara2022,
  title={Rethinking Symbolic Regression Datasets and Benchmarks for Scientific Discovery},
  author={Matsubara, Yoshitomo and Chiba, Naoya and Igarashi, Ryo and Taniai, Tatsunori and Ushiku, Yoshitaka},
  journal={arXiv preprint arXiv:2206.10540},
  year={2022}
}

Credit to Udrescu et al. for the initial AI Feynman benchmark set:

@article{Udrescu2020,
  title={AI Feynman: A physics-inspired method for symbolic regression},
  author={Udrescu, Silviu-Marian and Tegmark, Max},
  journal={Science Advances},
  volume={6},
  number={16},
  pages={eaay2631},
  year={2020},
  publisher={American Association for the Advancement of Science}
}
