Added `dist` data, generation script, and CI #1

pasabanov · 2025-01-25T08:17:51Z

Types of changes

Feature
Testing
Configuration (CI/CD)

Description

Implemented dist/generate.py script to generate data for testing distributions.
All formulas for generating distributions implemented in the library were implemented in a simplified form.

a and b are global constants. The results of all functions are calculated without taking into account the values of a and b. This is made for simplicity. Instead of changing the values of a and b, mapping_intervals (see below) are added.

The calculations rely on the mpmath library and are performed with a precision of 100 significant digits. However, the numbers are rounded to 17 significant digits before being output. This ensures the highest possible precision (actually higher than double precision).

It is not expected that the library data will match the test data exactly, as the library provides ideal results. The library should match with a precision up to a certain significant digit (the exact precision will be determined later).

The script allows the output file name to be specified using the -o (--output) flag. If no output file is specified, the script’s output will be directed to the standard output stream.
Created dist/dist.toml as output of the script.
The dist/dist.toml file has been carefully constructed step-by-step based on the output from the dist/generate.py script. At each stage, it was verified that the old values remained unchanged. The calculations for uniform and chebyshev were initially done manually using wolframalpha.com.

At the beginning of the file, intervals labeled mapping_intervals were added, within which each test case will undergo additional testing. The expected values specified in the file are linearly mapped to each interval, and the library generates new values for each interval.

Since the library first generates points on intervals like (-1, 1) or (0, 1) when creating distributions, the loss of precision during the linear mapping should be approximately the same for both generated and test data.
Added verify_data.py script to verify the generated data against the saved data.
To verify the generated data, a temporary file is used, which is deleted after the verification process.
Added .github/workflows/ci.yml to verify the data during the CI build process.
To maintain the integrity of the repository, a CI script has been created to verify the test data with every change to the main branch.

1. Implemented `dist/generate.py` script to generate data for testing distributions. 2. Created `dist/dist.toml` as output of the script. 3. Added `verify_data.py` script to verify the generated data against the saved data. 4. Added `.github/workflows/ci.yml` to verify the data during the CI process.

pasabanov added config Configuring the project feature New feature or request tests Adding or changing tests labels Jan 25, 2025

pasabanov self-assigned this Jan 25, 2025

pasabanov force-pushed the dist branch from 58b3897 to ca76fde Compare January 25, 2025 08:28

pasabanov force-pushed the dist branch from ca76fde to 5647eda Compare January 25, 2025 08:30

pasabanov merged commit e9dc09d into main Jan 25, 2025
1 check passed

pasabanov deleted the dist branch January 25, 2025 08:31

pasabanov mentioned this pull request Jan 26, 2025

Implemented data-driven testing for dist, related fixes and docs ALFI-lib/ALFI#19

Merged

This was referenced Feb 2, 2025

Renamed sigmoid to logistic #2

Merged

Simplified circle_proj, logistic and erf #3

Merged

Added quadratic and cubic distributions data #4

Merged

pasabanov mentioned this pull request Mar 11, 2025

Added chebyshev(_ellipse)?_(3|4) distributions data #6

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added `dist` data, generation script, and CI #1

Added `dist` data, generation script, and CI #1

Uh oh!

pasabanov commented Jan 25, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Added dist data, generation script, and CI #1

Added dist data, generation script, and CI #1

Uh oh!

Conversation

pasabanov commented Jan 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Types of changes

Description

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Added `dist` data, generation script, and CI #1

Added `dist` data, generation script, and CI #1

pasabanov commented Jan 25, 2025 •

edited

Loading