Releases: sodadata/soda-core
v3.0.12
What's Changed
- [core] Duplicate percent check by @m1n0 in #1649
- [core] Change over time - remove 'same day last month' by @m1n0 in #1648
- [cloud] don't send null file ref, when failed rows are disabled by @vijaykiran in #1650
- [core] Modify Test Column Names by @tdstark in #1652
- [scientific] feat: allow use of in-check filters for distribution checks by @tituskx in #1655
- [trino] Update trino_data_source.py by @ScottAtDisney in #1658
- [sqlserver] Change count to big_count by @vijaykiran in #1660
- [core] Failed rows exclude columns by @m1n0 in #1657
- [core] Introduce http sampler by @vijaykiran in #1665
New Contributors
Full Changelog: v3.0.11...v3.0.12
v3.0.11
What's Changed
- fix: limit the bin size and handle zero division for continuous DRO by @baturayo in #1624
- fix: handle DRO generation for columns with 0 rows by @baturayo in #1627
- chore: pin prophet to >=1.1 by @bastienboutonnet in #1629
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #1630
- Parse cli variables correctly, fix cli tests to actually assert result. by @m1n0 in #1634
- refactor: add bins and weights doc link to DRO exception handling logs by @baturayo in #1633
- Verify data source connection command by @m1n0 in #1636
- Redshift: use SVV_COLUMNS to get table metadata by @m1n0 in #1635
- Snowflake: use upper case in table metadata query by @m1n0 in #1639
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #1643
- Trino: fix py310 type hints by @m1n0 in #1641
- fix: variable substitution in schema check query by @ceyhunkerti in #1628
- fix(anomaly_check): only send outcomeReasons with severity "warn" or "error" by @tituskx in #1640
- fixing bq separate compute storage project by @thiagodeschamps in #1638
- Bigquery: fix distribution check by @m1n0 in #1647
- Change over time - add same day/month support by @m1n0 in #1645
New Contributors
- @ceyhunkerti made their first contribution in #1628
- @thiagodeschamps made their first contribution in #1638
Full Changelog: v3.0.10...v3.0.11
v3.0.10
What's Changed
- Failed rows configurable sample size by @m1n0 in #1608
- Skip change over time checks when historical measurements not available by @m1n0 in #1615
- Add Check source field for cloud by @m1n0 in #1614
- Use correct dicts when building scan result by @m1n0 in #1612
- Failed rows configurable sample size for duplicate_count check by @m1n0 in #1613
- feat: check historical metrics are not None or log helpful message by @bastienboutonnet in #1600
- Dremio support by @vijaykiran in #1618
- fix: handle very large bin sizes by filtering out outliers for dro generation by @baturayo in #1616
- Include psycopg2 requirement for redshift by @m1n0 in #1620
- fix: ensure PSI and SWD can deal with decimal.Decimal type by @tituskx in #1611
Full Changelog: v3.0.9...v3.0.10
v3.0.9
What's Changed
- Limit failed rows sample limit to 1000 by @m1n0 in #1599
- Add scan result getter by @m1n0 in #1602
- BQ separate project for compute and storage. by @m1n0 in #1598
- Scan results file argument by @vijaykiran in #1603
- Chore/move snowflake account by @jmarien in #1607
- Use filename in check identity by @m1n0 in #1606
Full Changelog: v3.0.8...v3.0.9
Please note that installing soda-core may fail with the latest version of protobuf (protocolbuffers/protobuf#10571). If the bug has not been fixed yet, try installing protobuf 3.19.4 or 3.20.1.
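As a rough illustration of the protobuf workaround, the sketch below checks which protobuf version is installed and, if it is not one of the two versions named in the note, prints a pip command to try. The helper names `protobuf_pin_hint` and `installed_protobuf_version` are hypothetical and not part of soda-core. The sketch assumes Python 3.8 or newer for `importlib.metadata`.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions named in the release note as a workaround (not an exhaustive list).
SUGGESTED_PROTOBUF_VERSIONS = ("3.19.4", "3.20.1")


def protobuf_pin_hint(installed):
    """Return a pip command to try, or None if a suggested version is installed.

    Hypothetical helper for illustration; `installed` is a version string
    or None when protobuf is not installed at all.
    """
    if installed in SUGGESTED_PROTOBUF_VERSIONS:
        return None
    return f'pip install "protobuf=={SUGGESTED_PROTOBUF_VERSIONS[-1]}"'


def installed_protobuf_version():
    """Look up the installed protobuf version, or None if it is absent."""
    try:
        return version("protobuf")
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    hint = protobuf_pin_hint(installed_protobuf_version())
    print(hint or "protobuf is already at a suggested version")
```

If the script prints a pip command, running it before installing soda-core pins protobuf to one of the versions the note suggests.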
v3.0.8
What's Changed
- feat: raise more user-friendly log message when importing sci library fails by @bastienboutonnet in #1584
- Fix dbt table name by @vijaykiran in #1587
- SODA-914: add context authentication and impersonation for BigQuery by @tooobsias in #1588
- Basic Sqlserver regex support by @m1n0 in #1586
- Fix change over time with percentage with no extra config by @m1n0 in #1592
- Fix mysql/mariadb compatibility for regex by @vijaykiran in #1591
- Minor edits to cli help messages. by @janet-can in #1590
- fix: prevent empty message in outcomeReasons by @bastienboutonnet in #1596
- Scan results dict by @m1n0 in #1595
- Add variable resolution to query level by @vijaykiran in #1597
New Contributors
- @tooobsias made their first contribution in #1588
Full Changelog: v3.0.7...v3.0.8
v3.0.7
What's Changed
- Add `dbt:` prefix to dbt checks by @vijaykiran in #1574
- Update freshness value to be milliseconds and add measure by @vijaykiran in #1575
- Add configurable API URL for dbt cloud by @vijaykiran in #1576
- Resolve variables in user defined queries by @vijaykiran in #1577
- Fix dbt cloud ingest, improve logging. by @m1n0 in #1578
- Fixed port option and regexp by @ScottAtDisney in #1579
- Fix dbt checks not being sent properly by @vijaykiran in #1580
- Fix regex tests for mysql by @vijaykiran in #1583
Full Changelog: v3.0.6...v3.0.7
v3.0.6
What's Changed
- fix: add identityB to add datasource name in identity by @vijaykiran in #1556
- Databricks SQL Warehouse support by @vijaykiran in #1559
- SODA-239 Added application flag to snowflake connect by @tombaeyens in #1561
- Add identities by @vijaykiran in #1569
- Add support for custom sampler by @vijaykiran in #1570
- SAS-322 Handle numerical column/table names by @m1n0 in #1572
- dbt ingestion support by @m1n0 in #1552
Full Changelog: v3.0.5...v3.0.6
v3.0.5
What's Changed
- Update README to include package for mysql by @janet-can in #1532
- Fix duplicate query by @m1n0 in #1543
- Add comments explaining cloud payload by @m1n0 in #1545
- Data source contributing docs by @m1n0 in #1546
- refactor: turn no matching table error into a warning to avoid scan failing when all tables are excluded by @bastienboutonnet in #1533
- feat(profiling): add support for extra numeric and text datatypes by @bastienboutonnet in #1534
- Fix 'missing format' in numeric metrics by @m1n0 in #1549
- Trino data source by @ScottAtDisney in #1553
- Change spark installation to decouple deps by @vijaykiran in #1554
Full Changelog: v3.0.4...v3.0.5
v3.0.4
What's Changed
- testing switch to 22.04 for GA by @jmarien in #1521
- Log and trace Soda Cloud trace IDs by @m1n0 in #1520
- Update docker image for sqlserver support by @vijaykiran in #1522
- Add option to set scan datetime by @vijaykiran in #1531
- MySQL Support by @vijaykiran in #1526
Full Changelog: v3.0.3...v3.0.4
v3.0.3
New Features
- SQLServer support by @vijaykiran in #1515
Fixes
- fix: better logging messages for profiling and discover datasets by @baturayo in #1498
- Fix config file creation when first path is not writable by @m1n0 in #1504
- fix: Failed rows don't consider filter by @vijaykiran in #1505
- Fix log message by @m1n0 in #1507
- Fix reference check for null values in source column by @m1n0 in #1509
- Attach sample rows to reference check by @m1n0 in #1508
- Make sure results to sodacloud are sent when there is an exception by @vijaykiran in #1510
- Fix for regex on collated columns in Snowflake by @ScottAtDisney in #1516
Enhancements
- Check name refactor by @m1n0 in #1502
- Set basic telemetry scan data even in case of exceptions by @m1n0 in #1512
- Improve athena test fixture auth setup by @m1n0 in #1501
- Publish data source packages for python 3.7 by @m1n0 in #1514
- Inform about wrong check indentation in logs by @m1n0 in #1517
- feat: skip row count query during column profiling by @bastienboutonnet in #1518
- feat: support 'text' data type in column profiling by @bastienboutonnet in #1519
Full Changelog: v3.0.2...v3.0.3