Fix epsilon_from_gdp to return a valid high-confidence lower bound by vvv214 · Pull Request #270 · google-deepmind/jax_privacy

vvv214 · 2026-06-15T12:14:18Z

Summary

CanaryScoreAuditor.epsilon_from_gdp should return a high-confidence lower bound. The previous implementation used

np.abs(isf(FPR_ub) - ppf(FNR_ub))

as a shortcut for the reverse D', D direction. With Clopper-Pearson upper confidence bounds, that shortcut can be too optimistic: a negative forward-direction mu is not automatically valid reverse-direction evidence, because the reverse direction needs its own error-rate bounds and threshold frontier.

This PR computes the two directions explicitly:

one-sided GDP mu on the original D, D' score frontier;
one-sided GDP mu on the swapped D', D score frontier;
one Bonferroni budget over both frontiers' FPR/FNR confidence bounds;
one final conversion from max(mu_forward, mu_reverse, 0) to (epsilon, delta).

Tests

Added coverage for null, small-sample null, forward-separated, and reverse-separated scores. Against the previous implementation:

scenario	previous implementation	this PR
null scores	fails (`epsilon ~= 3.54`)	passes (`epsilon = 0`)
small-sample null scores	fails (`epsilon ~= 5.92`)	passes (`epsilon = 0`)
forward-separated scores	passes	passes
reverse-separated scores	fails (`epsilon = 0`)	passes (`epsilon ~= 15.81`)

Also checked locally:

uvx pyink --check --diff jax_privacy/auditing.py tests/auditing_test.py
git diff --check
python3 -m py_compile jax_privacy/auditing.py tests/auditing_test.py

google-cla · 2026-06-15T12:14:34Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

galenandrew-google

Thank you for your change. The code looks good but I believe the tests can be improved.

galenandrew-google · 2026-06-30T20:50:43Z

+    eps = auditor.epsilon_from_gdp(significance, delta)
+    self.assertLessEqual(eps, 0.1)
+
+  def test_epsilon_from_gdp_separated_is_positive(self):


This test is less specific than the earlier test [test_epsilon_from_gdp_tight]. Please remove it.

galenandrew-google · 2026-06-30T20:55:26Z

    true_eps = dp_accounting.get_epsilon_gaussian(1 / mu, delta)
    np.testing.assert_allclose(eps, true_eps, rtol=0.05)

+  def test_epsilon_from_gdp_null_is_zero(self):


If I understand correctly, the change in this PR means that if the in_canary_scores have smaller values on average than out_canary_scores, the returned epsilon is zero. If so, we should test that case like:

in_canary_scores = rng.normal(-1, 1, m)
out_canary_scores = rng.normal(0, 1, m)

I would suggest that instead of this test and test_epsilon_from_gdp_small_sample_null_is_zero below, we have a single parameterized product test that looks at
(large sample, small sample) x (mu 0, mu negative)

vvv214 force-pushed the fix-auditing-gdp-lower-bound branch 7 times, most recently from 62b5ab7 to 022bad8 Compare June 23, 2026 07:35

Fix epsilon_from_gdp to return a valid two-sided lower bound

65e2006

vvv214 force-pushed the fix-auditing-gdp-lower-bound branch from 022bad8 to 65e2006 Compare June 23, 2026 14:40

vvv214 added 3 commits June 24, 2026 10:34

Implement two-sided GDP lower-bound audit

5844ce9

Avoid protected access in two-sided GDP audit

3c0313e

Clarify two-sided GDP audit implementation

b7b5c3d

vvv214 marked this pull request as ready for review June 24, 2026 03:00

Trigger CI rerun

4c4d65c

galenandrew-google reviewed Jun 30, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix epsilon_from_gdp to return a valid high-confidence lower bound#270

Fix epsilon_from_gdp to return a valid high-confidence lower bound#270
vvv214 wants to merge 5 commits into
google-deepmind:mainfrom
vvv214:fix-auditing-gdp-lower-bound

vvv214 commented Jun 15, 2026 •

edited

Loading

Uh oh!

google-cla Bot commented Jun 15, 2026

Uh oh!

galenandrew-google left a comment

Uh oh!

galenandrew-google Jun 30, 2026

Uh oh!

galenandrew-google Jun 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

vvv214 commented Jun 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Tests

Uh oh!

google-cla Bot commented Jun 15, 2026

Uh oh!

galenandrew-google left a comment

Choose a reason for hiding this comment

Uh oh!

galenandrew-google Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

galenandrew-google Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vvv214 commented Jun 15, 2026 •

edited

Loading