Add PyGrain MapDataset integration to jax_privacy experimental training by copybara-service[bot] · Pull Request #279 · google-deepmind/jax_privacy

copybara-service · 2026-06-19T02:12:31Z

Add PyGrain MapDataset integration to jax_privacy experimental training

Integrates PyGrain as an optional dependency for jax_privacy's
experimental training API, enabling DP training on datasets backed
by PyGrain MapDataset (e.g., ArrayRecord on disk).

Key changes:

- New `experimental/_data_loader.py`: Private module that provides a
  batch iterator for PyGrain `MapDataset` using `BatchSelectionStrategy`.
  Auto-detects whether to preload: estimates in-memory size from the
  first element's PyTree structure × dataset length, and preloads if
  under 1 GiB. Both preload and streaming modes use a
  `ThreadPoolExecutor` for parallel element loading.
- Updated `experimental/training.py`: `DPTrainer.fit()` now detects
  PyGrain `MapDataset` inputs (by class name string, not import) and
  routes to the new loader. The module is never imported unless needed.
- New `experimental/_data_loader_test.py`: Standalone and end-to-end
  tests for the data loader integration.
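
For readers without access to the diff, the preload heuristic, parallel loading, and import-free detection described above could look roughly like the sketch below. This is an illustration only, not the code in `experimental/_data_loader.py`: the helper names (`tree_nbytes`, `should_preload`, `load_elements`, `is_pygrain_map_dataset`), the `max_workers` default, and the choice to walk the class MRO are all assumptions, and leaves are assumed to expose `.nbytes` the way NumPy and JAX arrays do.

```python
from concurrent.futures import ThreadPoolExecutor

PRELOAD_LIMIT_BYTES = 1 << 30  # 1 GiB, the threshold named in the description


def tree_nbytes(tree):
    """Sum `.nbytes` over the leaves of a nested dict/list/tuple."""
    if isinstance(tree, dict):
        return sum(tree_nbytes(v) for v in tree.values())
    if isinstance(tree, (list, tuple)):
        return sum(tree_nbytes(v) for v in tree)
    # Assumption: every leaf is array-like and exposes `.nbytes`.
    return tree.nbytes


def should_preload(dataset, limit=PRELOAD_LIMIT_BYTES):
    """Estimate total size as first-element size x length; preload if under limit."""
    if len(dataset) == 0:
        return True  # nothing to load, so preloading is trivially cheap
    return tree_nbytes(dataset[0]) * len(dataset) < limit


def load_elements(dataset, indices, max_workers=8):
    """Fetch dataset[i] for each index in parallel, preserving index order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(dataset.__getitem__, indices))


def is_pygrain_map_dataset(obj):
    """Match on class name across the MRO so that grain is never imported here."""
    return any(cls.__name__ == "MapDataset" for cls in type(obj).__mro__)
```

The estimate is only as good as the first element is representative; datasets with variable-length elements may be under- or over-estimated. Matching by class name keeps PyGrain optional, at the cost of also matching any unrelated class that happens to be named `MapDataset`.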

PiperOrigin-RevId: 934664099

github-actions · 2026-06-27T02:19:12Z

This PR has been idle for 7 days. Please provide an update or review.

copybara-service[bot] force-pushed the test_934664099 branch 14 times, most recently from 1e8c4a4 to e739f72 · June 19, 2026 14:30

copybara-service[bot] force-pushed the test_934664099 branch from e739f72 to 291cb23 · June 19, 2026 14:37

github-actions Bot added the Stale label Jun 27, 2026

copybara-service[bot] wants to merge 1 commit into main from test_934664099
