
Conversation

@zikangh (Collaborator) commented Nov 14, 2025

## 🥞 Stacked PR

Use this [link](https://github.com/delta-io/delta/pull/5499/files) to review incremental changes.


#### Which Delta project/connector is this regarding?

- [x] Spark
- [ ] Standalone
- [ ] Flink
- [ ] Kernel
- [ ] Other (fill in here)

## Description

In this PR, we implement the DSv2 methods `planInputPartitions()` and `createReaderFactory()`. These are DSv2-only APIs that will replace DSv1's `getBatch()`.

- `planInputPartitions()`: returns the physical partitions describing how to read the data; called once per micro-batch during planning.
- `createReaderFactory()`: returns a factory that creates readers for the partitions; each executor uses this factory to read its assigned partitions.
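For illustration, here is a minimal, hedged sketch of how these two methods fit the DSv2 `MicroBatchStream` contract. The helpers `planFilesBetween` and `openReader` are hypothetical placeholders, not this PR's actual code.

```java
import java.util.List;
import org.apache.spark.sql.catalyst.InternalRow;
import org.apache.spark.sql.connector.read.InputPartition;
import org.apache.spark.sql.connector.read.PartitionReader;
import org.apache.spark.sql.connector.read.PartitionReaderFactory;
import org.apache.spark.sql.connector.read.streaming.MicroBatchStream;
import org.apache.spark.sql.connector.read.streaming.Offset;

public abstract class MicroBatchStreamSketch implements MicroBatchStream {

  @Override
  public InputPartition[] planInputPartitions(Offset start, Offset end) {
    // Runs on the driver once per micro-batch: enumerate the files committed in
    // (start, end] and pack them into read partitions.
    List<InputPartition> partitions = planFilesBetween(start, end); // hypothetical helper
    return partitions.toArray(new InputPartition[0]);
  }

  @Override
  public PartitionReaderFactory createReaderFactory() {
    // The factory is serialized and shipped to executors; each task asks it for a
    // reader over its assigned InputPartition. (A real factory is a standalone
    // serializable class; an anonymous class is shown here only for brevity.)
    return new PartitionReaderFactory() {
      @Override
      public PartitionReader<InternalRow> createReader(InputPartition partition) {
        return openReader(partition); // hypothetical helper
      }
    };
  }

  /** Hypothetical: splits the files added between the two offsets into partitions. */
  protected abstract List<InputPartition> planFilesBetween(Offset start, Offset end);

  /** Hypothetical: builds a row reader for one partition (e.g. a Parquet reader). */
  protected abstract PartitionReader<InternalRow> openReader(InputPartition partition);
}
```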

## How was this patch tested?

## Does this PR introduce _any_ user-facing changes?

@zikangh zikangh changed the title minor change [kernel-spark] planInputPartitions Nov 17, 2025
@zikangh zikangh changed the title [kernel-spark] planInputPartitions [WIP] [kernel-spark] planInputPartitions Nov 17, 2025
@zikangh zikangh force-pushed the stack/plan1 branch 7 times, most recently from 5382311 to 90345c7 Compare December 1, 2025 19:11
@zikangh zikangh changed the title [WIP] [kernel-spark] planInputPartitions [kernel-spark] Implement planInputPartitions and createReaderFactory for dsv2 streaming Dec 1, 2025
huan233usc pushed a commit that referenced this pull request Dec 1, 2025
…treaming (#5409)

## 🥞 Stacked PR

Use this [link](https://github.com/delta-io/delta/pull/5409/files) to review incremental changes.

- [**stack/latestsnapshot2**](#5409) [[Files changed](https://github.com/delta-io/delta/pull/5409/files)]
- [stack/initialoffset2](#5498) [[Files changed](https://github.com/delta-io/delta/pull/5498/files/1718356813a6b39c80585d36e7aac6c8abc3a6a0..9833eaf816ee2f1dcf94d5d9a47136e69fd26336)]
- [stack/plan1](#5499) [[Files changed](https://github.com/delta-io/delta/pull/5499/files/9833eaf816ee2f1dcf94d5d9a47136e69fd26336..90345c732c6bd182c51648a4b875fdce2c14fc63)]
- [stack/integration](#5572) [[Files changed](https://github.com/delta-io/delta/pull/5572/files/90345c732c6bd182c51648a4b875fdce2c14fc63..813a49a41719ef4b773caf5438c975c8f77c646b)]
  - stack/reader


#### Which Delta project/connector is this regarding?

- [x] Spark
- [ ] Standalone
- [ ] Flink
- [ ] Kernel
- [ ] Other (fill in here)

## Description

We add implementations for `latestOffset(startOffset, limit)` and `getDefaultReadLimit()` for a complete `SupportsAdmissionControl` implementation. We also refactored a few `DeltaSource.scala` methods, making them static so they can be called from `SparkMicroBatchStream.java`.

## How was this patch tested?

Parameterized tests verifying parity between DSv1 (DeltaSource) and DSv2
(SparkMicroBatchStream).



## Does this PR introduce _any_ user-facing changes?
No

---------

Signed-off-by: TimothyW553 <[email protected]>
Signed-off-by: Timothy Wang <[email protected]>
Co-authored-by: Claude <[email protected]>
Co-authored-by: Timothy Wang <[email protected]>
TimothyW553 added a commit to TimothyW553/delta that referenced this pull request Dec 2, 2025

…treaming (delta-io#5409)

List<PartitionedFile> partitionedFiles = new ArrayList<>();
long totalBytesToRead = 0;
try (CloseableIterator<IndexedFile> fileChanges =
Collaborator:

QQ: is it possible to push down dataFilters when getting the file list?

Collaborator:

I think we need to support data skipping in kernel's getChange API

@zikangh (Author):

Yes, but we don't need to add this support for now, because the DSv1 connector doesn't actually enable data skipping.

Comment on lines 286 to 303
InternalRow partitionRow =
    PartitionUtils.getPartitionRow(
        addFile.getPartitionValues(),
        partitionSchema,
        ZoneId.of(sqlConf.sessionLocalTimeZone()));
// Preferred node locations are not used.
String[] preferredLocations = new String[0];
// Constant metadata columns are not used.
scala.collection.immutable.Map<String, Object> otherConstantMetadataColumnValues =
    scala.collection.immutable.Map$.MODULE$.empty();
Collaborator:

Can this code be shared between batch and streaming? For example:

`PartitionedFile buildPartitionFile(AddFile, partitionSchema)`

@zikangh (Author):

Done.
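For reference, a hedged sketch of what such a shared helper could look like, assembled from the fragments quoted above. The class name `PartitionedFileBuilder`, the extra `tablePath`/`sessionTimeZone` parameters, and the `PartitionedFile` constructor arity (shown here as in Spark 3.5) are assumptions, not the merged code; `PartitionUtils` and `AddFile` are the connector-internal and Kernel types referenced in the quoted diff, with their imports omitted.

```java
import java.time.ZoneId;
import org.apache.spark.paths.SparkPath;
import org.apache.spark.sql.catalyst.InternalRow;
import org.apache.spark.sql.execution.datasources.PartitionedFile;
import org.apache.spark.sql.types.StructType;
// PartitionUtils and the Kernel AddFile imports are omitted; they are the
// project-internal types referenced in the quoted diff above.

public final class PartitionedFileBuilder { // hypothetical class name
  private PartitionedFileBuilder() {}

  /** Builds one PartitionedFile from a Kernel AddFile, shared by batch and streaming scans. */
  public static PartitionedFile buildPartitionedFile(
      AddFile addFile, StructType partitionSchema, String tablePath, String sessionTimeZone) {
    InternalRow partitionRow =
        PartitionUtils.getPartitionRow(
            addFile.getPartitionValues(), partitionSchema, ZoneId.of(sessionTimeZone));
    // Preferred node locations and constant metadata columns are not used,
    // mirroring the quoted diff.
    String[] preferredLocations = new String[0];
    scala.collection.immutable.Map<String, Object> otherConstantMetadataColumnValues =
        scala.collection.immutable.Map$.MODULE$.empty();
    return new PartitionedFile(
        partitionRow,
        SparkPath.fromUrlString(tablePath + addFile.getPath()),
        /* start= */ 0L, // a single AddFile is never split mid-file here
        /* length= */ addFile.getSize(),
        preferredLocations,
        /* modificationTime= */ addFile.getModificationTime(),
        /* fileSize= */ addFile.getSize(),
        otherConstantMetadataColumnValues);
  }
}
```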

@zikangh zikangh force-pushed the stack/plan1 branch 5 times, most recently from 154897c to 35731eb Compare December 5, 2025 00:59
huan233usc pushed a commit that referenced this pull request Dec 5, 2025
## 🥞 Stacked PR

Use this [link](https://github.com/delta-io/delta/pull/5498/files) to review incremental changes.

- [**stack/initialoffset2**](#5498) [[Files changed](https://github.com/delta-io/delta/pull/5498/files)]
- [stack/plan1](#5499) [[Files changed](https://github.com/delta-io/delta/pull/5499/files/90e1d9ba4b26d039bfa1b870e693e73204201750..35731eb6ffcb10f85ed97b04058e3bf49de771d8)]
- [stack/integration](#5572) [[Files changed](https://github.com/delta-io/delta/pull/5572/files/35731eb6ffcb10f85ed97b04058e3bf49de771d8..9c2e743cff0c1fcb8cf6ddf8efa3a1b98fddba3c)]
  - stack/snapshot1
- [stack/reader](#5638) [[Files changed](https://github.com/delta-io/delta/pull/5638/files/35731eb6ffcb10f85ed97b04058e3bf49de771d8..35731eb6ffcb10f85ed97b04058e3bf49de771d8)]


#### Which Delta project/connector is this regarding?

- [x] Spark
- [ ] Standalone
- [ ] Flink
- [ ] Kernel
- [ ] Other (fill in here)

## Description

We finish implementing `initialOffset()` in
`SparkMicroBatchStream.java`.

The `initialOffset()` method determines where a streaming query should
start reading when there's no checkpointed offset. This is a DSv2-only
API.

Details:

- Added an `isFirstBatch` tracking field: a boolean flag, set to true in `initialOffset()`, that tracks whether we're processing the first batch.
- Updated `latestOffset(startOffset, limit)`: it now handles the first batch differently by returning null (not `previousOffset`) when no data is available, matching DSv1's `getStartingOffsetFromSpecificDeltaVersion` behavior.
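A minimal, hedged sketch of the first-batch handling described above; the field name and behavior follow the bullet points, while `computeStartingOffset` and `admitFilesUpTo` are hypothetical placeholders rather than the actual implementation.

```java
import org.apache.spark.sql.connector.read.streaming.Offset;
import org.apache.spark.sql.connector.read.streaming.ReadLimit;

// Sketch of the relevant members of a SupportsAdmissionControl micro-batch stream.
public abstract class FirstBatchSketch {

  // True once initialOffset() has run, i.e. the query has no checkpointed offset.
  private boolean isFirstBatch = false;

  public Offset initialOffset() {
    // No checkpointed offset exists, so the stream is starting from scratch.
    isFirstBatch = true;
    return computeStartingOffset(); // hypothetical: from startingVersion/startingTimestamp
  }

  public Offset latestOffset(Offset startOffset, ReadLimit limit) {
    Offset admitted = admitFilesUpTo(startOffset, limit); // hypothetical admission control
    if (admitted == null) {
      // On the first batch, return null rather than the previous offset when no data is
      // available, matching DSv1's getStartingOffsetFromSpecificDeltaVersion behavior.
      return isFirstBatch ? null : startOffset;
    }
    isFirstBatch = false;
    return admitted;
  }

  protected abstract Offset computeStartingOffset();                       // hypothetical
  protected abstract Offset admitFilesUpTo(Offset start, ReadLimit limit); // hypothetical
}
```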


## How was this patch tested?
Parameterized tests verifying parity between DSv1 (DeltaSource) and DSv2
(SparkMicroBatchStream).

## Does this PR introduce _any_ user-facing changes?

@zikangh zikangh requested a review from gengliangwang December 5, 2025 19:25
@zikangh zikangh requested a review from huan233usc December 5, 2025 19:25
}

@Test
public void testGetFileChanges_StartingVersionAfterCheckpointAndLogCleanup(@TempDir File tempDir)
Collaborator:

is this test case removed by mistake?

@zikangh (Author):

Reverted. thanks!

this.scalaOptions = Objects.requireNonNull(scalaOptions, "scalaOptions is null");

// Initialize snapshot at source init to get table ID, similar to DeltaSource.scala
Snapshot snapshotAtSourceInit = snapshotManager.loadLatestSnapshot();
Collaborator:

SparkScan already has initialSnapshot available. The snapshot might be loaded redundantly.

@zikangh (Author):

Removed. This was brought in by mistake after resolving merge conflicts with master.

@zikangh zikangh requested a review from gengliangwang December 6, 2025 03:14
harperjiang pushed a commit to harperjiang/delta that referenced this pull request Dec 8, 2025

…#5498)

@ParameterizedTest
@MethodSource("planInputPartitionsParameters")
public void testPlanInputPartitions_DataParity(
Collaborator:

nit: naming testPlanInputPartitions_dataParity

@gengliangwang gengliangwang merged commit fd1fbbe into delta-io:master Dec 9, 2025
19 checks passed
TimothyW553 pushed a commit to TimothyW553/delta that referenced this pull request Dec 10, 2025

…#5498)
TimothyW553 pushed a commit to TimothyW553/delta that referenced this pull request Dec 10, 2025

…for dsv2 streaming (delta-io#5499)

* Calculate the maximum split bytes for file partitioning, considering total bytes and file
* count. This is used for optimal file splitting in both batch and streaming read.
*/
public static long calculateMaxSplitBytes(
Collaborator:

Why can we not re-use FilePartition.maxSplitBytes()?
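For context, a hedged sketch of the split-size math such a method would typically perform. It mirrors the formula in Spark's `FilePartition.maxSplitBytes` (the reviewer's suggested alternative) and is consistent with the test assertions quoted later in this thread; the parameter names are illustrative, not the PR's actual signature.

```java
public final class MaxSplitBytesSketch { // hypothetical class name
  private MaxSplitBytesSketch() {}

  /** Illustrative only: mirrors Spark's FilePartition.maxSplitBytes formula. */
  public static long calculateMaxSplitBytes(
      long filesMaxPartitionBytes, // spark.sql.files.maxPartitionBytes
      long filesOpenCostInBytes,   // spark.sql.files.openCostInBytes
      int minPartitionNum,         // spark.sql.files.minPartitionNum or default parallelism
      long totalFileBytes,         // sum of the sizes of the files to read
      int fileCount) {
    // Every file is charged an extra openCostInBytes so that many tiny files are not
    // packed into one huge partition.
    long totalBytes = totalFileBytes + (long) fileCount * filesOpenCostInBytes;
    long bytesPerCore = totalBytes / Math.max(1, minPartitionNum);
    // Clamp between the open cost (lower bound) and the configured max partition size.
    return Math.min(filesMaxPartitionBytes, Math.max(filesOpenCostInBytes, bytesPerCore));
  }
}
```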

partitionRow,
SparkPath.fromUrlString(tablePath + addFile.getPath()),
/* start= */ 0L,
/* length= */ addFile.getSize(),
Collaborator:

Why do we always read to the end?

Collaborator:

Oh I see, we never really split a single add file, right?

assertTrue(result > 0);
assertTrue(result >= sqlConf.filesOpenCostInBytes());
assertTrue(result <= sqlConf.filesMaxPartitionBytes());
long calculatedTotalBytes = totalBytes + (long) fileCount * sqlConf.filesOpenCostInBytes();
Collaborator:

Why don't we compare with the results from FilePartition.maxSplitBytes()?
