-
Notifications
You must be signed in to change notification settings - Fork 28.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-50655][SS] Move virtual col family related mapping into db layer instead of encoder #49304
Conversation
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala
Outdated
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBFileManager.scala
Outdated
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala
Outdated
Show resolved
Hide resolved
@HeartSaVioR - done ! PTAL |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Will continue with reviewing tests, but I see something to be really careful.
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateStore.scala
Outdated
Show resolved
Hide resolved
...core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBStateEncoder.scala
Outdated
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala
Outdated
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala
Outdated
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBFileManager.scala
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBFileManager.scala
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done another round of review.
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateStore.scala
Outdated
Show resolved
Hide resolved
...rc/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBStateStoreProvider.scala
Show resolved
Hide resolved
...rc/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBStateStoreProvider.scala
Outdated
Show resolved
Hide resolved
...rc/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBStateStoreProvider.scala
Outdated
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDB.scala
Show resolved
Hide resolved
sql/core/src/test/scala/org/apache/spark/sql/execution/streaming/state/RocksDBSuite.scala
Outdated
Show resolved
Hide resolved
sql/core/src/test/scala/org/apache/spark/sql/execution/streaming/state/RocksDBSuite.scala
Show resolved
Hide resolved
sql/core/src/test/scala/org/apache/spark/sql/execution/streaming/state/RocksDBSuite.scala
Show resolved
Hide resolved
sql/core/src/test/scala/org/apache/spark/sql/execution/streaming/state/RocksDBSuite.scala
Show resolved
Hide resolved
sql/core/src/test/scala/org/apache/spark/sql/execution/streaming/state/RocksDBSuite.scala
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 pending CI
Thanks! Merging to master/4.0. |
…er instead of encoder ### What changes were proposed in this pull request? Move virt col family related mapping into db layer instead of encoder ### Why are the changes needed? Keep abstraction clear around ownership and also expose internal/non-internal key metrics correctly. With this change, we have the following: - encoder is only responsible for managing encoding based on type such as noPrefix, prefix, range etc - the onus of maintaining virtual col families is now with the underlying DB layer - this layer can now also expose metrics for internal as well as non-internal column families ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Existing Unit tests and added unit tests ``` [info] Run completed in 8 minutes, 48 seconds. [info] Total number of tests run: 305 [info] Suites: completed 1, aborted 0 [info] Tests: succeeded 305, failed 0, canceled 0, ignored 0, pending 0 [info] All tests passed. ``` ### Was this patch authored or co-authored using generative AI tooling? No Closes #49304 from anishshri-db/task/SPARK-50655. Authored-by: Anish Shrigondekar <[email protected]> Signed-off-by: Jungtaek Lim <[email protected]> (cherry picked from commit 5759882) Signed-off-by: Jungtaek Lim <[email protected]>
…er instead of encoder ### What changes were proposed in this pull request? Move virt col family related mapping into db layer instead of encoder ### Why are the changes needed? Keep abstraction clear around ownership and also expose internal/non-internal key metrics correctly. With this change, we have the following: - encoder is only responsible for managing encoding based on type such as noPrefix, prefix, range etc - the onus of maintaining virtual col families is now with the underlying DB layer - this layer can now also expose metrics for internal as well as non-internal column families ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Existing Unit tests and added unit tests ``` [info] Run completed in 8 minutes, 48 seconds. [info] Total number of tests run: 305 [info] Suites: completed 1, aborted 0 [info] Tests: succeeded 305, failed 0, canceled 0, ignored 0, pending 0 [info] All tests passed. ``` ### Was this patch authored or co-authored using generative AI tooling? No Closes apache#49304 from anishshri-db/task/SPARK-50655. Authored-by: Anish Shrigondekar <[email protected]> Signed-off-by: Jungtaek Lim <[email protected]>
…er instead of encoder ### What changes were proposed in this pull request? Move virt col family related mapping into db layer instead of encoder ### Why are the changes needed? Keep abstraction clear around ownership and also expose internal/non-internal key metrics correctly. With this change, we have the following: - encoder is only responsible for managing encoding based on type such as noPrefix, prefix, range etc - the onus of maintaining virtual col families is now with the underlying DB layer - this layer can now also expose metrics for internal as well as non-internal column families ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Existing Unit tests and added unit tests ``` [info] Run completed in 8 minutes, 48 seconds. [info] Total number of tests run: 305 [info] Suites: completed 1, aborted 0 [info] Tests: succeeded 305, failed 0, canceled 0, ignored 0, pending 0 [info] All tests passed. ``` ### Was this patch authored or co-authored using generative AI tooling? No Closes apache#49304 from anishshri-db/task/SPARK-50655. Authored-by: Anish Shrigondekar <[email protected]> Signed-off-by: Jungtaek Lim <[email protected]>
What changes were proposed in this pull request?
Move virt col family related mapping into db layer instead of encoder
Why are the changes needed?
Keep abstraction clear around ownership and also expose internal/non-internal key metrics correctly.
With this change, we have the following:
Does this PR introduce any user-facing change?
No
How was this patch tested?
Existing Unit tests and added unit tests
Was this patch authored or co-authored using generative AI tooling?
No