[native] Fail-fast for file formats unsupported by hive connector #25147

Open · wants to merge 1 commit into master
Conversation

@pramodsatya (Contributor) commented May 20, 2025

Description

Presto C++ only supports reading tables with the DWRF, ORC, and PARQUET file formats through the hive connector. Using the config native-execution-enabled, we can fail fast at the coordinator when attempting to read from tables with file formats unsupported by Presto C++.

Motivation and Context

Currently, attempting to read from a table with an unsupported file format in Presto C++ fails at the worker:

it != readerFactories().end() ReaderFactory is not registered for format text

These missing reader factories can be detected at the coordinator itself, instead of sending the splits to workers only to fail there.

== NO RELEASE NOTE ==

@prestodb-ci prestodb-ci added the from:IBM PR from IBM label May 20, 2025
@pramodsatya pramodsatya marked this pull request as ready for review May 20, 2025 15:20
@pramodsatya pramodsatya requested a review from a team as a code owner May 20, 2025 15:20
@pramodsatya pramodsatya requested a review from jaystarshot May 20, 2025 15:20
@prestodb-ci prestodb-ci requested review from a team, sh-shamsan and pdabre12 and removed request for a team May 20, 2025 15:20
@pramodsatya pramodsatya requested review from tdcmeehan, aditi-pandit, a team and nishithakbhaskaran and removed request for sh-shamsan, pdabre12 and a team May 20, 2025 15:20
if (connectorSystemConfig.isNativeExecution()) {
    StorageFormat storageFormat = table.getStorage().getStorageFormat();
    Optional<HiveStorageFormat> hiveStorageFormat = getHiveStorageFormat(storageFormat);
    if (hiveStorageFormat.isPresent() && !(hiveStorageFormat.equals(Optional.of(DWRF))
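Since the diff snippet above is truncated mid-condition, here is a minimal, hypothetical sketch of the fail-fast idea the PR describes: reject a read at the coordinator when native execution is enabled and the table's storage format is not one of the three formats Presto C++ can read (DWRF, ORC, PARQUET, per the description). The class and method names below are illustrative, not the PR's actual code.

```java
import java.util.Set;

public class NativeFormatCheck
{
    // Per the PR description: formats Presto C++ can read via the hive connector.
    private static final Set<String> NATIVE_READABLE_FORMATS = Set.of("DWRF", "ORC", "PARQUET");

    // Returns true when the coordinator should reject the read up front
    // rather than ship splits to native workers that cannot read them.
    public static boolean shouldFailFast(boolean nativeExecutionEnabled, String storageFormat)
    {
        return nativeExecutionEnabled
                && !NATIVE_READABLE_FORMATS.contains(storageFormat.toUpperCase());
    }

    public static void main(String[] args)
    {
        System.out.println(shouldFailFast(true, "TEXTFILE"));
        System.out.println(shouldFailFast(true, "PARQUET"));
    }
}
```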
A reviewer (Contributor) commented on this code:
Let's add this in the Hive configs. By default, it is empty, which means whatever is available in Hive is fine. It can be a set of comma separated values.
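The reviewer's suggestion could be sketched roughly as follows: a hypothetical helper that parses a comma-separated Hive config value into a set of format names, where an empty value means no restriction (whatever is available in Hive is fine). The class and method names are assumptions for illustration.

```java
import java.util.Arrays;
import java.util.Set;
import java.util.stream.Collectors;

public class ReadableFormatsConfig
{
    // Hypothetical parser for a comma-separated config value such as
    // "DWRF,ORC,PARQUET". An empty result set means "no restriction".
    public static Set<String> parseFormats(String value)
    {
        if (value == null || value.isBlank()) {
            return Set.of();
        }
        return Arrays.stream(value.split(","))
                .map(String::trim)
                .map(String::toUpperCase)
                .collect(Collectors.toUnmodifiableSet());
    }
}
```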

@aditi-pandit (Contributor) commented:

@pramodsatya: Thanks for this code. Should we add a check for the file formats applicable on the writer side as well? Native execution only supports DWRF and Parquet writers.
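A writer-side check along the lines aditi-pandit suggests might look like this hypothetical sketch, assuming (per the comment) that native execution only supports DWRF and Parquet writers. Names are illustrative, not actual PR code.

```java
import java.util.Set;

public class NativeWriterCheck
{
    // Per the review comment: native execution only supports these writers.
    private static final Set<String> NATIVE_WRITABLE_FORMATS = Set.of("DWRF", "PARQUET");

    // Returns true when a write to this format should be rejected at the
    // coordinator under native execution.
    public static boolean shouldFailFastOnWrite(boolean nativeExecutionEnabled, String storageFormat)
    {
        return nativeExecutionEnabled
                && !NATIVE_WRITABLE_FORMATS.contains(storageFormat.toUpperCase());
    }
}
```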
