HIVE-26653: Wrong results when (map) joining multiple tables on partition column #6165
base: master
Conversation
zabetak left a comment:
I have some more high level questions/comments regarding the bug/solution but will post those under the JIRA ticket.
    smallTableValueRow[c] =
        VectorizedBatchUtil.getPrimitiveWritable(primitiveTypeInfos[c].getPrimitiveCategory());
Which real use case is this code trying to simulate? Is it equivalent to having null values in the data, or something else? Basically, I am trying to understand under what circumstances we have these values in the table.
From a quick look, it seems that we are using special values (e.g., new Text(ArrayUtils.EMPTY_BYTE_ARRAY)) but not real nulls. If that's the case, then I don't see why we need to handle this separately from VectorRandomRowSource.randomWritablePrimitiveRow. Wouldn't it make more sense to tune the random generator to occasionally generate these "special" values if they can really appear in practice?
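For readers following along, the empty-value vs. null distinction the comment raises can be illustrated in plain Java without any Hadoop dependency; the class and method names below are hypothetical stand-ins, not Hive code:

```java
// Illustrative sketch: an empty byte payload (what new
// Text(ArrayUtils.EMPTY_BYTE_ARRAY) carries) is a present-but-empty value,
// which is not the same thing as a true SQL NULL (an absent value).
public class EmptyVsNull {

    // Classify a raw value the way a (de)serializer would see it.
    static String classify(byte[] value) {
        if (value == null) {
            return "NULL";   // absent value, i.e. a true SQL NULL
        }
        if (value.length == 0) {
            return "EMPTY";  // present but zero-length, e.g. ''
        }
        return "VALUE";      // ordinary non-empty payload
    }

    public static void main(String[] args) {
        System.out.println(classify(null));           // NULL
        System.out.println(classify(new byte[0]));    // EMPTY
        System.out.println(classify(new byte[] {1})); // VALUE
    }
}
```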
    ONLY_ONE,
    NO_REGULAR_SMALL_KEYS,
    EMPTY_VALUE, // Generate empty value entries.
By going over the code, I get the impression that the ValueOption enumeration is about the generation of the keys of the small table, not the values. Mixing the two creates confusion and makes the code harder to understand.
    final boolean isEmptyValue =
        testDesc.smallTableGenerationParameters.getValueOption() == ValueOption.EMPTY_VALUE &&
        testDesc.smallTableRetainValueColumnNums.length > 0 &&
        testDesc.smallTableRetainValueColumnNums.length == testDesc.bigTableKeyColumnNums.length;
This simulation is problematic because it makes the solution and the test code somewhat identical. We're implementing the copy logic in two places (prod & test), so the tests will trivially pass as things stand right now and immediately fail if the implementation changes in the future.
     * @throws Exception Exception
     */
    @Test
    public void testSmallTableKeyOnlyProjectionWithEmptyValueString() throws Exception {
Adding tests in this class is useful, but it may not be the best option in every situation. These tests depend on random generation of input/output, and they are good for covering the general behavior of the join operators, but for edge cases and very specific bugs, fixed input/output and a fixed join configuration would be much easier to reason about.
For showcasing the bug in this PR (if there is one), it would really help to have a dedicated test case, possibly in another class, with well-defined and minimal input/output and join settings. Then we can discuss whether we also need these randomized tests. The bug implies a problem in a binary join operator, so we should be able to demonstrate the issue by correctly picking the schema/data for the left and right side of the join, with a few rows on each side.
        throws HiveException {

    // Check if the small table value is empty.
    boolean isSmallTableValueEmpty = byteSegmentRef.getLength() == 0;
The fact that we need to check the actual data in order to decide how to evaluate the join (or rather the creation of the resulting row) is somewhat suspicious and a bit brittle. Ideally, the compiler should be able to determine exactly how the operator should behave via the query plan. Can we exploit (or add) information in the query plan in order to drive the copy decision below?
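For context, the runtime decision the diff introduces can be sketched in isolation. This is a simplified, self-contained sketch under assumed semantics; all type and method names are stand-ins, not the actual Hive classes:

```java
// Sketch of the copy decision under discussion: when the serialized
// small-table value is empty and only the small table's key is projected,
// copy the big-table key (equal by the inner-join condition) instead of
// deserializing the empty payload into NULLs.
public class SmallTableValueCopySketch {

    // Returns the projected small-table columns for one matched row.
    static long[] projectSmallTable(long[] bigTableKey, byte[] serializedValue) {
        if (serializedValue.length == 0) {
            // Empty value: the projected columns are exactly the join keys.
            return bigTableKey.clone();
        }
        return deserialize(serializedValue);
    }

    // Stub deserializer: one long column per 8 bytes (illustrative only).
    static long[] deserialize(byte[] bytes) {
        return new long[bytes.length / 8];
    }

    public static void main(String[] args) {
        long[] key = {7L, 42L};
        long[] projected = projectSmallTable(key, new byte[0]);
        System.out.println(projected[0] + "," + projected[1]); // 7,42
    }
}
```

The reviewer's point stands regardless of the sketch: the branch is driven by inspecting the data at runtime rather than by plan-time information.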
    if (smallTableValueMapping.getCount() > 0 &&
        smallTableValueMapping.getCount() == bigTableKeyColumnMap.length) {
I don't understand the reasoning/intuition behind this check. Why do we care about values and keys being of the same length?
What changes were proposed in this pull request?
In INNER joins, when we only project the small table's joining key, we can run into a situation where the hash map's value is empty. Then, if we serialize the empty value, we will get NULLs. Instead, we should just copy the key into the vectorized batch.
Why are the changes needed?
Explained in detail: https://issues.apache.org/jira/browse/HIVE-26653
Does this PR introduce any user-facing change?
No
How was this patch tested?
Added unit tests in TestMapJoinOperator.java. The original issue is not reproducible anymore because of an unrelated patch, as explained in the Jira.
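As a toy reproduction of the symptom described above (assumed semantics, plain Java, no Hive classes): deserializing an empty hash-map value yields a NULL column, while copying the join key yields the expected value.

```java
// Hypothetical illustration of the bug's symptom: an empty serialized value
// naively deserializes to null, whereas the fix copies the matched join key.
public class EmptyValueSymptom {

    // Naive path: read one nullable column from the serialized value;
    // an empty payload produces null (the wrong result in the bug report).
    static Integer naiveReadColumn(byte[] serializedValue) {
        return serializedValue.length == 0 ? null : (int) serializedValue[0];
    }

    public static void main(String[] args) {
        int joinKey = 5;                   // the matched key from the big table
        byte[] hashMapValue = new byte[0]; // only the key was projected

        Integer naive = naiveReadColumn(hashMapValue);       // null
        Integer fixed = (hashMapValue.length == 0)
            ? joinKey                                        // copy the key
            : naive;
        System.out.println("naive=" + naive + " fixed=" + fixed); // naive=null fixed=5
    }
}
```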