HIVE-29197: Disable vectorization for multi-column COUNT(DISTINCT)

Open Indhumathi27 opened this issue 3 months ago • 4 comments

What changes were proposed in this pull request?

Disabled vectorized execution for multi-column COUNT(DISTINCT) so queries fall back to row mode for unsupported expressions.

Why are the changes needed?

When a query filters on a partition column and the same column also appears as an argument of a COUNT(DISTINCT ...) UDF, the partition column is replaced with a constant.

Vectorized execution does not support multi-column COUNT(DISTINCT). Falling back to row mode ensures these queries run safely without exceptions.
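For illustration, the query shape described above might look like the HiveQL sketch below. The table `sales` and its columns are hypothetical and do not come from the PR; only `hive.vectorized.execution.enabled` is an existing Hive setting. This shows the pattern (a filter on a partition column that is also an argument of a multi-column COUNT(DISTINCT)), not a confirmed reproduction of the failure.

```sql
-- Hypothetical table for illustration only; names are not taken from the PR.
CREATE TABLE sales (item_id INT, store_id INT)
PARTITIONED BY (region STRING)
STORED AS ORC;

-- Vectorized execution turned on for the session.
SET hive.vectorized.execution.enabled=true;

-- 'region' is filtered in WHERE and is also an argument of a
-- multi-column COUNT(DISTINCT), the combination the PR description mentions.
SELECT COUNT(DISTINCT region, item_id)
FROM sales
WHERE region = 'EU';

-- Assumed session-level workaround on builds without this change:
-- turn vectorization off so the query runs in row mode.
SET hive.vectorized.execution.enabled=false;
```

With the change proposed here, the fallback to row mode would apply to this kind of aggregate automatically, so the session-level setting should not be needed.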

Does this PR introduce any user-facing change?

No

How was this patch tested?

Added test case

Indhumathi27 avatar Oct 03 '25 13:10 Indhumathi27

@ayushtkn / @deniskuzZ / @okumin could you help review this PR? Thanks.

Indhumathi27 avatar Oct 06 '25 08:10 Indhumathi27

hi @Indhumathi27, is this still relevant? Have you had a chance to see the latest comment?

deniskuzZ avatar Oct 22 '25 11:10 deniskuzZ

> hi @Indhumathi27, is this still relevant? Have you had a chance to see the latest comment?

Apologies for the delay, @deniskuzZ. I’ve been busy with other priorities and haven’t checked the comment yet. I’ll get to it and let you know.

Indhumathi27 avatar Oct 22 '25 12:10 Indhumathi27