
Adds ignore_order for groupBy agg test that returns multiple rows [databricks] #14044

Merged
abellina merged 1 commit into NVIDIA:main from abellina:add_ignore_order_for_agg_test
Dec 21, 2025

Conversation

Collaborator

@abellina abellina commented Dec 21, 2025

Fixes #14043

Description

The test test_avg_divide_by_zero performs a groupBy("k") that returns multiple rows (k=0 and k=1), but it does NOT use @ignore_order, which means we have been getting lucky with the ordering so far. #14043 documents a failure where OSS Spark 3.3.0 and Databricks return the rows in a different order. Since the order of the group keys is not deterministic, I added @ignore_order.

The unit test was originally added in #13192.
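To illustrate the effect of the fix: @ignore_order makes the CPU/GPU comparison order-insensitive. A minimal sketch of that idea in plain Python, assuming simple (key, value) row tuples; the helper sorted_rows_equal is hypothetical and not part of the spark-rapids test framework:

```python
# Hypothetical illustration of what @ignore_order effectively does:
# compare two result sets after sorting, so row order does not matter.
# (sorted_rows_equal is an invented helper, not spark-rapids API.)

def sorted_rows_equal(cpu_rows, gpu_rows):
    """Return True when both results contain the same rows, ignoring order."""
    # None-safe sort key: None values sort after non-None values.
    key = lambda row: tuple((v is None, v) for v in row)
    return sorted(cpu_rows, key=key) == sorted(gpu_rows, key=key)

# The two orderings observed in #14043: same contents, different order.
cpu_result = [(0, None), (1, 63.0)]
gpu_result = [(1, 63.0), (0, None)]

assert sorted_rows_equal(cpu_result, gpu_result)
assert cpu_result != gpu_result  # a naive positional comparison would fail
```

Without the decorator the comparison is positional, so any change in which partition a key lands in (and hence which row comes out first) fails the test even though the aggregated values are identical.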

Checklists

  • This PR has added documentation for new or modified features or behaviors.
  • This PR has added new tests or modified existing tests to cover new code paths.
    (Please explain in the PR description how the new code paths are tested, such as names of the new/existing tests that cover them.)
  • Performance testing has been performed and its results are added in the PR description. Or, an issue has been filed with a link in the PR description.

Signed-off-by: Alessandro Bellina <abellina@nvidia.com>
@abellina
Collaborator Author

build

@greptile-apps
Contributor

greptile-apps bot commented Dec 21, 2025

Greptile Summary

This PR fixes a test flakiness issue by adding the @ignore_order decorator to test_avg_divide_by_zero. The test performs a groupBy("k") aggregation that returns multiple rows, but the ordering of grouped results is non-deterministic in Spark.

Key changes:

  • Added @ignore_order decorator to test_avg_divide_by_zero in hash_aggregate_test.py:2888
  • Fixes test failures on Databricks where GPU and CPU return different orderings for the groupBy key column (k)
  • Aligns with the pattern used by other groupBy aggregation tests in the same file

The fix is minimal, correct, and follows established conventions in the codebase. The aggregation logic and computed values (avg=63.0) are correct on both GPU and CPU; only the row ordering differed.

Confidence Score: 5/5

  • This PR is safe to merge with no risk
  • The change is a single-line addition of a test decorator that is widely used throughout the test file for the same purpose. The fix correctly addresses a real test failure caused by non-deterministic groupBy ordering, which is expected behavior in Spark. No logic changes, no new functionality - just proper test annotation.
  • No files require special attention

Important Files Changed

Filename | Overview
integration_tests/src/main/python/hash_aggregate_test.py | Added @ignore_order decorator to fix non-deterministic ordering in groupBy aggregation test

Sequence Diagram

sequenceDiagram
    participant Test as test_avg_divide_by_zero
    participant Spark as Spark Engine
    participant GPU as GPU Executor
    participant CPU as CPU Executor
    
    Test->>Spark: Create DataFrame (id % 2 as k, id as v)
    Test->>Spark: Execute groupBy("k").agg(avg(...))
    
    par GPU Execution
        Spark->>GPU: Execute query
        GPU->>GPU: Group by k (0, 1)
        GPU->>GPU: Compute avg(CASE WHEN k>0...)
        Note over GPU: Returns rows in order:<br/>[k=1, avg=63.0]<br/>[k=0, avg=None]
        GPU-->>Test: GPU Result
    and CPU Execution
        Spark->>CPU: Execute query
        CPU->>CPU: Group by k (0, 1)
        CPU->>CPU: Compute avg(CASE WHEN k>0...)
        Note over CPU: Returns rows in order:<br/>[k=0, avg=None]<br/>[k=1, avg=63.0]
        CPU-->>Test: CPU Result
    end
    
    Test->>Test: Compare results with @ignore_order
    Note over Test: Order differences ignored<br/>Test passes ✓

@sameerz sameerz added the test (Only impacts tests) label Dec 21, 2025
Collaborator

@sameerz sameerz left a comment


Would be good to know the root cause of the ordering change in our underlying systems. Approving given the change is in tests only, and the ordering should not matter.

@sameerz
Collaborator

sameerz commented Dec 21, 2025

Possibly related: an upgrade to CCCL 3.2 NVIDIA/spark-rapids-jni#4094

@abellina abellina merged commit e95ea4c into NVIDIA:main Dec 21, 2025
51 checks passed
@ttnghia
Collaborator

ttnghia commented Dec 22, 2025

Would be good to know the root cause of the ordering change in our underlying systems. Approving given the change is in tests only, and the ordering should not matter.

I've bisected the cudf commits and can confirm that this is due to rapidsai/cudf#20796, which changes the behavior of hash-partitioning.
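For intuition about how a hash-partitioning change reorders output without changing its contents: rows are bucketed by the hash of the group key, and output order follows partition order, so a different hash function yields the same rows in a different sequence. A minimal sketch, assuming two partitions and toy hash functions that are stand-ins for the actual cudf/CCCL implementations:

```python
# Sketch: a change to the hash function used for partitioning reorders
# groupBy output while the set of rows stays identical.
# (The lambda "hash functions" below are illustrative stand-ins only.)

NUM_PARTITIONS = 2

def partition(rows, hash_fn):
    """Bucket rows by hash of the group key (row[0]), then concatenate
    the buckets in partition order, mimicking partition-driven output order."""
    buckets = [[] for _ in range(NUM_PARTITIONS)]
    for row in rows:
        buckets[hash_fn(row[0]) % NUM_PARTITIONS].append(row)
    return [row for bucket in buckets for row in bucket]

rows = [(0, None), (1, 63.0)]

old_order = partition(rows, hash_fn=lambda k: k)      # k=0 -> partition 0
new_order = partition(rows, hash_fn=lambda k: k + 1)  # k=0 -> partition 1

assert set(old_order) == set(new_order)  # same rows...
assert old_order != new_order            # ...different order
```

This is why the ordering change is benign for correctness and why @ignore_order is the appropriate fix rather than pinning an expected order in the test.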


Labels

test (Only impacts tests)


Development

Successfully merging this pull request may close these issues.

[BUG] test_avg_divide_by_zero failed for OSS Spark 3.3.0 and across all Databricks runtime versions

4 participants