Skip to content

Conversation

UBarney
Copy link
Contributor

@UBarney UBarney commented Sep 13, 2025

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Refine report_partition_bounds Logic in shared_bounds.rs:

  • The partition parameter has been renamed to left_side_partition_id to clarify that it refers to the partition from the left (build) side.
  • New deduplication logic has been added: it skips reporting bounds if the current left_side_partition_id is identical to the previously recorded one. This directly fixes the issue in CollectLeft mode, where all streams report the same left_side_partition_id of 0, thus preventing duplicate expressions.

Are these changes tested?

Yes

Are there any user-facing changes?

No

@github-actions github-actions bot added the physical-plan Changes to the physical-plan crate label Sep 13, 2025
@github-actions github-actions bot added the core Core DataFusion crate label Sep 15, 2025
@UBarney UBarney marked this pull request as ready for review September 16, 2025 03:49
@UBarney
Copy link
Contributor Author

UBarney commented Sep 16, 2025

FYI @adriangb

@adriangb adriangb merged commit 49d49fd into apache:main Sep 16, 2025
29 checks passed
@adriangb
Copy link
Contributor

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
core Core DataFusion crate physical-plan Changes to the physical-plan crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Consumer receives duplicate bound predicates when join mode is CollectLeft
2 participants