Skip to content

fix: rows_distinct() shouldn't drop null rows#399

Merged
rich-iannone merged 5 commits into
mainfrom
fix-rows-distinct-null-drop-rows
Jun 6, 2026
Merged

fix: rows_distinct() shouldn't drop null rows#399
rich-iannone merged 5 commits into
mainfrom
fix-rows-distinct-null-drop-rows

Conversation

@rich-iannone
Copy link
Copy Markdown
Member

This PR addresses the handling of duplicate row detection, especially in the presence of null values. This improves both correctness and cross-backend consistency. The changes update the implementation to treat nulls as equal when checking for duplicates, revise the test expectations accordingly, and enhance test coverage for both Polars and Pandas backends. Additionally, the validation logic is updated to ensure proper grouping even when nulls are present.

Fixes: #397

@rich-iannone rich-iannone merged commit 27fc70c into main Jun 6, 2026
9 checks passed
@rich-iannone rich-iannone deleted the fix-rows-distinct-null-drop-rows branch June 6, 2026 02:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

rows_distinct() silently excludes rows containing null values

1 participant