DataFrame distinct with multiple column does not work #20
Closed
Description
I am trying to use distinct with a multi-column selector on a DataFrame, but the result does not contain only distinct rows. Distinct with a single-column selector works fine. Example:
df.distinct(row => [row.columnA, row.columnB]).toArray()
I even tried this:
df.distinct(row => ({ columnA: row.columnA, columnB: row.columnB })).toArray()
Both return the same number of rows as the original DataFrame; I was expecting only the distinct rows.
Single-column select:
df.distinct(row => row.columnA).toArray() works perfectly fine.
I know I am missing something in the multi-column selector. Could someone help? Thanks in advance.
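One possible explanation, offered as an assumption rather than a confirmed diagnosis: in JavaScript, every array or object literal is a new reference, so two selector results with identical contents never compare as equal, whereas primitives such as strings and numbers compare by value. The sketch below illustrates this with plain arrays only. The `rows` data and the `distinctBy` helper are hypothetical and are not part of the library's API; the helper merely stands in for any de-duplication that compares keys by identity.

```javascript
// Hypothetical sample rows; columnA/columnB mirror the names used in the question.
const rows = [
    { columnA: 1, columnB: "x" },
    { columnA: 1, columnB: "x" },
    { columnA: 2, columnB: "y" },
];

// Two arrays with equal contents are still two different objects.
const sameContentsAreEqual = [1, "x"] === [1, "x"]; // false

// Hypothetical helper (not a library function): keeps the first item seen for each key.
// A Set compares object keys by reference and primitive keys by value.
function distinctBy(items, selector) {
    const seen = new Set();
    return items.filter(item => {
        const key = selector(item);
        if (seen.has(key)) {
            return false;
        }
        seen.add(key);
        return true;
    });
}

// Array keys: every key is a fresh object, so nothing is ever treated as a duplicate.
const byArrayKey = distinctBy(rows, row => [row.columnA, row.columnB]);

// String keys: rows with equal column values produce equal keys.
const byStringKey = distinctBy(rows, row => JSON.stringify([row.columnA, row.columnB]));

console.log(byArrayKey.length);  // 3
console.log(byStringKey.length); // 2
```

If the DataFrame's distinct compares selector results in a similar way, which is an assumption here, a selector that returns a primitive composite key, for example row => JSON.stringify([row.columnA, row.columnB]), may be worth trying.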