Closed
Labels
- Enhancement
- Needs Info: Clarification about behavior needed to assess issue
- Needs Triage: Issue that has not been reviewed by a pandas team member
Description
Feature Type
- Adding new functionality to pandas
- Changing existing functionality in pandas
- Removing existing functionality in pandas
Problem Description
I wish that I could use the duplicated() function on data that contains NA values.
Feature Description
Add a new parameter to the duplicated() function:
keepna : {False, True}, default False
When keepna=False, the current behavior is unchanged.
When keepna=True, duplicated() would flag the duplicate rows and would also include every row that has an NA value in any column of the specified subset.
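To make the request concrete, here is a minimal sketch of one possible reading of keepna=True: a row is flagged if it is a duplicate or if it has an NA value in any subset column. The helper name duplicated_keepna is hypothetical and is not part of pandas; the keepna parameter exists only in this proposal, and other readings of the description are possible.

```python
import numpy as np
import pandas as pd
from pandas.api.types import is_list_like


def duplicated_keepna(df, subset=None, keep="first", keepna=False):
    """Hypothetical wrapper illustrating one reading of the proposed keepna flag."""
    # Existing pandas behavior: boolean mask of duplicate rows.
    mask = df.duplicated(subset=subset, keep=keep)
    if keepna:
        # Normalize subset to a list of column labels.
        if subset is None:
            cols = list(df.columns)
        elif is_list_like(subset):
            cols = list(subset)
        else:
            cols = [subset]
        # Assumed semantics: also flag rows with NA in any subset column.
        mask = mask | df[cols].isna().any(axis=1)
    return mask


df = pd.DataFrame(
    {"a": [1.0, 1.0, np.nan, 2.0], "b": ["x", "x", "y", "z"]}
)
with_na = duplicated_keepna(df, subset=["a", "b"], keepna=True)
without_na = duplicated_keepna(df, subset=["a", "b"], keepna=False)
```

With keepna=False the helper simply returns what df.duplicated() returns, which matches the "works as is" default described above.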
Alternative Solutions
The alternative solution I currently use is to:
1. overwrite the NA values with a particular constant value;
2. execute the duplicated() function; and then,
3. change all of the constant values from step 1 back to NaN.
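The three steps above can be sketched roughly as follows. The column names, the sample data, and the SENTINEL value are assumptions made for illustration; the sentinel must be a value that never occurs in the real data.

```python
import numpy as np
import pandas as pd

# Illustrative data: column "a" contains NA values.
df = pd.DataFrame(
    {"a": [1.0, 1.0, np.nan, np.nan], "b": ["x", "x", "y", "y"]}
)

SENTINEL = -1.0  # hypothetical placeholder; must not appear in real data

# Step 1: overwrite NA values with the constant.
df["a"] = df["a"].fillna(SENTINEL)

# Step 2: run duplicated() on the filled data.
dup_mask = df.duplicated(subset=["a", "b"])

# Step 3: change the constant back to NaN.
df["a"] = df["a"].replace(SENTINEL, np.nan)
```

Working on a copy (for example df.fillna(...) without assigning back) would avoid step 3 entirely, since only the boolean mask is needed afterwards.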
Additional Context
No response