correlation shows label 'index' is not unique error #487

Closed
reza1615 opened this issue May 20, 2021 · 5 comments
Labels
bug Something isn't working

Comments

@reza1615

My index contains only unique values, but the correlation view shows the error below.
In my opinion, even if the index did contain duplicates, it should be forced to be unique so the correlation can still be shown.

image

@reza1615 reza1615 changed the title correlation shows an error correlation shows label 'index' is not unique error May 20, 2021
@aschonfeld aschonfeld added the bug Something isn't working label May 21, 2021
@aschonfeld
Collaborator

@reza1615 can you give me some more information about the structure of your data?

  • is this timeseries data?
  • or is this a correlation of two columns as a whole?

@reza1615
Author

1. It is tabular data for binary classification, similar to housing data.
2. Yes, it is a correlation of two columns as a whole, and the error appears for all columns.

@reza1615
Author

I figured out that the error goes away when I drop the index.
As an on-the-fly fix, you could run the snippet below before running the correlation code:
df.index = range(len(df))
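
For reference, here is a minimal sketch of that workaround on a small made-up DataFrame. The column names and values are hypothetical and only for illustration. `reset_index(drop=True)` is the equivalent built-in pandas call:

```python
import pandas as pd

# Hypothetical data with a non-default index (illustration only).
df = pd.DataFrame(
    {"price": [100, 150, 120], "rooms": [2, 3, 2]},
    index=[10, 20, 30],
)

# Workaround from the comment above: overwrite the index with 0..n-1 in place.
df.index = range(len(df))

# Equivalent built-in call; returns a new frame and discards the old index.
df_reset = df.reset_index(drop=True)
```

Both forms leave the frame with a plain positional index, so no old index values are carried along into later processing.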

@aschonfeld
Collaborator

The issue was that when calculating the scatter I created a temporary column called index. If the data already had a column named index that was being used in the correlation, the frame ended up with two columns of the same name, and sorting fails on non-unique column names.

I have since updated the code to use _scatter_index, which should be far less commonly used (although it may still cause a non-unique issue if the data has a column of that name).
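
The name clash described here can be reproduced with plain pandas. The sketch below is not dtale's actual implementation: the data, the use of `pd.concat` to add the temporary column, and the variable names are all assumptions made for illustration. Only the column names index and _scatter_index come from this thread.

```python
import pandas as pd

# Hypothetical data that already has a column named "index" (illustration only).
df = pd.DataFrame({"index": [3, 1, 2], "price": [100, 150, 120]})

# Simulate adding a temporary column whose name clashes with an existing one.
temp = pd.Series(range(len(df)), name="index")
clashing = pd.concat([df, temp], axis=1)

# Sorting by a label that matches two columns raises a ValueError.
try:
    clashing.sort_values("index")
    error_message = None
except ValueError as exc:
    error_message = str(exc)

# A less common temporary name avoids the clash for this data.
temp_safe = temp.rename("_scatter_index")
safe = pd.concat([df, temp_safe], axis=1)
sorted_safe = safe.sort_values("_scatter_index")
```

The same failure would come back if the data happened to contain a column named _scatter_index, which is the caveat mentioned above.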

aschonfeld added a commit that referenced this issue May 24, 2021
* #486: make 15K point limitation correlations scatter an editable setting
* #487: fix for non-unique column exception in correlation scatter
aschonfeld added a commit that referenced this issue May 28, 2021
* #486: make 15K point limitation correlations scatter an editable setting
* #487: fix for non-unique column exception in correlation scatter
@aschonfeld
Collaborator

fixed in v1.48.0
