New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Index out of bounds error when a col has all different value #8
Comments
Can you please share a minimal example that reproduces this issue? |
Add data.csv like this for example: and test.py like this: from datacleaner import autoclean
import pandas as pd
raw_data = pd.read_csv("data.csv")
clean_data = autoclean(raw_data)
clean_data.to_csv("new_data.csv", sep=',', index=False) and execute it and get the error like this:
|
That does indeed seem like a bug, albeit a strange one! Can you send a PR with a patch to fix it? |
Merged the PR - thanks for your help! |
datacleaner v0.1.5 has your changes. |
Hi, just want to check in, does this issue solved? I have an exact bug as yours, how did you address it in the end? Many thanks! @fndjjx |
Hi
I find a issue in datacleaner. When I use this tool to deal with my dataset, it generates a index out of bounds error. I check the code and I find this row in function autoclean:
when a col has no same value, the mode will return empty, so the index will out of bound.
I think this is the reason, could you confirm it. Thank you!
The text was updated successfully, but these errors were encountered: