.csv input errors: "key error 'id'" and length of data #10

Closed
SanniH opened this issue Apr 22, 2021 · 5 comments

Comments

@SanniH

SanniH commented Apr 22, 2021

Hello again!

So I went to run DnoisE with a .csv input and output, but got an error similar to the one I had with a fasta, where somewhere along the code it no longer recognizes my naming. Since the error seemed to point in that direction, I tried running my csv inputs with the OTU/ESV identifier column named "id". With that it ran smoothly until it began to write the output dir, when it hit "ValueError: Length of passed values is 57, index implies 56.". My .csv contains a total of 55 columns, with sample columns starting at 4 and ending at 55. Not sure what happened there along the way... This happens with all of my datasets: the input ncol is one less than the length the index implies, and the length of passed values is 2 more than the original ncol.

I've attached a zip file containing my updated .csv (the one with "id" as the identifier col name), the error I got for the id key error, and the full output of my batch job from when DnoisE starts to write the outputs. I'm hoping it's something as simple as me not saving my csv file right!

csv_input.zip

The key error is not that big a deal, but I thought I'd add it here just in case you want it to be customisable for users, though personally if your future how-to guide tells users to have it named "id" then that solves that problem! :)
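For anyone who wants to rule out the input file itself before running, the two things mentioned above (an identifier column named "id" and a consistent number of fields per row) can be checked with a few lines of standard-library Python. This is only a sketch and not part of DnoisE: the function name `check_csv`, the toy table, and its column names are made up for illustration, and the reported line numbers assume no quoted fields span several lines.

```python
import csv
import io

def check_csv(handle, id_column="id"):
    """Return a list of human-readable problems found in a CSV table."""
    reader = csv.reader(handle)
    header = next(reader, None)
    if header is None:
        return ["file is empty"]
    problems = []
    # The identifier column must be present under the expected name.
    if id_column not in header:
        problems.append(f"no column named {id_column!r}")
    # Every data row should have as many fields as the header.
    for line_no, row in enumerate(reader, start=2):
        if len(row) != len(header):
            problems.append(
                f"line {line_no}: {len(row)} fields, header has {len(header)}"
            )
    return problems

# Toy table: the identifier column is called "otu" and one row has an extra field.
sample = io.StringIO("otu,count,sample_1\nseq1,10,10\nseq2,5,5,0\n")
print(check_csv(sample))
```

On a real file, the same function can be called on a handle opened with `open(path, newline="")`. An empty list means neither of the two problems was found, which points away from the way the csv was saved.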

@adriantich
Owner

adriantich commented Apr 22, 2021 via email

@adriantich
Owner

Hello Sanni!
I think it's fixed now. I ran it with your data and it worked well. Please also check the installation of the pandas module on your computer: line 21 of 'slurm-931755.out' has a warning about it.
..../site-packages/pandas/compat/__init__.py:120: UserWarning: Could not import the lzma module. Your installed Python is incomplete. Attempting to use lzma compression will result in a RuntimeError.
That was not the cause of the problem, but I mention it just in case.
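A quick way to see whether the Python behind that warning really lacks lzma support is to try importing the module directly. This is a minimal sketch, and the helper name `module_available` is made up here rather than taken from pandas or DnoisE.

```python
import importlib

def module_available(name):
    """Return True if the named module can be imported by this Python."""
    try:
        importlib.import_module(name)
    except ImportError:
        # Also covers ModuleNotFoundError, which is a subclass of ImportError.
        return False
    return True

print("lzma available:", module_available("lzma"))
```

If this prints False, the interpreter was most likely built without the lzma library, which only matters when reading or writing .xz-compressed files.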

Thanks a lot for your comments. They are really helpful!

Adrià Antich

@SanniH
Author

SanniH commented Apr 22, 2021 via email

@adriantich
Owner

Hello Sanni,
I have just pushed one last update. If you installed with git, please run git pull to update again. It is not a major change, but just in case ;)
Please tell me whether it runs well or not!

Cheers,
A.

@SanniH
Author

SanniH commented Apr 23, 2021

Hi,

It ran smoothly, no issues :)

Thanks!!
Sanni
