Support for non mapped accession with taxid #1

Merged: 7 commits, Jan 31, 2017

Contributor

@horkko horkko commented Dec 19, 2016

Hi,

First of all, I found your project very interesting and tried to use it. It works well, except in some cases where an accession number is not mapped to a taxid (see the method `_unmapped_taxid`).
So I've made some modifications to your code to support this case and avoid a stack trace.
I've also tried to speed up data loading a little by using `insert_many` when loading files other than names.dmp and nodes.dmp.
Have a look at the PR and let me know what you think.

Best

Emmanuel
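
The chunked loading described in this comment can be sketched roughly as follows. This is not the PR's code: it swaps in the standard library's `sqlite3.executemany` for the `insert_many` call the PR uses, and the table name, the helper names (`chunked`, `load_accessions`) and the sample rows are all assumptions made for illustration.

```python
import sqlite3
from itertools import islice


def chunked(rows, size):
    """Yield successive lists of at most `size` rows from any iterable."""
    iterator = iter(rows)
    while True:
        batch = list(islice(iterator, size))
        if not batch:
            return
        yield batch


def load_accessions(conn, rows, chunk=500):
    """Insert (accession, taxid) pairs in batches; return the row count."""
    total = 0
    for batch in chunked(rows, chunk):
        # One transaction per batch instead of one per row.
        with conn:
            conn.executemany(
                "INSERT INTO accession (accession, taxid) VALUES (?, ?)",
                batch,
            )
        total += len(batch)
    return total


# Hypothetical schema and made-up sample data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accession (accession TEXT PRIMARY KEY, taxid INTEGER)"
)
rows = ((f"ACC{i:05d}", i % 7) for i in range(1234))
inserted = load_accessions(conn, rows, chunk=500)
```

Batching keeps memory use bounded when the input file is large, and the chunk size is the knob that trades memory for fewer round trips to the database.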

- Support for chunk number during data insertion
…sion number, the table exists in the database

- Also wrapped the call to yield so that, if a taxid is not found in the database (e.g. 0), no stack trace is thrown and
  a short error message is printed on stderr instead
…speed up a little bit

- Chunk value can be passed with 'create' sub option
- If a taxid is not found in the database, a message is shown to the user
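
The behaviour these commits describe for a missing taxid (a short message on stderr instead of a stack trace) might look something like the sketch below. It is only an illustration under stated assumptions: the dictionaries stand in for the database tables, and the names `report_unmapped` and `taxids` as well as the accession values are hypothetical, not taken from the PR.

```python
import sys

# Hypothetical in-memory stand-ins for the database tables (made-up values).
ACCESSION_TO_TAXID = {"ACC00001": 562, "ACC00002": 9606, "ACC00003": 0}
KNOWN_TAXIDS = {562, 9606}


def report_unmapped(accession):
    """Print a short message on stderr instead of raising an exception."""
    print(f"No taxid found for accession {accession}", file=sys.stderr)


def taxids(accessions):
    """Yield (accession, taxid) pairs, skipping accessions that do not map."""
    for acc in accessions:
        taxid = ACCESSION_TO_TAXID.get(acc)
        # Covers both an unknown accession and a taxid (such as 0)
        # that has no entry in the taxonomy table.
        if taxid is None or taxid not in KNOWN_TAXIDS:
            report_unmapped(acc)
            continue
        yield acc, taxid


result = list(taxids(["ACC00001", "MISSING01", "ACC00003", "ACC00002"]))
```

Because the check happens inside the generator, one bad accession does not stop iteration over the remaining ones.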
@HadrienG HadrienG self-assigned this Dec 20, 2016
@HadrienG
Owner

Hi,

Thank you for your comments and your improvements!
I read the code quickly and it looks great. I'll review and test the PR in detail over the holidays and come back with comments and/or changes.

Cheers,
Hadrien.

@tbigot

tbigot commented Jan 9, 2017

Thanks for your work. I was about to ask for indexing, but I can see that @horkko's commit b448358 addresses exactly my concern.
Thank you.

@HadrienG HadrienG self-requested a review January 31, 2017 07:42
The line `for table, acc_file in acc_dl_dict.items():` was duplicated and caused the db with multiple accession tables to fail
@HadrienG HadrienG removed their request for review January 31, 2017 07:44
@HadrienG
Owner

Everything looks ready to be merged.

Thanks again for your contribution!

@HadrienG HadrienG merged commit fcb37e6 into HadrienG:master Jan 31, 2017
@horkko
Contributor Author

horkko commented Jan 31, 2017

Thanks for merging 👍 and sorry for the duplicated loop, it was a copy/paste mistake.
