[WIP] Doc2vec on proteins example in iPython notebook #711

Closed
wants to merge 1 commit into from

ziky90
Contributor

@ziky90 ziky90 commented May 29, 2016

Do not merge yet!

This PR is a draft of a simple example showing how to use doc2vec on proteins. It aims to resolve #645.

TODO:

  • Run the whole iPython notebook on a smaller dataset.

@tmylk
Contributor

tmylk commented Oct 4, 2016

Ping @ziky90

@ziky90
Contributor Author

ziky90 commented Oct 5, 2016

@tmylk I still haven't gotten around to generating a smaller example dataset for Doc2Vec on proteins. Could the PR stay open for now (possibly for bioinformaticians to play with or to finish)?

@piskvorky piskvorky added wishlist Feature request difficulty medium Medium issue: required good gensim understanding & python skills labels Nov 9, 2016
@piskvorky piskvorky changed the title WIP Doc2vec on proteins example in iPython notebook [WIP] Doc2vec on proteins example in iPython notebook Nov 9, 2016
@parulsethi
Contributor

parulsethi commented Mar 23, 2017

@ziky90 This is a really nice addition! I'm interested in completing this PR.

As this notebook is only for demo purposes and the accuracy of the results doesn't matter much, I guess we can simply use a subset of the sequences in the FASTA file already used in the notebook, rather than the whole proteome (explicitly mentioning that we take this approach). wdyt?
It would still convey the idea of using doc2vec on protein sequences.
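
A rough sketch of what taking such a subset could look like, for illustration only. It is not code from the notebook in this PR: the helper names (`read_fasta`, `kmers`, `subset_documents`), the inline sample records, and the choice of overlapping 3-mers as "words" are all assumptions. It uses only the standard library.

```python
from io import StringIO
from itertools import islice


def read_fasta(handle):
    """Yield (identifier, sequence) pairs from a FASTA-formatted text stream."""
    identifier, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if identifier is not None:
                yield identifier, "".join(chunks)
            # Keep only the first whitespace-delimited token of the header line.
            identifier, chunks = (line[1:].split() or [""])[0], []
        elif identifier is not None:
            chunks.append(line)
    if identifier is not None:
        yield identifier, "".join(chunks)


def kmers(sequence, k=3):
    """Split a sequence into overlapping k-mers (an assumed tokenization)."""
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def subset_documents(handle, limit, k=3):
    """Return (identifier, k-mer tokens) for only the first `limit` records."""
    return [
        (identifier, kmers(sequence, k))
        for identifier, sequence in islice(read_fasta(handle), limit)
    ]


# Made-up records standing in for the real FASTA file (hypothetical data).
SAMPLE = """>prot1 hypothetical example
MKTAYIAK
QR
>prot2
GAVLI
>prot3
MSS
"""

docs = subset_documents(StringIO(SAMPLE), limit=2)
for identifier, tokens in docs:
    print(identifier, tokens)
```

Each (identifier, tokens) pair could then be wrapped in gensim's `TaggedDocument(words=tokens, tags=[identifier])` before training Doc2Vec; whether k-mers are the right tokenization for the notebook is an open choice.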

@menshikh-iv
Contributor

Ping @ziky90, what is the status of this PR? Will you finish it soon?

@menshikh-iv
Contributor

I'm closing this PR because it has been abandoned.

Labels
difficulty medium Medium issue: required good gensim understanding & python skills wishlist Feature request

6 participants