Which types of genes should we allow in the GENCODE GTF file? #12

Closed
dewyman opened this issue Apr 10, 2018 · 1 comment
Labels
question Further information is requested

Comments

@dewyman
Member

dewyman commented Apr 10, 2018

Should we be filtering out things like snoRNAs, miRNAs, etc.? At first, my plan was to stick to protein-coding genes and lncRNAs, but then I visited the GENCODE page and found that there is an absurd number of biotypes.

https://www.gencodegenes.org/gencode_biotypes.html

@dewyman
Member Author

dewyman commented Oct 2, 2018

In the end, I dealt with this by applying an optional length filter in the database initialization step.
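
For anyone reading later, here is a rough sketch of what an optional length filter over a GENCODE GTF could look like. It is not the project's actual code: the function name `filter_genes_by_length`, the `min_length` parameter, and the choice to measure the genomic span of `gene` records (rather than, say, transcript length) are all assumptions made for illustration.

```python
def filter_genes_by_length(gtf_lines, min_length=None):
    """Yield the tab-split fields of each 'gene' record in a GTF.

    Hypothetical helper: if min_length is None, no filtering is applied
    (the filter is optional); otherwise genes whose genomic span is
    shorter than min_length are skipped.
    """
    for line in gtf_lines:
        # Skip header/comment lines such as '##description: ...'
        if line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        # A GTF record has 9 tab-separated columns; column 3 is the feature type
        if len(fields) < 9 or fields[2] != "gene":
            continue
        start, end = int(fields[3]), int(fields[4])
        length = end - start + 1  # GTF coordinates are 1-based and inclusive
        if min_length is not None and length < min_length:
            continue
        yield fields


if __name__ == "__main__":
    # Made-up records for illustration only, not real GENCODE entries
    sample = [
        "##description: toy example\n",
        'chr1\tHAVANA\tgene\t100\t199\t.\t+\t.\tgene_id "G1"; gene_type "miRNA";\n',
        'chr1\tHAVANA\tgene\t1000\t2999\t.\t+\t.\tgene_id "G2"; gene_type "protein_coding";\n',
    ]
    for fields in filter_genes_by_length(sample, min_length=300):
        print(fields[0], fields[3], fields[4])
```

A length cutoff like this removes most short RNA classes (miRNAs, snoRNAs, and so on) without having to maintain a list of allowed biotypes, and leaving `min_length` unset keeps every gene.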

@dewyman dewyman closed this as completed Oct 2, 2018