Rank off targets #2

Closed
bretonics opened this issue Sep 1, 2016 · 3 comments

@bretonics
Owner

Get the match hits for each CRISPR and rank them by how many base pairs are matched: not only by how many occurrences each target has throughout the genome, but also by which target has the fewest matching base pairs.

@bretonics
Owner Author

Commit 4d3c31a adds output with:

  • Number of significant hits in entire sequence
  • Number of matching nucleotides per hit

Each occurrence is reported in the format length : matches, where:

Length = window length (CRISPR sequence size)
Matches = number of nucleotide matches in the hit
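
As a rough illustration of the length : matches format described above, here is a minimal Python sketch. It is not the code from commit 4d3c31a: the function names `count_matches` and `format_occurrence` are hypothetical, and the ungapped, position-by-position comparison is an assumption about how matches might be counted.

```python
def count_matches(crispr: str, window: str) -> int:
    """Count positions where two equal-length sequences share a nucleotide.

    Assumption: a simple ungapped, position-wise comparison.
    """
    if len(crispr) != len(window):
        raise ValueError("sequences must have the same length")
    return sum(a == b for a, b in zip(crispr.upper(), window.upper()))


def format_occurrence(crispr: str, window: str) -> str:
    """Render one hit as 'length : matches' (hypothetical helper)."""
    return f"{len(crispr)} : {count_matches(crispr, window)}"


# A CRISPR compared against its own site matches at every position.
print(format_occurrence("TGTGATCACGTACTATTATGCGG", "TGTGATCACGTACTATTATGCGG"))
```

Under this scheme a hit against the CRISPR's own site always reports the full window length as its match count.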

bretonics added a commit that referenced this issue Sep 19, 2016
@bretonics
Owner Author

3591c4f and b7925f0 add support.

Need to switch the ranking priority: sort primarily by identities, then by number of occurrences.

Name      Sequence                 Strand  Reverse                  Occurrences  Identities
CRISPR_3  TGTGATCACGTACTATTATGCGG  plus    GGCGTATTATCATGCACTAGTGT  3            23,8,8
CRISPR_2  AAAAATTTTCTCTATCTAACGGG  minus   GGGCAATCTATCTCTTTTAAAAA  4            23,15,8,8
CRISPR_1  AAAAAATTTTCTCTATCTAACGG  minus   GGCAATCTATCTCTTTTAAAAAA  4            23,16,8,8
CRISPR_8  AAAAAAAATTTTCCCTATCGGGG  minus   GGGGCTATCCCTTTTAAAAAAAA  2            23,9
CRISPR_9  AAAAAAATTTTCCCTATCGGGGG  minus   GGGGGCTATCCCTTTTAAAAAAA  2            23,9
CRISPR_6  CGAAAAAAAATTTTCCCTATCGG  minus   GGCTATCCCTTTTAAAAAAAAGC  2            23,9
CRISPR_7  GAAAAAAAATTTTCCCTATCGGG  minus   GGGCTATCCCTTTTAAAAAAAAG  2            23,9
CRISPR_4  AAAAATCCCATCGATCTAGCAGG  minus   GGACGATCTAGCTACCCTAAAAA  8            23,9,7,7,7,7,7,7
CRISPR_0  ATGTAGCTAGCTAGCTAGTAGGG  plus    GGGATGATCGATCGATCGATGTA  5            23,14,12,10,10
CRISPR_5  TCCCATCGATCTAGCAGGCCCGG  minus   GGCCCGGACGATCTAGCTACCCT  7            23,15,9,7,7,7,7

Fewer base-pair matches per hit (identities) == better CRISPR, followed by fewer occurrences.
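
The ranking rule above could be sketched in Python roughly as follows. This is only one possible reading, not the project's implementation: the issue does not say how a list of identities is reduced to a single sort key, so the sketch assumes the single highest identity is the on-target match and ranks by the largest remaining (off-target) identity, then by occurrence count. `Candidate` and `rank_key` are hypothetical names; the sample rows are a subset of the table above.

```python
from typing import List, NamedTuple, Tuple


class Candidate(NamedTuple):
    name: str
    occurrences: int
    identities: List[int]


def rank_key(c: Candidate) -> Tuple[int, int]:
    """Sort key: worst off-target identity first, then occurrence count.

    Assumption: the single highest identity is the on-target hit,
    so it is dropped before taking the maximum.
    """
    off_target = sorted(c.identities, reverse=True)[1:]
    worst = max(off_target, default=0)
    return (worst, c.occurrences)


candidates = [
    Candidate("CRISPR_3", 3, [23, 8, 8]),
    Candidate("CRISPR_1", 4, [23, 16, 8, 8]),
    Candidate("CRISPR_8", 2, [23, 9]),
    Candidate("CRISPR_4", 8, [23, 9, 7, 7, 7, 7, 7, 7]),
]

# Lower keys rank first: fewer matching base pairs, then fewer occurrences.
ranked = sorted(candidates, key=rank_key)
print([c.name for c in ranked])
# -> ['CRISPR_3', 'CRISPR_8', 'CRISPR_4', 'CRISPR_1']
```

Other reductions (for example, comparing the full sorted identity lists, or summing the off-target identities) would be equally consistent with the wording of the issue and can give a different order.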

@bretonics
Owner Author

e512c8f closes this issue.
