Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RdRp Explorer - Search by palmID #234

Open
ababaian opened this issue Feb 25, 2023 · 0 comments
Open

RdRp Explorer - Search by palmID #234

ababaian opened this issue Feb 25, 2023 · 0 comments
Labels
enhancement Enhancement to current feature feature New feature good first issue Good for newcomers

Comments

@ababaian
Copy link
Member

Currently the RdRp Explorer page allows for searching by Virus Family, Genbank ID, or SRA Accession Code.
Screenshot from 2023-02-25 12-23-05

It is useful to search in which datasets a specific RdRp sequence occurs, BLAST-like search is one obvious feature (implemented in palmID) but a simpler interface would be the ability to retrieve all datasets in which a given palmprint sequence occurs in.

This information is already stored on the SQL server in the palm_sra Table. So one could query for a given palm_id (palmprint accession code) and get the SRA accession codes.

For example to find u1337 and u43110 you can search those codes with

where palm_id in ('u1337', 'u43110')

returns

"SRR10873916"	"u43110"	100	2.97e-66	"LAVDWTEFDSSQNNLEHALLRTVLGKLGAPLDVLDIIDQMKHTRYVENINVAGTVHSKKDSGAPDTLVGNTLFNLCVVLSCVDNYRQLEIAAFKGDDAIL"
"SRR10875094"	"u43110"	100	3.78e-66	"LAVDWTEFDSSQNNLEHALLRTVLGKLGAPLDVLDIIDQMKHTRYVENINVAGTVHSKKDSGAPDTLVGNTLFNLCVVLSCVDNYRQLEIAAFKGDDAIL"
"SRR10873446"	"u43110"	100	3.56e-66	"LAVDWTEFDSSQNNLEHALLRTVLGKLGAPLDVLDIIDQMKHTRYVENINVAGTVHSKKDSGAPDTLVGNTLFNLCVVLSCVDNYRQLEIAAFKGDDAIL"
"SRR8735790"	"u1337"	100	9.47e-71	"FSYDTRCFDSTVTEKDIRTEEEVYQCCDLEPEARKVIAALTERLYVGGPMHNSKGDLCGYRRCRASGVYTTSFGNTLTCYLKATAAIKAAGLRDCTMLVCGDDLVV"
"SRR10873776"	"u43110"	100	3.58e-65	"LAVDWTEFDSSQNNLEHALLRTVLGKLGAPLDVLDIIDQMKHTRYVENINVAGTVHSKKDSGAPDTLVGNTLFNLCVVLSCVDNYRQLEIAAFKGDDAIL"

So the interface should be able to "search" for one or more palm_id accessions. These would be "tags" which can be selected and added together. The graph should be updated to show the frequency count of matching libraries (ignore score for now, but ultimately score should be drawn as -log(evalue))

When the search is run, it should retrieve in the right panel the matching libraries which were build up in the query editor (left panel).

@ababaian ababaian added feature New feature enhancement Enhancement to current feature good first issue Good for newcomers labels Feb 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Enhancement to current feature feature New feature good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

1 participant