Implementation algorithm

Alex Warwick Vesztrocy, Christophe Dessimoz, Henning Redestig, Prioritising candidate genes causing QTL using hierarchical orthologous groups, *Bioinformatics*, Volume 34, Issue 17, 01 September 2018, Pages i612–i619, https://doi.org/10.1093/bioinformatics/bty615

In [1]:
import qtlsearch
import pandas as pd
from IPython.display import Image, SVG, display

# SPARQL endpoints passed to the search object
search = qtlsearch.SEARCH(
    "http://pbg-ld.candygene-nlesc.surf-hosted.nl:8890/sparql",
    "http://sparql.omabrowser.org/sparql",
    "https://sparql.uniprot.org/sparql")

## Brix, Soluble Solids, Sugars

GO-terms: `GO:0006094` `GO:0046370` `GO:0046369` `GO:0005985` `GO:0015770`

QTL from: Chromosome `9`, around `3474710`

Candidate: `Lin5` (`Solyc09g010080`)

Define the QTL interval (100 kb on either side of position `3474710`) and retrieve the genes within it

In [2]:
d = 100000  # half-width of the window around the QTL position, in base pairs
intervalT = search.make_interval(
    "http://localhost:8890/genome/Solanum_lycopersicum/chromosome/9",
    3474710 - d,
    3474710 + d)

# genes located within the interval
genesT = search.interval_genes(intervalT)
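
As a quick sanity check on the window arithmetic, the sketch below recomputes the interval bounds in plain Python and tests whether a gene's coordinates overlap them. The helper `overlaps_interval` is hypothetical and not part of `qtlsearch`; the Lin5 coordinates are the ones listed for `Solyc09g010080.2` in the report at the end of this section.

```python
QTL_POSITION = 3474710  # QTL position on chromosome 9
HALF_WIDTH = 100000     # same value as d in the cell above

start = QTL_POSITION - HALF_WIDTH
end = QTL_POSITION + HALF_WIDTH


def overlaps_interval(gene_start, gene_end, start, end):
    """Return True if [gene_start, gene_end] overlaps [start, end] (hypothetical helper)."""
    return gene_start <= end and gene_end >= start


# Coordinates of Solyc09g010080.2 (Lin5) as listed in the report
lin5_in_window = overlaps_interval(3475480, 3479343, start, end)
print(start, end, lin5_in_window)  # 3374710 3574710 True
```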

Collect the GO annotations for each term of interest, including its child terms

In [3]:
qtls = [genesT.index]
# GO terms of interest; get_child_annotations is called once per term
go_terms = ["GO:0006094", "GO:0046370", "GO:0046369", "GO:0005985", "GO:0015770"]
go_annotations = pd.concat([search.get_child_annotations(term) for term in go_terms])
print(go_annotations)

                                                                                       label
go_annotation                                                                               
http://purl.obolibrary.org/obo/GO_0006094                                    gluconeogenesis
http://purl.obolibrary.org/obo/GO_0046370                      fructose biosynthetic process
http://purl.obolibrary.org/obo/GO_1901358        beta-D-galactofuranose biosynthetic process
http://purl.obolibrary.org/obo/GO_0046369                     galactose biosynthetic process
http://purl.obolibrary.org/obo/GO_0005985                          sucrose metabolic process
http://purl.obolibrary.org/obo/GO_0005986                       sucrose biosynthetic process
http://purl.obolibrary.org/obo/GO_0061705  sucrose catabolic process to fructose-6-phosph...
http://purl.obolibrary.org/obo/GO_0005987                          sucrose catabolic process
http://purl.obolibrary.org/obo/GO_0061704                    glycolyti
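
The index of the printed table uses OBO PURLs rather than the `GO:` identifiers passed to `get_child_annotations`. A minimal sketch of that mapping follows, assuming the pattern visible in the output (prefix plus the identifier with the colon replaced by an underscore). `go_id_to_purl` is a hypothetical helper, not a `qtlsearch` function.

```python
OBO_PREFIX = "http://purl.obolibrary.org/obo/"


def go_id_to_purl(go_id):
    """Map an identifier such as 'GO:0006094' to its OBO PURL (hypothetical helper)."""
    if not go_id.startswith("GO:"):
        raise ValueError(f"not a GO identifier: {go_id!r}")
    return OBO_PREFIX + go_id.replace(":", "_", 1)


go_terms = ["GO:0006094", "GO:0046370", "GO:0046369", "GO:0005985", "GO:0015770"]
purls = [go_id_to_purl(term) for term in go_terms]
for purl in purls:
    print(purl)
```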

Retrieve the data and run the computations

In [4]:
result = qtlsearch.QTLSEARCH(search, qtls, go_annotations, True)

Create report

In [5]:
report_list = result.report()
for report in report_list:    
    display(report)

gene_id,alias,uniprot_id,description,chromosome,location,score
Solyc09g010090.2,LIN7,Q8L4N2,Cell-wall invertase,9,3480545-3484159,0.202943
Solyc09g010080.2,lin5,Q9LD97,"Beta-fructofuranosidase, insoluble isoenzyme 1",9,3475480-3479343,0.172502
Solyc09g010020.2,,K4CR31,1-aminocyclopropane-1-carboxylate oxidase-like...,9,3447416-3449839,0.043606
Solyc09g010040.1,101248415,K4CR33,1-aminocyclopropane-1-carboxylate oxidase-like...,9,3454295-3455561,0.043606
Solyc09g010000.2,101249481,K4CR29,1-aminocyclopropane-1-carboxylate oxidase-like...,9,3444303-3445806,0.037065
Solyc09g009900.2,,K4CR19,Pollen-specific protein - like,9,3382755-3386531,0.034483
Solyc09g009910.2,101252320,K4CR20,Lipid A ABC transporter permease/ATP-binding p...,9,3386287-3395367,0.034483
Solyc09g009920.1,101252019,K4CR21,F-box family protein,9,3397388-3398827,0.034483
Solyc09g009930.1,101251713,K4CR22,Unknown Protein,9,3400106-3400900,0.034483
Solyc09g009940.2,101251215,K4CR23,Signal recognition particle protein,9,3401811-3408955,0.034483
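
To see where the known candidate lands, the sketch below looks up a gene's rank in a score-sorted list. The rows are copied from the first five lines of the report above; `rank_of` is a hypothetical helper, and matching on the gene ID without its version suffix is an assumption made for illustration.

```python
# (gene_id, score) pairs copied from the first five rows of the report
rows = [
    ("Solyc09g010090.2", 0.202943),
    ("Solyc09g010080.2", 0.172502),
    ("Solyc09g010020.2", 0.043606),
    ("Solyc09g010040.1", 0.043606),
    ("Solyc09g010000.2", 0.037065),
]


def rank_of(gene_id, rows):
    """Return (rank, score) of gene_id in rows sorted by descending score, or None."""
    ordered = sorted(rows, key=lambda row: row[1], reverse=True)
    for rank, (versioned_id, score) in enumerate(ordered, start=1):
        # Compare without the version suffix, e.g. "Solyc09g010080.2" -> "Solyc09g010080"
        if versioned_id.split(".")[0] == gene_id:
            return rank, score
    return None


lin5_rank = rank_of("Solyc09g010080", rows)
print(lin5_rank)  # (2, 0.172502)
```

Here the candidate `Solyc09g010080` ranks second, just below the neighbouring cell-wall invertase `LIN7`.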
