# Occurrence of different databases in PubMed Central

This notebook uses the SCAIView API to query the number of publications in PubMed Central that mention a given pathway database.

In [1]:
import simplejson as json
import requests
import urllib
from requests.exceptions import HTTPError
from IPython.display import display, HTML

For security reasons, the security token is not displayed here.

In [None]:
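import getpass

def fetchToken():
    """Fetch an access token from Keycloak using the client-credentials grant."""
    # keycloak_uri must be defined beforehand (it is not shown in this notebook)
    # Prompt for the client secret so it is never stored in the notebook
    clientSecret = getpass.getpass('Password:')

    response = requests.post(
        keycloak_uri + 'protocol/openid-connect/token',
        data={
            'grant_type': 'client_credentials',
            'client_id': 'temowl-backend',
            'client_secret': clientSecret,
        },
    )
    # Fail early on a rejected secret or an unreachable server
    response.raise_for_status()

    return response.json()['access_token']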
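
Because `fetchToken` prompts for the client secret every time it runs, it can help to cache the token between queries. The following is only a minimal sketch of that idea, not part of the original notebook: `TokenCache`, `margin_s`, and the injected `clock` are hypothetical names, and the sketch assumes a fetch function that returns the full token response (including the standard OAuth 2.0 `expires_in` field) rather than only the `access_token` string.

```python
import time
from typing import Callable, Optional


class TokenCache:
    """Cache a bearer token and refetch it shortly before it expires."""

    def __init__(self, fetch: Callable[[], dict], margin_s: float = 30.0,
                 clock: Callable[[], float] = time.monotonic):
        self._fetch = fetch          # hypothetical: returns the full token JSON payload
        self._margin_s = margin_s    # refetch this many seconds before expiry
        self._clock = clock          # injectable so the cache can be tested offline
        self._token: Optional[str] = None
        self._expires_at = 0.0

    def get(self) -> str:
        now = self._clock()
        if self._token is None or now >= self._expires_at:
            payload = self._fetch()
            self._token = payload['access_token']
            # Without 'expires_in' the token is treated as expired on the next call
            lifetime = float(payload.get('expires_in', 0))
            self._expires_at = now + lifetime - self._margin_s
        return self._token


# Offline demonstration with a fake fetch function and a fake clock
calls = []
fake_time = [0.0]

def fake_fetch() -> dict:
    calls.append(1)
    return {'access_token': f'tok{len(calls)}', 'expires_in': 300}

cache = TokenCache(fake_fetch, margin_s=30, clock=lambda: fake_time[0])
first = cache.get()
second = cache.get()      # served from the cache, no second fetch
fake_time[0] = 271.0      # past the 270 s refresh point
third = cache.get()       # triggers a refetch
```

With such a cache, the `Authorization` header could be built from `cache.get()` so that the secret is requested only when the token has expired.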


In [3]:
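def query_scaiview(query, limit=10):
    """Return the number of documents in the PMC_2019 collection that match the query."""
    response = requests.get(
        scaiview_uri + 'v2/solr/search?',
        params={'q': query, 'rows': limit, 'sortField': 'date', 'sortOrder': 'DESC', 'collection': 'PMC_2019'},
        # Request a fresh token for each call (this prompts for the client secret)
        headers={'Accept': 'application/json', 'Authorization': 'Bearer ' + fetchToken()},
    )
    response.raise_for_status()
    json_response = response.json()
    # Only the total hit count is needed; the returned documents are ignored
    return json_response['numFound']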
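
To see what the request above looks like on the wire, the query string can be assembled with the standard library alone. This is a sketch under stated assumptions: `BASE_URI` is a placeholder (the real `scaiview_uri` is not shown in this notebook), `build_search_url` is a hypothetical helper, and the encoding produced by `requests` should be comparable but is not verified here.

```python
from urllib.parse import urlencode

# Placeholder base URI; the actual scaiview_uri is not given in the notebook
BASE_URI = 'https://example.org/api/'


def build_search_url(query, limit=10, collection='PMC_2019'):
    """Assemble the search URL with the same parameters as query_scaiview."""
    params = {
        'q': query,
        'rows': limit,
        'sortField': 'date',
        'sortOrder': 'DESC',
        'collection': collection,
    }
    # urlencode percent-encodes values and turns spaces into '+'
    return BASE_URI + 'v2/solr/search?' + urlencode(params)


url = build_search_url('kegg AND pathway enrichment')
print(url)
```

Printing the URL this way is a quick check that the boolean operator and the collection name are passed through as intended.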

### Query for documents in PubMed Central (PMC) 2019

In [5]:
queries = [
    'kegg AND pathway enrichment',
    'reactome AND pathway enrichment',
    'wikipathways AND pathway enrichment',
    'msigdb AND pathway enrichment',
    'consensuspathdb AND pathway enrichment',
    'pathway commons AND pathway enrichment',
]

for query in queries:
    results = query_scaiview(query, 10)
    print(f'{query} resulted in {results} publications')

kegg AND pathway enrichment resulted in 27713 publications
reactome AND pathway enrichment resulted in 3765 publications
wikipathways AND pathway enrichment resulted in 651 publications
msigdb AND pathway enrichment resulted in 2892 publications
consensuspathdb AND pathway enrichment resulted in 339 publications
pathway commons AND pathway enrichment resulted in 1640 publications
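
The counts printed above can be ranked for easier comparison. The sketch below copies the numbers from the output into a dictionary and relates each one to the largest count; the variable names are illustrative. Since a single publication may match several queries, the counts should not be summed to estimate a number of unique publications.

```python
# Hit counts copied from the output above
counts = {
    'kegg': 27713,
    'reactome': 3765,
    'wikipathways': 651,
    'msigdb': 2892,
    'consensuspathdb': 339,
    'pathway commons': 1640,
}

largest = max(counts.values())

# Sort databases by hit count, highest first
ranked = sorted(counts.items(), key=lambda item: item[1], reverse=True)

for name, n in ranked:
    # Express each count relative to the most frequently matched database
    print(f'{name:<16} {n:>6}  {n / largest:6.1%} of the largest count')
```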


Number of publications citing major pathway resources for pathway enrichment in PubMed Central (PMC) 2019. To obtain a rough estimate of how many publications used each of several pathway databases for pathway enrichment, SCAIView (http://academia.scaiview.com/academia/; indexed on 01/03/2019) was used to run the following query against the PMC corpus: “<database name>” AND “pathway enrichment”.
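
The query template can also be generated from a list of database names instead of being written out by hand. The following is a minimal sketch with hypothetical names (`DATABASES`, `build_query`, `quoted`). By default it reproduces the unquoted strings used in the code cell, and `quoted=True` yields the quoted form given in the caption. Whether SCAIView treats quoted phrases as exact matches is an assumption that the source does not confirm.

```python
# Database names as they appear in the queries of this notebook
DATABASES = [
    'kegg',
    'reactome',
    'wikipathways',
    'msigdb',
    'consensuspathdb',
    'pathway commons',
]


def build_query(database, topic='pathway enrichment', quoted=False):
    """Combine a database name and a topic with a boolean AND."""
    if quoted:
        # Quoted form, as written in the caption
        return f'"{database}" AND "{topic}"'
    # Unquoted form, as used in the code cell
    return f'{database} AND {topic}'


queries = [build_query(db) for db in DATABASES]
quoted_example = build_query('pathway commons', quoted=True)
```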