# Download citation data from NIH-OCC
NIH-OCC: National Institutes of Health's Open Citation Collection https://icite.od.nih.gov/

## 1) Load Python interface to NIH-OCC's API
One NIH entry per PubMed ID (PMID) will be downloaded to the directory, `./icite`.

Do not add these files to a repo or otherwise put them under revision control, because there will be a great many of them.
Instead, make sure the `./icite` directory is ignored in the `.gitignore` file.
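
One way to keep the download directory out of revision control is to add it to `.gitignore` from Python. The sketch below is a minimal illustration and is not part of pmidcite: the helper name `ensure_gitignored` and the pattern `icite/` are assumptions chosen for this example.

```python
from pathlib import Path


def ensure_gitignored(pattern, gitignore='.gitignore'):
    """Append pattern to the gitignore file unless it is already listed (hypothetical helper)."""
    path = Path(gitignore)
    text = path.read_text() if path.exists() else ''
    if pattern in (line.strip() for line in text.splitlines()):
        return False  # The pattern is already listed; leave the file unchanged
    # Start on a new line if the existing file does not end with a newline
    prefix = '' if not text or text.endswith('\n') else '\n'
    with path.open('a') as ostrm:
        ostrm.write(f'{prefix}{pattern}\n')
    return True
```

Calling `ensure_gitignored('icite/')` from the top directory of the repo would add the entry once and return `False` on later calls.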

NOTE: The Python API always downloads data from the NIH-OCC, even if it has been requested before. See the notebook, *Download or import citation data from the NIH-OCC*.

In [1]:
from os import makedirs

# Create the download directory if it does not already exist
makedirs('./icite', exist_ok=True)

## 2) Load the NIH Downloader
The NIH downloader uses the API to download data from NIH if the data are not stored locally, or if the user has asked to always download and overwrite the older citation file so that new citations can be seen.

The NIH downloader will read already downloaded NIH-OCC data if it is available. This makes it possible to work offline using previously downloaded citation data.
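
The read-or-download behavior described above can be sketched in a few lines. This illustrates the caching pattern only and is not pmidcite's implementation: the function `load_or_download`, the `fetch` callable, and the one-JSON-file-per-PMID layout are all assumptions.

```python
import json
from pathlib import Path


def load_or_download(pmid, cache_dir, fetch, force_download=False):
    """Return the citation record for pmid, using a local copy when allowed.

    Hypothetical sketch: fetch is any callable that takes a PMID and returns
    a JSON-serializable dict, standing in for a call to the NIH API.
    """
    cache_file = Path(cache_dir) / f'{pmid}.json'  # assumed file layout
    if cache_file.exists() and not force_download:
        # Work offline: read the previously downloaded record
        return json.loads(cache_file.read_text())
    # Not stored locally, or fresh data were requested: download and overwrite
    record = fetch(pmid)
    cache_file.write_text(json.dumps(record))
    return record
```

With `force_download=False`, a second request for the same PMID reads the local file and never calls `fetch`. With `force_download=True`, the file is replaced on every request.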

In [2]:
from pmidcite.icite.pmid_dnlder import NIHiCiteDownloader

# Set to True to always download and overwrite local citation files
force_download = False
dnldr = NIHiCiteDownloader('./icite', force_download)

## 3) Download NIH-OCC data for one PMID

Print the column headers first. Then print the citation data.

In [3]:
nih_entry = dnldr.get_icite(22882545)

print(nih_entry.hdr)
print(nih_entry)

PMID     RP HAMCc   % G YEAR   cit cli ref au[00](authors) title
22882545 .. .A...  64 2 2013    37  0  24 au[03](P J N de Bruyn) Killer whale ecotypes: is there a global model?


## 4) Optionally, print the column header descriptions

In [4]:
# Uncomment to print a description of each column header
# nih_entry.prt_keys()

## 5) Download NIH-OCC data for multiple PMIDs

In [5]:
pmids = [31461780, 22882545, 20050301]
nih_entries = dnldr.get_icites(pmids)

for entry in nih_entries:
    print(entry)

31461780 R. .A...   8 1 2020     1  0   0 au[06](Robert L Pitman) Enigmatic megafauna: type D killer whale in the Southern Ocean.
22882545 .. .A...  64 2 2013    37  0  24 au[03](P J N de Bruyn) Killer whale ecotypes: is there a global model?
20050301 R. .A...  71 2 2009    58  0  25 au[05](Andrew D Foote) Ecological, morphological and genetic divergence of sympatric North Atlantic killer whale populations.


## 6) Print all the NIH-OCC data for one PMID

In [6]:
for key, val in nih_entry.get_dict().items():
    print(f'{key:>27} {val}')

                       pmid 22882545
                       year 2013
                      title Killer whale ecotypes: is there a global model?
                    authors ['P J N de Bruyn', 'Cheryl A Tosh', 'Aleks Terauds']
                    journal Biol Rev Camb Philos Soc
        is_research_article False
    relative_citation_ratio 1.46
             nih_percentile 64.3
                      human 0.0
                     animal 1.0
         molecular_cellular 0.0
                        apt 0.05
                is_clinical False
             citation_count 37
         citations_per_year 3.3636363636363638
expected_citations_per_year 2.2999145273529242
        field_citation_rate 5.224693905781195
                provisional False
                    x_coord 0.8660254037844386
                    y_coord -0.5
              cited_by_clin []
                   cited_by [31230140, 25297864, 35233242, 37055915, 31215081, 29895580, 31631360, 26937049, 34750442, 31131963, 37839906, 37
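
Several of the fields printed above are related by simple arithmetic. As a quick check, assuming that `relative_citation_ratio` is `citations_per_year` divided by `expected_citations_per_year` (which is how the values shown here behave), the numbers for PMID 22882545 can be reproduced as follows. The values are copied from the output above.

```python
# Values copied from the output for PMID 22882545
citation_count = 37
citations_per_year = 3.3636363636363638
expected_citations_per_year = 2.2999145273529242

# Assumption: RCR = citations per year / expected citations per year
rcr = citations_per_year / expected_citations_per_year
print(f'{rcr:.2f}')  # 1.46, matching relative_citation_ratio

# Number of years implied by the citation count and the yearly rate
years = citation_count / citations_per_year
print(round(years))  # 11
```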

Copyright (C) 2019-present, DV Klopfenstein, PhD. All rights reserved.