In [1]:
%env KIF_DEBUG=
%env KIF_RESOLVE_ENTITIES=1

env: KIF_DEBUG=
env: KIF_RESOLVE_ENTITIES=1


# PubChem

In [2]:
from kif_lib import *
from kif_lib.vocabulary import pc, wd

Create a SPARQL store with PubChem mappings pointing to a PubChem RDF SPARQL endpoint:

In [3]:
import os
kb = Store('pubchem-sparql', os.getenv('PUBCHEM', 'https://qlever.cs.uni-freiburg.de/api/pubchem'))
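
The endpoint lookup above relies on the standard `os.getenv` fallback: the `PUBCHEM` environment variable wins when it is set, otherwise the public QLever endpoint is used. The sketch below isolates that lookup so it can be tried without contacting any store. `resolve_endpoint` and the `localhost` URL are hypothetical names used only for illustration; they are not part of KIF.

```python
import os

# Default taken from the cell above.
DEFAULT_ENDPOINT = 'https://qlever.cs.uni-freiburg.de/api/pubchem'


def resolve_endpoint(environ=None):
    """Return the endpoint URL, preferring the PUBCHEM variable if present.

    `environ` is any mapping; it defaults to the real process environment.
    (Hypothetical helper, not a KIF function.)
    """
    environ = os.environ if environ is None else environ
    return environ.get('PUBCHEM', DEFAULT_ENDPOINT)


# No override: falls back to the default endpoint.
print(resolve_endpoint({}))
# Override (hypothetical local mirror): the variable takes precedence.
print(resolve_endpoint({'PUBCHEM': 'http://localhost:7001'}))
```

Note that a variable set to the empty string still counts as set, so `PUBCHEM=` would yield an empty URL rather than the default.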

Set the default limit to 3, so that queries which do not specify a limit return at most 3 results:

In [4]:
kb.limit = 3
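
The effect of a store-wide default that individual queries may override can be pictured with a plain iterator. This is only an illustration of the behavior seen in this notebook (3 results by default, 10 when `limit=10` is passed); it is not how KIF implements limits, and `take` and `DEFAULT_LIMIT` are hypothetical names.

```python
from itertools import islice

DEFAULT_LIMIT = 3  # plays the role of kb.limit in this sketch


def take(results, limit=None):
    """Return at most `limit` results, or DEFAULT_LIMIT when no limit is given."""
    n = DEFAULT_LIMIT if limit is None else limit
    return list(islice(results, n))


# No explicit limit: the default applies.
print(take(iter(range(100))))
# Explicit limit: overrides the default for this call only.
print(take(iter(range(100)), limit=10))
```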

## Label, aliases, and description (pseudo-properties)

Gets the label of the compound with CID 241 in English:

In [5]:
it = kb.filter(subject=pc.CID(241), property=wd.label, language='en')
display(*it)

(**Statement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** **LabelProperty** "benzene"@en))
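
Judging from the link in the output above, `pc.CID(241)` denotes the item whose IRI is the PubChem RDF compound namespace followed by `CID241`. A minimal sketch of that naming scheme, assuming it holds for other CIDs as well (`compound_iri` is a hypothetical helper, not a KIF function):

```python
# Namespace as it appears in the rendered output above.
COMPOUND_NS = 'http://rdf.ncbi.nlm.nih.gov/pubchem/compound/'


def compound_iri(cid):
    """Build the PubChem RDF IRI for a compound identifier (CID)."""
    cid = int(cid)
    if cid <= 0:
        raise ValueError('CID must be a positive integer')
    return f'{COMPOUND_NS}CID{cid}'


print(compound_iri(241))
```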

Gets the aliases of CID 241 in English:

In [6]:
it = kb.filter(subject=pc.CID(241), property=wd.alias, language='en')
display(*it)

(**Statement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** **AliasProperty** "\[6]annulene"@en))

(**Statement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** **AliasProperty** "Benzen"@en))

(**Statement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** **AliasProperty** "benzene"@en))

Gets the description of CID 241 in English:

In [7]:
it = kb.filter(subject=pc.CID(241), property=wd.description, language='en')
display(*it)

(**Statement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** **DescriptionProperty** "A six-carbon aromatic annulene in which each carbon atom donates one of its two 2p electrons into a delocalised pi system. A toxic, flammable liquid byproduct of coal distillation, it is used as an industrial solvent. Benzene is a carcinogen that also damages bone marrow and the central nervous system."@en))

## Compounds

Lists some properties of CID 241, raising the limit to 10 for this query:

In [8]:
it = kb.filter_annotated(
    subject=pc.CID(241),
    property=(
        wd.instance_of |
        wd.chemical_formula |
        wd.canonical_SMILES |
        wd.isomeric_SMILES |
        wd.InChI |
        wd.InChIKey |
        wd.ChEBI_ID |
        wd.ChEMBL_ID |
        wd.mass |
        wd.partition_coefficient_water_octanol
    ), limit=10)
display(*it)

(**AnnotatedStatement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** (**Property** [canonical SMILES](http://www.wikidata.org/entity/P233)) "C1=CC=CC=C1")
- (**QualifierRecord**)
- (**ReferenceRecordSet**)
- **NormalRank**)

(**AnnotatedStatement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** (**Property** [ChEBI ID](http://www.wikidata.org/entity/P683)) "16716")
- (**QualifierRecord**)
- (**ReferenceRecordSet**)
- **NormalRank**)

(**AnnotatedStatement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** (**Property** [ChEMBL ID](http://www.wikidata.org/entity/P592)) "CHEMBL277500")
- (**QualifierRecord**)
- (**ReferenceRecordSet**)
- **NormalRank**)

(**AnnotatedStatement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** (**Property** [chemical formula](http://www.wikidata.org/entity/P274)) "C6H6")
- (**QualifierRecord**)
- (**ReferenceRecordSet**)
- **NormalRank**)

(**AnnotatedStatement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** (**Property** [InChI](http://www.wikidata.org/entity/P234)) "InChI=1S/C6H6/c1-2-4-6-5-3-1/h1-6H")
- (**QualifierRecord**)
- (**ReferenceRecordSet**)
- **NormalRank**)

(**AnnotatedStatement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** (**Property** [InChIKey](http://www.wikidata.org/entity/P235)) "UHOVQNZJYSORNB-UHFFFAOYSA-N")
- (**QualifierRecord**)
- (**ReferenceRecordSet**)
- **NormalRank**)

(**AnnotatedStatement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** (**Property** [instance of](http://www.wikidata.org/entity/P31)) (**Item** [type of a chemical entity](http://www.wikidata.org/entity/Q113145171)))
- (**QualifierRecord**)
- (**ReferenceRecordSet**)
- **NormalRank**)

(**AnnotatedStatement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** (**Property** [isomeric SMILES](http://www.wikidata.org/entity/P2017)) "C1=CC=CC=C1")
- (**QualifierRecord**)
- (**ReferenceRecordSet**)
- **NormalRank**)

(**AnnotatedStatement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** (**Property** [mass](http://www.wikidata.org/entity/P2067)) (**Quantity** 78.11 (**Item** [gram per mole](http://www.wikidata.org/entity/Q28924752))))
- (**QualifierRecord**)
- (**ReferenceRecordSet**)
- **NormalRank**)

(**AnnotatedStatement** (**Item** [benzene](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID241)) (**ValueSnak** (**Property** [partition coefficient water/octanol](http://www.wikidata.org/entity/P2993)) (**Quantity** 2.1))
- (**QualifierRecord**)
- (**ReferenceRecordSet**)
- **NormalRank**)
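
As a sanity check on the output above, the reported mass of 78.11 g/mol can be reproduced from the chemical formula `C6H6` using standard atomic weights. The sketch below is independent of KIF; `molar_mass` is a hypothetical helper that handles only simple formulas (element symbols followed by optional counts, no parentheses or charges), and its weight table covers only the elements needed here.

```python
import re

# Abridged standard atomic weights in g/mol.
ATOMIC_WEIGHT = {'H': 1.008, 'C': 12.011, 'O': 15.999}


def molar_mass(formula):
    """Sum atomic weights over a simple formula such as 'C6H6' or 'H2O'."""
    total = 0.0
    for symbol, count in re.findall(r'([A-Z][a-z]?)(\d*)', formula):
        # A missing count means a single atom of that element.
        total += ATOMIC_WEIGHT[symbol] * (int(count) if count else 1)
    return total


print(round(molar_mass('C6H6'), 2))  # 78.11, matching the mass statement above
```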

Lists the mass of compounds whose chemical formula is H2O:

In [9]:
it = kb.filter(subject=wd.chemical_formula('H2O'), property=wd.mass)
display(*it)

(**Statement** (**Item** [oxygen-15 atom](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID10129877)) (**ValueSnak** (**Property** [mass](http://www.wikidata.org/entity/P2067)) (**Quantity** 17.019 (**Item** [gram per mole](http://www.wikidata.org/entity/Q28924752)))))

(**Statement** (**Item** [oxygen-17 atom](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID10197601)) (**ValueSnak** (**Property** [mass](http://www.wikidata.org/entity/P2067)) (**Quantity** 19.015 (**Item** [gram per mole](http://www.wikidata.org/entity/Q28924752)))))

(**Statement** (**Item** [oxygen-16 atom](http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID10214376)) (**ValueSnak** (**Property** [mass](http://www.wikidata.org/entity/P2067)) (**Quantity** 18.011 (**Item** [gram per mole](http://www.wikidata.org/entity/Q28924752)))))
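
The three results share the formula H2O but differ in mass. One plausible reading, given the labels, is that they are water molecules containing a specific oxygen isotope. The sketch below checks that reading arithmetically: it adds the average atomic weight of two hydrogens to the isotopic mass of oxygen-15, oxygen-17, and oxygen-16. The isotope masses are truncated values from standard tables, and the interpretation itself is an assumption, not something the store states.

```python
H_AVERAGE = 1.008  # standard atomic weight of hydrogen, g/mol

# Isotopic masses of oxygen in unified atomic mass units (truncated).
OXYGEN_ISOTOPE_MASS = {15: 15.0030656, 16: 15.9949146, 17: 16.9991317}


def labelled_water_mass(mass_number):
    """Mass of H2O containing the oxygen isotope with the given mass number."""
    return 2 * H_AVERAGE + OXYGEN_ISOTOPE_MASS[mass_number]


# Same order as the statements above.
for mass_number in (15, 17, 16):
    print(mass_number, round(labelled_water_mass(mass_number), 3))
```

The computed values (17.019, 19.015, and 18.011) agree with the masses returned by the query.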

## Patents

Lists some patents:

In [10]:
it = kb.filter(property=wd.instance_of, value=wd.patent)
display(*it)

(**Statement** (**Item** [300cc water bottle](http://rdf.ncbi.nlm.nih.gov/pubchem/patent/AP-00214-S1)) (**ValueSnak** (**Property** [instance of](http://www.wikidata.org/entity/P31)) (**Item** [patent](http://www.wikidata.org/entity/Q253623))))

(**Statement** (**Item** [600cc Water bottle](http://rdf.ncbi.nlm.nih.gov/pubchem/patent/AP-00215-S1)) (**ValueSnak** (**Property** [instance of](http://www.wikidata.org/entity/P31)) (**Item** [patent](http://www.wikidata.org/entity/Q253623))))

(**Statement** (**Item** [350ml Water bottle.](http://rdf.ncbi.nlm.nih.gov/pubchem/patent/AP-00216-S1)) (**ValueSnak** (**Property** [instance of](http://www.wikidata.org/entity/P31)) (**Item** [patent](http://www.wikidata.org/entity/Q253623))))

## Sources (vendors)

Lists some vendors:

In [11]:
it = kb.filter(property=wd.instance_of, value=wd.vendor)
display(*it)

(**Statement** (**Item** [AAA Chemistry](http://rdf.ncbi.nlm.nih.gov/pubchem/source/AAA_Chemistry)) (**ValueSnak** (**Property** [instance of](http://www.wikidata.org/entity/P31)) (**Item** [vendor](http://www.wikidata.org/entity/Q1762621))))

(**Statement** (**Item** [AKos Consulting & Solutions](http://rdf.ncbi.nlm.nih.gov/pubchem/source/AKos_Consulting___Solutions)) (**ValueSnak** (**Property** [instance of](http://www.wikidata.org/entity/P31)) (**Item** [vendor](http://www.wikidata.org/entity/Q1762621))))

(**Statement** (**Item** [Alinda Chemical](http://rdf.ncbi.nlm.nih.gov/pubchem/source/Alinda_Chemical)) (**ValueSnak** (**Property** [instance of](http://www.wikidata.org/entity/P31)) (**Item** [vendor](http://www.wikidata.org/entity/Q1762621))))