Skip to content

lmyint/rsemmed

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Programmatic querying of the Semantic MEDLINE database

The Semantic MEDLINE database (SemMedDB) is a collection of annotations of sentences from the abstracts of articles indexed in PubMed. These annotations take the form of subject-predicate-object triples of information. These triples are also called predications.

An example predication is "Interleukin-12 INTERACTS_WITH IFNA1". Here, the subject is "Interleukin-12", the object is "IFNA1" (interferon alpha-1), and the predicate linking the subject and object is "INTERACTS_WITH". The Semantic MEDLINE database consists of tens of millions of these predications.

The predications in SemMedDB can be represented in graph form. Nodes represent concepts, and directed edges represent predicates (concept linkers). In particular, the Semantic MEDLINE graph is a directed multigraph because multiple predicates are often present between pairs of nodes (e.g., "A ASSOCIATED_WITH B" and "A INTERACTS_WITH B"). rsemmed relies on the igraph package for efficient graph operations.

The full processed graph representation is available here. It is a processed version of the PREDICATION table (a SQL dump file) available from the National Library of Medicine site for Semantic MEDLINE. See the package vignette for details about the processing.

Software status

Resource: Travis CI
Platforms: Linux
R CMD check Build status
Test coverage Code Coverage Status

About

R package for exploring the Semantic Medline database

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages