Python library for work with BioCreative files
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
src
test_input
BioC.dtd
CHANGES.txt
LICENSE.txt
README.txt

README.txt

PyBioC is a native python library to deal with BioCreative XML data,
i. e. to read from and to write to it.

Usage:
------
Two example programs, test_read+write.py and stemming.py are shipped in the
src/ folder.

test_read+write.py shows the very basic reading and writing capability of the 
library.

stemming.py uses the Python Natural Language Toolkit (NLTK) library to 
manipulate a BioC XML file read in before; it then tokenizes the corresponding 
text, does stemming on the tokens and transforms the manipulated PyBioC 
objects back to valid BioC XML format.