Initial attempt to do entity name recognition in CDS use case
Java
Switch branches/tags
Nothing to show
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
src
.gitignore
Notes.txt
README
log_conversion.txt
pom.xml
short-list.ttl
short-list.yaml
sources.ttl
sources.txt
sources.yaml
two-lists.ttl

README

Libraries used
	- Stanford NLP library http://nlp.stanford.edu/software/CRF-NER.html
	- PDF Box library http://pdfbox.apache.org/
	- OWL API http://owlapi.sourceforge.net/
	- Lucene indexer	 

	
#Maven profiles:

	- mvn -P create-snapshot
	  Maven profile to process sources.txt in the main maven directory containing local file, url to create local snapshot of a source html file.
	  Results will be in directory results/snapshots
	
	- mvn -P call-bioportal
	  Maven  profile to annotate snapshots created from previous profile, and call bioportal annotator
	  Results will be in directory results/bioportal-annotation
	  
	- mvn -P create-annotation-ontology
	  Maven profile to create annotation ontology from previous profile.
	  Results will be in directory results/annotation-ontology