
clulab/LemPORT

 
 


What is it?

This is the CLU Lab's fork of LemPORT, a lemmatizer for Portuguese, modified to build with sbt 1.x and to work with the Universal POS tagset.

Citations

If you use LemPORT in your project, please cite the following work:

@InProceedings{rodrigues_et_al:OASIcs:2014:4575,
  author ={Ricardo Rodrigues and Hugo Gon{\c{c}}alo Oliveira and Paulo Gomes},
  title ={{LemPORT: a High-Accuracy Cross-Platform Lemmatizer for Portuguese}},
  booktitle ={3rd Symposium on Languages, Applications and Technologies},
  pages ={267--274},
  series ={OpenAccess Series in Informatics (OASIcs)},
  ISBN ={978-3-939897-68-2},
  ISSN ={2190-6807},
  year ={2014},
  volume ={38},
  editor ={Maria Jo{\~a}o Varanda Pereira and Jos{\'e} Paulo Leal and Alberto Sim{\~o}es},
  publisher ={Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address ={Dagstuhl, Germany},
  URL ={http://drops.dagstuhl.de/opus/volltexte/2014/4575},
  URN ={urn:nbn:de:0030-drops-45753},
  doi ={10.4230/OASIcs.SLATE.2014.267},
  annote ={Keywords: lemmatization, normalization, rules, lexicon}
}

Changes

Please see the CHANGES file.

Use

Compilation

sbt compile

Releases

Releases of this fork are published to the Sonatype Central Maven repository.

If using sbt, add the following entry to your libraryDependencies (replace "x.y.z" with your selected version):

"org.clulab" % "lemport" % "x.y.z"
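
For context, here is a minimal sketch of how that entry might sit in a build.sbt. The project name and the "x.y.z" version are placeholders, not values taken from this repository; substitute your own project settings and a released LemPORT version.

```scala
// build.sbt (illustrative sketch; "my-project" and "x.y.z" are placeholders)
name := "my-project"

// A single % is used, as in the entry above, so sbt does not append
// a Scala binary-version suffix to the artifact name.
libraryDependencies += "org.clulab" % "lemport" % "x.y.z"
```

sbt resolves dependencies from Maven Central by default, so an extra resolver entry should not normally be needed for a released version.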

Building a fat jar

sbt assembly


 
 
 
