Skip to content
This repository has been archived by the owner on May 24, 2022. It is now read-only.

hypertornado/czech-stemmer

Repository files navigation

czech-stemmer

Czech stemmer is pure Ruby port of CzechStemmer Java class from Lucene.

Installation

gem install czech-stemmer

Usage

require 'czech-stemmer'

CzechStemmer.stem("předseda") # => "předsd"
CzechStemmer.stem("mladými") # => "mlad"

Stemmer works only with lowercased letters in suffixes. Based on Lucene CzechStemmer with all test passed. Note the difference between stemming and lemmatization.

Copyright

Copyright (c) 2014 Ondrej Odchazel. See LICENSE.txt for further details.

About

czech words stemmer in pure Ruby

Resources

License

Stars

Watchers

Forks

Packages

No packages published