v1.6.7: Rakefile: Added files for Porter stemmer to packages. Fixed that LP_F…

@blackwinter blackwinter released this 16 Sep 08:42
· 572 commits to master since this release
  • Initial import of Lingo into Subversion.
  • BUGFIX: The caching mechanism altered the spelling of tokens
    (upper/lower case).
  • English dictionaries, a sample article, and a configuration for English
    were added. Invoke with lingo.rb -c lingo-en.cfg -l en <file>.
  • Integrated the test environment for developers.
  • The attendees in the Lingo configuration files are now chained
    automatically. This saves tedious manual linking and makes the
    configuration more readable.
  • Modified parameters for Tokenizer and building of databases to handle UTF-8
    correctly. Printable characters from the ASCII charset and every UTF
    character greater than 0x80 are now recognized as non-word-breaking
    characters, so diacritical characters are now handled correctly.
  • Added a command line option -p for lingo.rb to enable detailed
    performance measurements for each individual attendee.
  • Allow Lingo.new to be called from a different directory. (Still need to
    Dir.chdir to #talk because of the dictionaries...).
  • Allow Textreader to read from STDIN.
  • Allow Lingo to be called from an external library or program; pass a string
    and get an array back.
  • Allow Textwriter to write to STDOUT.
  • Match suffixes case-insensitively.
  • Don't exclude synonyms that are eponymous with the original word.
  • Don't stop variating hyphenated words if any component was taken as-is.
  • Basic Ruby 1.9 compatibility.
  • Words should be counted per record, not per document, in LIR mode.
  • Added Porter stemmer.
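
The UTF-8 change to the Tokenizer noted above can be illustrated with a minimal sketch. This is not Lingo's actual tokenizer code: the names `WORD` and `split_words` are hypothetical, and the sketch assumes that "printable ASCII characters" means ASCII letters and digits. Every non-ASCII character is treated as part of a word, so diacritics no longer split tokens.

```ruby
# Hypothetical sketch, not taken from Lingo's source.
# Assumption: ASCII letters/digits and any character outside the
# ASCII range (0x00-0x7F) are treated as non-word-breaking.
WORD = /(?:[A-Za-z0-9]|[^\x00-\x7F])+/

# Split a UTF-8 string into words; everything else (spaces,
# punctuation) acts as a word break.
def split_words(text)
  text.scan(WORD)
end
```

For example, `split_words("Die Größe, naïve café!")` returns `["Die", "Größe", "naïve", "café"]`, whereas a pattern limited to ASCII letters would break "Größe" into "Gr" and "e".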