Skip to content

v1.8.7

Compare
Choose a tag to compare
@blackwinter blackwinter released this 15 Feb 13:02
· 71 commits to master since this release
  • Added Lingo::Attendee::LsiFilter to correlate semantically related terms
    (LSI) over the
    "corpus" of all files processed during a single program invocation; requires
    lsi4r which in turn requires
    rb-gsl. [EXPERIMENTAL: Interface may
    be changed or removed in next release.]
  • Added Lingo::Attendee::HalFilter to correlate semantically related terms
    (HAL) over
    individual documents; requires hal4r
    which in turn requires rb-gsl.
    [EXPERIMENTAL: Interface may be changed or removed in next release.]
  • Added Lingo::Attendee::AnalysisFilter and associated lingoctl tooling.
  • Multiword dictionaries can now identify hyphenated variants (e.g.
    automatic data-processing); set hyphenate: true in the
    dictionary config.
  • Lingo::Attendee::Tokenizer no longer considers hyphens at word edges as part
    of the word. As a consequence, Lingo::Attendee::Dehyphenizer has been
    dropped.
  • Dropped Lingo::Attendee::NonewordFilter; use Lingo::Attendee::VectorFilter
    with option lexicals: '\?' instead.
  • Lingo::Attendee::TextReader and Lingo::Attendee::TextWriter learned
    encoding option to read/write text that is not UTF-8 encoded;
    configuration files and dictionaries still need to be UTF-8, though.
  • Lingo::Attendee::TextReader and Lingo::Attendee::TextWriter learned to
    read/write Gzip-compressed files (file extension .gz or .gzip).
  • Lingo::Attendee::Sequencer learned to recognize 0 in the pattern to match
    number tokens.
  • Fixed Lingo::Attendee::TextReader to recognize BOM in input files; does not
    apply to input read from STDIN.
  • Fixed regression introduced in 1.8.6 where Lingo::Attendee::Debugger would
    no longer work immediately behind Lingo::Attendee::TextReader.
  • Fixed lingoctl copy commands when overwriting existing files.
  • Refactored Lingo::Database::Crypter into a module.
  • JRuby 9000 compatibility.