Morfologik Polish Lemmatizer plugin for Elasticsearch
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
gradle/wrapper
src
.gitignore
.travis.yml
LICENSE
README.md
build.gradle
gradlew
gradlew.bat
settings.gradle

README.md

Morfologik Polish Lemmatizer plugin for Elasticsearch

Build Status Maven Central

Morfologik plugin for elasticsearch 6.x, 5.x and 2.x. It's lucene-analyzers-morfologik wrapper for elasticsearch.

Plugin provide "morfologik" analyzer and "morfologik_stem" token filter.

Originally created by https://github.com/monterail/elasticsearch-analysis-morfologik

Build

./gradlew clean build

Examples

"morfologik" analyzer

Request:

GET _analyze
{
  "analyzer": "morfologik",
  "text": "jestem"
}

Response:

{
  "tokens": [
    {
      "token": "być",
      "start_offset": 0,
      "end_offset": 6,
      "type": "<ALPHANUM>",
      "position": 0
    }
  ]
}

"morfologik_stem" token filter

Request:

GET _analyze
{
  "tokenizer": "standard",
  "filter": ["morfologik_stem"],
  "text": "jestem"
}

Response:

{
  "tokens": [
    {
      "token": "być",
      "start_offset": 0,
      "end_offset": 6,
      "type": "<ALPHANUM>",
      "position": 0
    }
  ]
}

Supported elasticsearch versions:

All ready to install plugins are deployed to maven central: http://central.maven.org/maven2/pl/allegro/tech/elasticsearch/plugin/elasticsearch-analysis-morfologik/

Elasticsearch 6.x

  • 6.0.x (6.0.0, 6.0.1)
  • 6.1.x (6.1.0, 6.1.1, 6.1.2, 6.1.3, 6.1.4)
  • 6.2.x (6.2.1, 6.2.2, 6.2.3, 6.2.4)
  • 6.3.x (6.3.0, 6.3.1, 6.3.2)
  • 6.4.x (6.4.0, 6.4.1, 6.4.2, 6.4.3)
  • 6.5.x (6.5.0, 6.5.1, 6.5.2, 6.5.3)

Elasticsearch 5.x

  • 5.0.x (5.0.0, 5.0.1, 5.0.2)
  • 5.1.x (5.1.1, 5.1.2)
  • 5.2.x (5.2.0, 5.2.1, 5.2.2)
  • 5.3.x (5.3.0, 5.3.1, 5.3.2, 5.3.3)
  • 5.4.x (5.4.0, 5.4.1, 5.4.2, 5.4.3)
  • 5.5.x (5.5.0, 5.5.1, 5.5.2)
  • 5.6.x (5.6.0, 5.6.1, 5.6.2, 5.6.3, 5.6.4, 5.6.5, 5.6.10)

Install in Elasticsearch 5.x and 6.x

<ES directory>/bin/elasticsearch-plugin install \
  pl.allegro.tech.elasticsearch.plugin:elasticsearch-analysis-morfologik:6.5.3

for version <= 5.4.0

<ES directory>/bin/elasticsearch-plugin install \
  http://central.maven.org/maven2/pl/allegro/tech/elasticsearch/plugin/elasticsearch-analysis-morfologik/5.3.1/elasticsearch-analysis-morfologik-5.3.1-plugin.zip

tip: select proper version in url to plugin

Elasticsearch 2.x

  • 2.4.x (2.4.1, 2.4.2, 2.4.3, 2.4.4, 2.4.6)

Install in Elasticsearch 2.x

<ES directory>/bin/plugin install \
  http://central.maven.org/maven2/pl/allegro/tech/elasticsearch/plugin/elasticsearch-analysis-morfologik/2.4.2/elasticsearch-analysis-morfologik-2.4.2-plugin.zip

tip: select proper version in url to plugin