Hungarian NLP tools API

HuNLP: Hungarian language processing for Humans

The aim of this project is to provide unified access to Hungarian NLP tools. To that end, it provides wrapper classes and a REST API. The integrated tools include magyarlanc and Szeged NER (see the Changelog for versions).

Usage

Before running the application, make sure you have at least 3 GB of free memory.

Get the library

You can compile the library locally (this requires Oracle Java 8 and Maven): $ make dependencies install

Or find zipped jars here

Alternatively, use the prebuilt Docker image directly: $ docker pull oroszgy/hunlp

Running the application

If you compiled from source: $ make serve

If you downloaded the binaries: $ hunlp.sh

Or, to run the Docker container: $ docker run -it -p 9090:9090 oroszgy/hunlp
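
Whichever way the server is started, a script that calls it right away may run before the port accepts connections. A minimal sketch of a readiness check follows, assuming the service listens on localhost:9090 as in the Docker command above. The helper name `wait_for_port` and its defaults are illustrative and not part of HuNLP.

```python
import socket
import time


def wait_for_port(host="localhost", port=9090, timeout=60.0, interval=1.0):
    """Return True once a TCP connection to host:port succeeds, False on timeout."""
    deadline = time.monotonic() + timeout
    while True:
        try:
            # A successful connect only proves that something is listening,
            # not that the HuNLP service has finished initialising.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)


# Example:
# if wait_for_port():
#     print("port 9090 is accepting connections")
```

The check is deliberately protocol-agnostic: it does not call any HuNLP endpoint, so it makes no assumption about the API beyond the port number.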

Using the application

Through the REST API

$ curl -X POST -H "Content-Type: application/json" -d '{"text": "Szia világ!"}' "http://localhost:9090/v1/annotate"
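
The same request can be sent from a script. The sketch below mirrors the curl command using only the Python standard library. The function names are illustrative, and because the response schema is not described in this README, the reply is simply decoded as JSON and returned unchanged (that the reply is JSON at all is an assumption).

```python
import json
import urllib.request


def build_annotate_request(text, base_url="http://localhost:9090"):
    """Build the same POST request as the curl command above."""
    body = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        base_url + "/v1/annotate",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def annotate(text, base_url="http://localhost:9090", timeout=30):
    """Send the text to a running server and return the decoded JSON reply."""
    request = build_annotate_request(text, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        # Assumption: the reply is UTF-8 encoded JSON; its schema is not
        # documented in this README, so it is returned as-is.
        return json.loads(response.read().decode("utf-8"))


# Example (needs a running server):
# result = annotate("Szia világ!")
```

Building the request separately from sending it makes it possible to inspect the URL, headers and body without a server running.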

Java

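import hu.nlp.api.HuNlp;
// Document must be imported as well; its package is not shown in this README.

class MainApp {
    public static void main(String[] args) {
        HuNlp nlp = new HuNlp();
        // Java has no call operator, so the Kotlin-style nlp(...) call is written
        // as nlp.invoke(...), assuming HuNlp declares `operator fun invoke`.
        Document doc = nlp.invoke("Ez itt egy példa. Vajon működik?");
    }
}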

Kotlin

import hu.nlp.api.HuNlp
// Document must be imported as well; its package is not shown in this README.

fun main(args: Array<String>) {
    val nlp = HuNlp()
    val doc: Document = nlp("Ez itt egy példa. Vajon működik?")
}

Python

Make sure the wrapper is installed: $ pip install https://github.com/oroszgy/hunlp/releases/download/0.2/hunlp-0.2.0.tar.gz

from hunlp import HuNlp

nlp = HuNlp()
doc = nlp("Ez itt egy példa. Vajon működik?")

print(doc.ents)

for sent in doc:
    for tok in sent:
        print(tok.text, tok.lemma, tok.tag)
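
For further processing it is often handier to have the tokens in a flat list. The sketch below relies only on what the example above shows: a document iterates over sentences, a sentence iterates over tokens, and each token has `text`, `lemma` and `tag` attributes. The helper names `token_rows` and `tag_counts` are illustrative and not part of the wrapper.

```python
from collections import Counter


def token_rows(doc):
    """Flatten a document into (sentence_index, text, lemma, tag) tuples."""
    return [
        (index, tok.text, tok.lemma, tok.tag)
        for index, sent in enumerate(doc)
        for tok in sent
    ]


def tag_counts(doc):
    """Count how often each tag occurs in the document."""
    return Counter(tag for _, _, _, tag in token_rows(doc))


# Example (needs the wrapper installed and working):
# from hunlp import HuNlp
# rows = token_rows(HuNlp()("Ez itt egy példa. Vajon működik?"))
```

Because the helpers only use iteration and attribute access, they can be tried out on any stand-in object with the same shape.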

Changelog

0.2

Experimental Kotlinization

0.1

First experimental release containing magyarlanc (21-01-2016) and Szeged NER (20-06-2014).

License

HuNLP is licensed under LGPL3; however, the integrated libraries might use different (commercial) licenses.

(c) Gyorgy Orosz, 2017