NLP microservice for performing string tokenization.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
sonnet-api
sonnet-app
sonnet-model
sonnet-models
sonnet-services
.gitignore
.travis.yml
LICENSE
README.md
pom.xml

README.md

Sonnet Tokenization Engine

Build Status

Sonnet Tokenization Engine is a stateless microservice for performing NLP string tokenization. It is powered by Idyl NLP. Sonnet Tokenization Engine is available on DockerHub, AWS Marketplace, and Azure Marketplace.

Sonnet Tokenization Engine can use trained models to perform the string tokenization or default internal tokenizers.

Quick Start

  • To build: mvn clean install
  • To run: java -jar sonnet-app/target/sonnet.jar
  • To tokenize: curl "http://localhost:9040/api/tokenize?language=eng" -d "George Washington was president." -H "Content-Type: text/plain"

This command produces the response:

["George","Washington","was","president"]

The array of tokens can then be utilized by other NLP microservices and applications.

Clients

The NLP Building Blocks Java SDK includes a client for Sonnet's API.

License

Sonnet Tokenization Engine is licensed under the Apache License, version 2.0.

Copyright 2018 Mountain Fog, Inc.