Sonnet Tokenization Engine
Sonnet Tokenization Engine is a stateless microservice for NLP string tokenization, powered by Idyl NLP. It is available on DockerHub, AWS Marketplace, and Azure Marketplace.
Sonnet Tokenization Engine can tokenize strings using either trained models or its default internal tokenizers.
- To build:
mvn clean install
- To run:
java -jar sonnet-app/target/sonnet.jar
- To tokenize:
curl "http://localhost:9040/api/tokenize?language=eng" -d "George Washington was president." -H "Content-Type: text/plain"
This command responds with an array of tokens, which other NLP microservices and applications can then use.
The NLP Building Blocks Java SDK includes a client for Sonnet's API.
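For callers that do not use the SDK, the curl request shown earlier can also be sent with the JDK's built-in HTTP client (Java 11 or later). The following is a minimal sketch, not the SDK's client: the class name SonnetTokenizeExample and the helper buildRequest are hypothetical names chosen for illustration, and the host, port, path, and language parameter are taken from the curl example. The response body is printed as returned, without assuming its exact format.

```java
import java.io.IOException;
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;
import java.time.Duration;

// Hypothetical example class; it is not part of Sonnet or the NLP Building Blocks SDK.
class SonnetTokenizeExample {

    // Builds the same request as the curl example:
    // POST the text as text/plain to /api/tokenize?language=<language>.
    static HttpRequest buildRequest(String baseUrl, String language, String text) {
        String query = "language=" + URLEncoder.encode(language, StandardCharsets.UTF_8);
        return HttpRequest.newBuilder()
                .uri(URI.create(baseUrl + "/api/tokenize?" + query))
                .header("Content-Type", "text/plain")
                .POST(HttpRequest.BodyPublishers.ofString(text, StandardCharsets.UTF_8))
                .build();
    }

    public static void main(String[] args) throws InterruptedException {
        // Assumes Sonnet is running locally on port 9040, as in the curl example.
        HttpRequest request = buildRequest(
                "http://localhost:9040", "eng", "George Washington was president.");

        HttpClient client = HttpClient.newBuilder()
                .connectTimeout(Duration.ofSeconds(5))
                .build();

        try {
            HttpResponse<String> response =
                    client.send(request, HttpResponse.BodyHandlers.ofString());
            System.out.println("Status: " + response.statusCode());
            // Print the body as returned; its exact format is not assumed here.
            System.out.println(response.body());
        } catch (IOException e) {
            // Reached when the service is not running or the connection fails.
            System.err.println("Could not reach Sonnet: " + e.getMessage());
        }
    }
}
```

Keeping the request construction in its own method makes it possible to check the URL, method, and headers without a running Sonnet instance.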
Sonnet Tokenization Engine is licensed under the Apache License, version 2.0.
Copyright 2018 Mountain Fog, Inc.