This package references two repositories, OpenNLP and CherubNLP. UnityNLP is a collection of natural language processing tools written in C# and targeted at the Unity engine. It currently provides the following NLP tools:
- tokenizer
- sentence splitter
- part-of-speech tagger
- chunker (used to "find non-recursive syntactic annotations such as noun phrase chunks")
- parser
- name finder
- coreference tool
- interface to the WordNet lexical database
- topical classifier
You can find the documentation at: https://unitynlp.readthedocs.io/
External dependencies:
- Clone the UnityUtil repository into your Packages folder.
- Clone the UnityAI repository into your Packages folder.
- Clone this repository into your Packages folder.
- Download all essential models and import them into the project.
- Place the models in the StreamingAssets folder.
- And you are ready to go!
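After placing the models, it can help to confirm that Unity actually sees them. The following is a minimal sketch, not part of UnityNLP: the `ModelFolderCheck` class and the `modelSubfolder` field are hypothetical names chosen for illustration, and the sketch assumes a desktop or Editor platform where StreamingAssets is a regular directory on disk. On Android and WebGL the folder cannot be read with `System.IO`, so this check would not apply there.

```csharp
using System.IO;
using UnityEngine;

// Hypothetical helper (not part of UnityNLP): logs how many files
// Unity can find in the StreamingAssets folder at startup.
public class ModelFolderCheck : MonoBehaviour
{
    // Assumed optional subfolder; leave empty if the models sit
    // directly inside StreamingAssets.
    [SerializeField] private string modelSubfolder = "";

    private void Start()
    {
        // Application.streamingAssetsPath resolves to Assets/StreamingAssets
        // in the Editor and to the platform-specific location in a build.
        string modelPath = Path.Combine(Application.streamingAssetsPath, modelSubfolder);

        if (!Directory.Exists(modelPath))
        {
            Debug.LogWarning($"Model folder not found: {modelPath}");
            return;
        }

        // Count files recursively, skipping the .meta files Unity creates in the Editor.
        int modelFileCount = 0;
        foreach (string file in Directory.GetFiles(modelPath, "*", SearchOption.AllDirectories))
        {
            if (!file.EndsWith(".meta"))
            {
                modelFileCount++;
            }
        }

        Debug.Log($"Found {modelFileCount} file(s) under {modelPath}");
    }
}
```

Attach the component to any GameObject in a scene and check the Console on Play; a warning or a count of zero suggests the models were imported into a different folder.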
Make a feature request in the issues tab if you think there is something missing or if you have new ideas!
All the sample code is in the Samples~ folder, which can be imported into Unity from the Package Manager.
For the full list of part-of-speech abbreviations, please refer to the Penn Treebank Project.
This repository as a whole is licensed under the Apache License 2.0. Individual files may have a different, but compatible license.
See license file for details.