==========
This project provides a WebSocket server that wraps the Stanford Part-of-Speech tagger. This makes it easier to get part-of-speech tags from JavaScript for arbitrary text. Please note that this project is just a prototype that illustrates the utility of WebSockets for NLP.
- Java
- Ant
- Stanford CoreNLP
- Java WebSocket
- A modern browser that supports WebSockets (See http://caniuse.com/websockets). I recommend the latest version of Mozilla Firefox.
- Clone the Java WebSocket repository and build
java_websocket.jar
(you may also download the jar directly from the repository). Add this jar to your CLASSPATH. - Download
stanford-corenlp-1.3.5.jar
andstanford-corenlp-1.3.5-models.jar
and add them to your CLASSPATH (Note: make sure to download the latest version which at this point happens to be 1.3.5) - Run
ant
to build the StanfordTaggerServer class. - Run the included script
start_server.sh
to start the WebSocket server. - Load
tag.html
into Firefox. - Tag away.
- Please note that currently the server needs to be killed manually since this is what I want. Obviously, this can be easily changed with the same thinking that goes into how to properly kill a regular socket server.
- It should be simple to extend the server to do anything else CoreNLP can do besides tagging.
Here's a screenshot of the jQuery powered tag.html
page in action: