Skip to content
JSON-NLP Schema for transfer of NLP output using JSON
Branch: master
Clone or download
Oren Baldinger
Latest commit a1fee65 Apr 23, 2019
Type Name Latest commit message Commit time
Failed to load latest commit information.


Brought to you by the!

Version pre-alpha 0.001

The JSON-NLP Schema is a standard middle-ware type for the transfer of NLP output from various NLP pipelines and chains to a uniform JSON format.

This project provides the JSON-NLP specification, a proposed JavaScript Object Notation (JSON) format and JSON Schema for Natural Language Processing (NLP) annotations and example use-cases for common NLP pipelines like spaCy, Stanford CoreNLP, OpenNLP, LingPipe, and so on.

The proposed schema provides an abstraction and normalization layer, and a uniform API for NLP annotations generated by various heterogeneous NLP pipelines and components.

An exhaustive list of major NLP components is provided in the whitepaper (Cavar et al., to appear 2019, see on Arxiv) that describes the different data structures that these components can generate. For the majority of these components, the output conversion to JSON-NLP is explained and discussed in the whitepaper. In addition to a detailed description of the JSON-NLP schema properties, various JSON-NLP examples are provided and discussed.

The JSON-NLP schema provides a wider annotation framework than most common NLP-pipelines might need. There are various related projects that will explain the advanced properties of the JSON-NLP format.

Used Standards and Technologies

Standards used:

Copyright, License, Citing

For licensing details see the LICENSE file. All NLP-Lab code is published under the Apache License Version 2.0.

To cite this work, refer to the paper:

Damir Cavar, Oren Baldinger, Joshua Herring, Umang Mehta, Yiwen Zhang, Shantanu Bedekar, Shreejith Panicker (2019) An Annotation Encoding Schema for Natural Language Processing using JSON: NLP JSON Schema Version 0.1. Technical Report, NLP Lab, Indiana University, Version 1.0 from November 2018.

(Link to paper is coming soon)

This code, documentation, and examples are brought to you by the

(C) 2018-2019 The

You can’t perform that action at this time.