CoreNLP packages

The main packages of CoreNLP, each with a description of what it contains, links to official documentation, and, where applicable, notes on whether and how it has been wrapped in DataLinguist:

  • edu.stanford.nlp.classify
    • The classify package provides facilities for training classifiers.
  • edu.stanford.nlp.coref
    • The CorefAnnotator finds mentions of the same entity in a text, such as when “Theresa May” and “she” refer to the same person.
  • edu.stanford.nlp.dcoref
    • The deterministic coreference resolution system is still supported in StanfordCoreNLP through the dcoref annotator, but it has been superseded by the coref annotator.
  • edu.stanford.nlp.fsm
    • Contains two implementations of finite state machines.
  • edu.stanford.nlp.graph
    • Contains the base graph implementation, plus a few graph algorithms.
  • edu.stanford.nlp.ie
    • This package contains various subpackages for information extraction.
    • The Open Information Extraction (OpenIE) annotator extracts open-domain relation triples, representing a subject, a relation, and the object of the relation.
  • edu.stanford.nlp.international
    • Contains various language-specific classes.
  • edu.stanford.nlp.io
    • Contains various IO-related utility classes.
  • edu.stanford.nlp.ling
    • This package contains the different data structures used by JavaNLP throughout the years for dealing with linguistic objects in general, of which words are the most generally used. Most data structures in this package are deprecated.
    • edu.stanford.nlp.ling.tokensregex
  • edu.stanford.nlp.math
    • Classes for simple math functionality, such as min, max, weighted average, and scientific notation.
  • edu.stanford.nlp.maxent
    • This package deals with defining and solving maximum entropy problems.
  • edu.stanford.nlp.naturalli
  • edu.stanford.nlp.net
    • Contains a couple of useful utility methods related to networks.
  • edu.stanford.nlp.neural
    • Contains classes related to neural networks, in particular recursive neural networks (RNN).
  • edu.stanford.nlp.objectbank
    • The ObjectBank class is designed to make it easy to change the format or source of data read in by other classes, and to standardize how data is read in JavaNLP classes.
  • edu.stanford.nlp.optimization
    • Numerical optimization, including a conjugate gradient implementation.
  • edu.stanford.nlp.paragraphs
    • Contains the ParagraphAnnotator class.
  • edu.stanford.nlp.parser
  • edu.stanford.nlp.patterns
  • edu.stanford.nlp.pipeline
  • edu.stanford.nlp.process
    • Contains classes for processing documents.
  • edu.stanford.nlp.quoteattribution
    • Deterministically picks out quotes from a text.
  • edu.stanford.nlp.semgraph
    • This package provides a representation of dependency graphs (normally the collapsed Stanford Dependencies representation) as a graph.
    • edu.stanford.nlp.semgraph.semgrex
      • A package for dependency graph (i.e. SemanticGraph) pattern expressions and matching these expressions to IndexedFeatureLabel instances.
  • edu.stanford.nlp.sentiment
    • StanfordCoreNLP includes the sentiment tool and various programs that support it. SentimentAnnotator implements Socher et al.’s sentiment model.
    • See also: https://nlp.stanford.edu/sentiment/
  • edu.stanford.nlp.sequences
    • This package contains classes that support sequence modeling and inference (CMMs and CRFs).
  • edu.stanford.nlp.simple
    • In addition to the fully-featured annotator pipeline interface to CoreNLP, Stanford provides a simple API for users who do not need a lot of customization.
  • edu.stanford.nlp.stats
    • A set of tools for collecting, representing, and inferring with statistics.
  • edu.stanford.nlp.swing
    • Collection of useful classes for building Swing GUIs.
  • edu.stanford.nlp.tagger
    • This subpackage is a home for taggers. They are defined in subpackages.
    • edu.stanford.nlp.tagger.maxent
      • A Maximum Entropy Part-of-Speech Tagger.
  • edu.stanford.nlp.time
    • StanfordCoreNLP includes SUTime, a library for processing temporal expressions such as February 4th, 2019. SUTime is built on top of TokensRegex.
  • edu.stanford.nlp.trees
    • A package for (NLP) trees, sentences, and similar things.
    • edu.stanford.nlp.trees.international
      • This package and its subpackages define processes related to language-specific (or perhaps language- and corpus-specific) processing.
    • edu.stanford.nlp.trees.tregex
      • Tregex is a utility for matching patterns in trees, based on tree relationships and regular expression matches on nodes (the name is short for "tree regular expressions"). Tregex comes with Tsurgeon, a tree transformation language.
  • edu.stanford.nlp.util
    • A collection of useful general-purpose utility classes.
  • edu.stanford.nlp.wordseg
    • A package for doing Chinese word segmentation.
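
The OpenIE entry under edu.stanford.nlp.ie describes relation triples made up of a subject, a relation, and the object of the relation. As a rough illustration of that shape only, the sketch below models a triple in plain Java. The names `TripleSketch`, `Triple`, and `withSubject` are hypothetical and invented for this example; they are not CoreNLP classes, and the sample triples are written by hand rather than produced by an annotator.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: a plain-Java model of a relation triple; not a CoreNLP class. */
final class TripleSketch {

    /** Hypothetical value type holding the three parts of one extracted relation. */
    static final class Triple {
        final String subject;
        final String relation;
        final String object;

        Triple(String subject, String relation, String object) {
            this.subject = subject;
            this.relation = relation;
            this.object = object;
        }

        @Override
        public String toString() {
            return "(" + subject + "; " + relation + "; " + object + ")";
        }
    }

    /** Returns the triples whose subject matches the given string exactly. */
    static List<Triple> withSubject(List<Triple> triples, String subject) {
        List<Triple> matches = new ArrayList<>();
        for (Triple t : triples) {
            if (t.subject.equals(subject)) {
                matches.add(t);
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        // Hand-written triples, not produced by any annotator.
        List<Triple> triples = new ArrayList<>();
        triples.add(new Triple("Barack Obama", "was born in", "Hawaii"));
        triples.add(new Triple("Hawaii", "is", "a state"));
        for (Triple t : withSubject(triples, "Barack Obama")) {
            System.out.println(t);
        }
    }
}
```

In real use, the triples would come from the OpenIE annotator rather than being constructed by hand; the official documentation for that annotator describes the actual classes and accessors.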