Plain English Summarization of Contracts

Please cite our NAACL NLLP Workshop paper as:

@inproceedings{manor-li-2019-plain,
    title = "Plain {E}nglish Summarization of Contracts",
    author = "Manor, Laura  and
      Li, Junyi Jessy",
    booktitle = "Proceedings of the Natural Legal Language Processing Workshop 2019",
    month = jun,
    year = "2019",
    address = "Minneapolis, Minnesota",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/W19-2201",
    pages = "1--11",
}

Files

Data

This dataset consists of the data included in the quantitative analysis described in the paper. Sets whose reference summary has more tokens than the original text are excluded. Each set consists of:

  • uid: unique identification string
  • original_text: cleaned & parsed original text
  • reference_summary: cleaned & parsed reference summary
  • doc: name of the document which the original text is from

TL;DRLegal

Each tldrlegal set may also have:

  • title: "title" of the set (from website)

TOS;DR

Each tos;dr set may also have:

  • note: Mostly empty, a space for additional notes from the annotator
  • urls: URL of the respective company
  • case_text: raw text from the "case" (from website)
  • case_code: annotation for "case" text
  • title_text: raw text from the "title" (from website)
  • title_code: annotation for "title" text
  • tldr_text: raw text from the "tldr" (from website)
  • tldr_code: annotation for "tldr"

A note on annotation:

  • Each code begins with a number, which is the 'ranking' of the quality of the respective text as chosen by the annotator
  • A code may also have a letter, which stands for:
    • s: standard -- this exact text (without company names) occurs at least one other time in a different set.
    • j: jurisdiction -- this text deals with jurisdiction
    • q-: quote(-) -- this text is an excerpt of the original text
    • d: description -- this text is a description of what the original text talks about, but may not give details
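
The scheme above can be read programmatically along these lines. This is a minimal sketch that assumes a code is a string made of the numeric ranking followed directly by at most one of the letter tags (for example "2s" or "1q-"). The exact layout of the `*_code` values is not specified here, so check it against the data before relying on this. `parse_code` and `TAGS` are hypothetical names.

```python
import re

# Letter tags and their meanings, as listed in the annotation note.
TAGS = {
    "s": "standard",
    "j": "jurisdiction",
    "q-": "quote(-)",
    "d": "description",
}

# Assumed layout: leading digits (the ranking), then an optional tag.
CODE_PATTERN = re.compile(r"^\s*(\d+)\s*(\S*)\s*$")


def parse_code(code):
    """Split an annotation code into (ranking, tag meaning or None)."""
    match = CODE_PATTERN.match(code)
    if match is None:
        raise ValueError(f"unrecognized annotation code: {code!r}")
    ranking = int(match.group(1))
    suffix = match.group(2)
    if not suffix:
        return ranking, None
    # Unknown suffixes are returned unchanged rather than guessed at.
    return ranking, TAGS.get(suffix, suffix)


print(parse_code("2s"))
print(parse_code("1q-"))
print(parse_code("3"))
```

Returning unknown suffixes unchanged keeps the sketch from silently dropping annotations that do not fit the assumed layout.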
