The CLiGS textbox
This repository contains several text collections made available by the CLiGS junior research group (see http://cligs.hypotheses.org). As of May 2018, these are:
- Collection of 19th Century Spanish-American Novels (1880-1916), edited by Ulrike Henny-Krahmer: 24 novels
- Corpus of Spanish Novels from 1880-1940, edited by José Calvo Tello: 39 novels
- Collection de nouvelles françaises du dix-neuvième siècle, edited by Stefanie Popp: 28 novellas
- Collection de romans français du dix-neuvième siècle, edited by Christof Schöch and Stefanie Popp: 36 novels.
- Collection de pièces the théâtre français du dix-septième siècle, edited by Christof Schöch: 100 plays.
- Collection of 19th Century Portuguese Novels (1840-1910), edited by Ulrike Henny-Krahmer: 30 novels
- Collection of Italian Short Stories and Novellas (1880s-1920s), edited by Ulrike Henny-Krahmer: 90 short stories and novellas
- Corpus of Spanish Short Stories from 1880-1940, edited by José Calvo Tello: 20 collections of short stories, 302 single short stories
- Collection of Italian Novels (1850 and 1915), edited by Katrin Betz and Christof Schöch: 21 novels
The text collections are available by cloning the repository, downloading the entire repository as a ZIP file or downloading individual text collections as ZIP files.
All texts are in the public domain. The markup and metadata we have added are provided with a CC-BY (Creative Commons Attribution, see http://creativecommons.org/licenses/by/4.0/) license.
Each collection includes a description of the criteria of text selection, the available data formats, a citation suggestion, etc. More information about formal schemas (for example the TEI schema linked to from all the TEI files) can be found in the reference repository of CLiGS available at https://github.com/cligs/reference.
There is a collection of Python utility scripts which can be used for several tasks related to the textbox, for example to extract metadata automatically from the TEI files. This set of scripts is called the CLiGS toolbox and is available at https://github.com/cligs/toolbox. The toolbox reflects ongoing work and is not organized in official releases.