The pceec2 repository contains:
a revised and corrected version of the parsed files in the PCEEC.
The original 2006 version (https://llds.ling-phil.ox.ac.uk/llds/xmlui/handle/20.500.14106/2510) was deposited with the Oxford Text Archive (OTA). The OTA has not responded to repeated requests to deposit the corrected version with them. The main author of the original version, Terttu Nevalainen, and the main author of the parsed version, Ann Taylor, have given permission to distribute the corrected version on GitHub.
a lemmatized version of those files based on the "New English Dictionary based on historical principles" (https://archive.org/details/oed01arch/page/522/mode/2up). See the annotation guidelines for details (https://github.com/beatrice57/annotation-guidelines-for-ppche or https://www.ling.upenn.edu/~beatrice/corpus-ling/annotation-2022/lemmatization.html).
a corrected version of the accompanying sociolinguistic information
accompanying documentation concerning the source texts
Parsed Corpus of Early English Correspondence, second edition. 2022. Annotated by Ann Taylor, Arja Nurmi, Anthony Warner, Susan Pintzuk, and Terttu Nevalainen. Revised, corrected, and lemmatized by Beatrice Santorini. Compiled by the CEEC Project Team. https://github.com/beatrice57/pceec2
Beatrice Santorini (beatrice DOT santorini AT gmail DOT com)
Same as for the 2006 release. https://llds.ling-phil.ox.ac.uk/llds/xmlui/handle/20.500.14106/2510