M321-G1.xml
README.md
reader.py
requirements.txt

Referenzkorpus Mittelhochdeutsch

Source and license

Main page of the project

License:

The Referenzkorpus Mittelhochdeutsch is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

The corpus itself is not modified. The code in this repository only parses it.

Corpus retrieval

  1. Go to https://www.linguistics.rub.de/rem/access/index.html.
  2. Click on "CORA-XML ALS .TAR.XZ" or "CORA-XML ALS .ZIP" ("CORA-XML as .tar.xz" or "CORA-XML as .zip").
  3. Click on "Herunterladen" ("Download").
  4. Uncompress the downloaded file.
  5. You now have a folder named rem-corraled-20161222 (2019-09-18) containing XML files, each of which is an annotated text.
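
Steps 4 and 5 can also be done from Python. The following is a minimal sketch using only the standard library; the function name `extract_corpus` and the archive and folder names in the usage note are hypothetical and not part of this repository.

```python
import tarfile
import zipfile
from pathlib import Path


def extract_corpus(archive_path, target_dir):
    """Unpack a .zip or .tar.xz archive and return the XML files it contains.

    Hypothetical helper for illustration; it is not part of reader.py.
    """
    archive_path = Path(archive_path)
    target_dir = Path(target_dir)
    target_dir.mkdir(parents=True, exist_ok=True)

    if zipfile.is_zipfile(archive_path):
        with zipfile.ZipFile(archive_path) as archive:
            archive.extractall(target_dir)
    elif tarfile.is_tarfile(archive_path):
        # tarfile detects the xz compression on its own in the default mode.
        with tarfile.open(archive_path) as archive:
            archive.extractall(target_dir)
    else:
        raise ValueError(f"Unsupported archive format: {archive_path}")

    # Search recursively, because the archive unpacks into its own folder.
    return sorted(target_dir.rglob("*.xml"))
```

Calling `extract_corpus("rem.zip", "corpus")`, where `rem.zip` stands for whatever name the downloaded file has, returns the paths of the extracted XML files in sorted order. Only unpack archives from a source you trust, since `extractall` writes whatever paths the archive contains.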

Code

The code in this repository parses individual XML files from the corpus.
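
To illustrate what parsing a single file can look like, here is a minimal sketch using `xml.etree.ElementTree`. It is not the code in reader.py. The function name `read_tokens` is hypothetical, and the element and attribute names (`token`, `tok_anno`, `utf`, `tag`) are assumptions about the CORA-XML layout that should be checked against the downloaded files.

```python
import xml.etree.ElementTree as ET


def read_tokens(source, token_tag="token", annotation_tag="tok_anno"):
    """Return one dict per annotated token found in an XML file.

    The tag names are assumptions about the corpus format; pass other
    names if the files use a different layout.
    """
    tree = ET.parse(source)
    tokens = []
    for token in tree.getroot().iter(token_tag):
        for anno in token.iter(annotation_tag):
            # The surface form is assumed to be stored in a "utf" attribute.
            entry = {"form": anno.get("utf")}
            # Each child (for example lemma or pos) is assumed to store
            # its value in a "tag" attribute.
            for child in anno:
                entry[child.tag] = child.get("tag")
            tokens.append(entry)
    return tokens
```

With the sample file from this repository, `read_tokens("M321-G1.xml")` would return a list of dictionaries, one per annotated token, provided the assumptions above hold for that file.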
