This repository contains text and annotations of the Quran.
The corpus is modelled as a text-fabric data resource which is optimized for processing the text as data. This makes it easier to do all kinds of research on the text.
2019-01-07: the Quran TF-app and data and tutorials released.
This is work in progress: we need to do more testing and documenting
We have taken the source materials from Quranic Arabic Corpus 0.4 (2011) by Kais Dukes and Tanzil.
The data in TF format is licensed as CC-BY 4.0.
The conversion code and all other materials are licensed as Unlicense, i.e. public domain.
Read more about provenance and license in About.
Start with the tutorial.
The features are documented in features.
English and Dutch translations of the Quran are included in the feature data.