title | author | date | ORCID |
---|---|---|---|
Readme: Digital Manār |
Till Grallert |
2019-03-29 |
orcid.org/0000-0002-5739-8094 |
This repository is part of OpenArabicPE. It is home to the digital (TEI XML) edition of Rashīd Riḍā's journal al-Manār. It was originally set up in early 2016, inspired by and following the principles of Digital Muqtabas. However, the files lay dormant for almost three years. If you just want to browse the edition in a more human-readable view, start here. All bibliographic metadata is available as part of OpenArabicPE's public Zotero group. A simple Google search has been set-up to search the full-text of OpenArabicPE's entire corpus.
The digital text was downloaded from shamela.ws. Digital facsimiles are available from
- According to Florian Zemmin, the text from shamela.ws is missing all of Riḍā's tafṣīr that made up "more than one fifth" of al-Manār's text corpus [-@Zemmin+2016, 232].
- The EPub / HTML from shamela.ws looks very different from other works I have seen and which were easly converted to TEI XML (re-)using the same code. I therefore asume that it is of a different origin than the other early twentieth-century texts.
- I used regex to structure the file into
- volumes:
<div type="volume" n="\d+"/>
- issues:
<div type="issue"/>
as children of the volumes
- volumes:
- This was then split into one file per issue using the volume and the issue counts
- The division into articles and sections is imperfect as shamela did not record all section titles
- I used regex to structure the file into
- Authorship information: a lot of the articles carry generic bylines with the following pattern: "الكاتب: محمد رشيد رضا". These are not found in the originals, if one consults the facsimiles available from sakhrit (formerly archive.sakhrit.co), but must have been added by later editors to the text. Their accuracy is unclear, some articles by Riḍā where either published posthumously after his death in 1935 or the attribution is wrong.
- consider, for instance v.25(3) published in March 1924 (facsimiles are here):
- The text "سكة الحديد الحجازية" clearly states in its opening paragraph that it was written by an anonymous "Syrian scholar" but shamela added Muḥammad Rashīd Riḍā as author anyway.
- The same is true for the following article "الوثائق الرسمية في المسألة العربية", which provides some texts from other sources and adds a very short comment from al-Manār. Again shamela added Muḥammad Rashīd Riḍā as author.
- consider, for instance v.25(3) published in March 1924 (facsimiles are here):
- The facsimiles from sakhrit confirm that shamela's page numbers correspond to the second printed edition (published in 1327 aH.). However, there are only page numbers between articles! This means, they can only be used as an approximation.
- there are many articles, which are still marked-up as notes
- paragraphs: the mark-up contains only line breaks but not paragraphs
- poetry:
- sections:
- titles:
- new publications/ review section
- done تقريظ المطبوعات الجديدة
- done تقريظ المطبوعات الحديثة
- done تقريظ المطبوعات
- done الأخبار والآراء
- new publications/ review section
- dividers of articles in sections
- a line with 3 asterisks
- the next line is wrapped in brackets
- workflow:
- Regex: search and replace
- find:
(<div[^>]+type=)"item" subtype="article"([^>]+>\s+<head[^>]+>تقريظ المطبوعات الجديدة</head>)
- replace:
$1"section"$2
- find:
- XSLT: wrap the entire content of these
<div>
s in<div type="item" subtype="article">
- Regex: search and replace
- titles:
- empty notes