A baseline results towards constructing readability corpus ARC-WMI, a new Arabic collection of written medicine information annotated with readability levels. This corpus contains 4476 sentences with over 61k words, extracted from 94 sources of Arabic written medicine information. These sentences were manually annotated and assigned a readability level (“Easy,” “Intermediate,” or “Difficult”) by a panel of health-care professionals.
Abeer Aldayel, Hend Al-Khalifa, Sinaa Alaqeel, Norah Abanmy, Maha Al-Yahya, Mona Diab, (2018) ARC-WMI: Towards Building Arabic Readability Corpus for Written Medicine Information, IN OSACT3.
This data made available by IWAN research group under the Creative Commons Attribution 4.0 International license. To view a copy of this license, visit [https://creativecommons.org/licenses/by-nc-sa/4.0/].