Towards a Complete Interactive Mapping of Kurdish Dialectology
Explore the docs »
View Demo (TODO)
·
Report Bug
·
Request Feature
Table of Contents
This will be an interactive map for various dialects from low-resource languages, with a focus on and starting with Kurdish.
There exist a plathora of Kurdish dialects that are often unintelligible to the native speakers of each other. However, Kurdish is considered to be a dialect continuum with at times sudden language shifts and sometimes smooth transitions from one dialect to the neighboring one. At first it might seem confusing and counter intuitive that it has been found, that Central Kurdish is considered to be very understandable by speakers of Northern Kurdish and Southern Kurdish to be understandable by speakers of Central Kurdish- but speakers of Northern Kurdish hardly being able to understand Southern Kurdish. An explanation for this can be found in the prior mentioned dialect continuum- where "Central Kurdish" is not necessarily "Central Kurdish", since there exist many differen subdialects of Central Kurdish.
Utilizing an interactive map, visualizing the different aspects and geographical distribution of the separate dialects might enable new insights and at the very least an easier understanding of Kurdish linguistics.
The International Conference on Kurdish Linguistics (ICKL) is a biannual conference serving as a forum of scientific exchange for linguists working on any aspect of Kurdish, including the interactions with its neighboring languages.
List of major frameworks/libraries used to bootstrap this project.
- R
TODO
TODO: More Details
- Install R
- Set library directory for R
echo 'export R_LIBS_USER="/media/CrazyProjects/LowResDialectology/DialectMapping/DialectMapping/R"' >> ~/.bashrc
source ~/.bashrc
- Install packages (via setup script)
-
- We assume German to be spoken in Germany and maybe in the surrounding countries, so we look on google maps (or more old-school: A real atlas) and write the country names into the DialectMapping/data/info/german_countries.txt file (line-by-line).
-
- We run the bash script to convert our list of country names into country codes used by gadm and executable R-Code.
bash country_names2codes.sh
- Found countries are written into: ./data/inter/german_countries_codes.txt
- Missing countries are written into: ./data/inter/german_countries_not_found.txt
- (TODO: Link with DialectOntology to automate this part)
-
For more examples, please refer to the Documentation
- Set up this Repository
- Section 1: Dialect Information
- Classification/Typology
- (partially) Languege Family
- (partially) (Sub-)Dialect(-Group)
- Linguistic Information
- (Meta) Data
- (partially) Areas where spoken
- Native Speaker
- (partially) Link to Publications
- Classification/Typology
- Section 2: Geolocation Data
- Data Source(s)
- Comfortable Access
- Up to date
- (partially) Proper Format
- Preprocess and Aggregate
- Normalize Naming
- Link GeoData to Dialect Data
- Data Source(s)
- Section 3: Interactive Map
- Map Functionalities
- (partially) R-Scripts and Packages
- Accessible Visualisations
- Stress-Testing
- Online Hosting
- Server Setup
- Stress-Testing
- Map Functionalities
See the TODO: open issues for a full list of proposed features (and known issues).
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!
- Fork the Project
- Create your Feature Branch (
git checkout -b feature/AmazingFeature
) - Commit your Changes (
git commit -m 'Add some AmazingFeature'
) - Push to the Branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
Distributed under the Apache License. See LICENSE.txt
for more information.
Christian Schuler - christianschuler8989(4T)gmail.com
Raman Ahmad - raman.ahmad2022(4T)gmail.com
Related project of the authors: Analysis of Phonology and Morphology in the Kobani Dialect
A list of helpful resources we would like to give credit to:
- Best-README-Template
- GADM provides maps and spatial data for all countries and their sub-divisions. You can browse our maps or download the data to make your own maps.