Skip to content

Portophon/PtLanka

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Welcome to PtLanka

(released in https://sites.google.com/view/opencor-2021/)

PtLanka is an online corpus of Sri Lanka Portuguese lexicon and phonology. This repository assembles a total of 2524 words of Sri Lanka Portuguese from its most complete dictionary (Dalgado,1900), which covers the lexicon from the extinct dialect of Colombo from the Sinhala-dominant region, and make it available online in a consistent format. Apart from the original orthography and notes, the database also includes a broad phonetic transcription, updated definition in Portuguese and English for each word, and some morphological annotation. This database makes it possible for researchers to explore the linguistic data, to formulate hypotheses on how the Sri Lanka Portuguese was formed and evolved and to assess the effects of contact with other languages, namely Dutch and English. Databases like this are also instrumental in language documentation and conservation.

The folder "Syllable_split" (https://github.com/Portophon/PtLanka/tree/main/Syllable_split) contains the notebook and replacement csv files that were used to split sylables -- presented in the Demo session -- Propor 2022.

How to use PtLanka

Users can browse or search lexical items or any particular features provided by clicking on "PtLanka.csv" above and using the tab "search this file". Data can be downloaded by clicking the code button and then download zip. If you use PtLanka in your research, please cite appropriately, following the recommended citation below.

How to cite PtLanka

Silva, C. & Trigo, L. 2022. PtLanka. Porto: CLUP-Centro de Linguística da Universidade do Porto.