https://alicecengal.github.io/Neomalay-Lexeme-Synthesizer/
Athran Abdul Rahman
Universiti Teknologi Malaysia
September 2017
Neomalay here means that the language used in this study is not any extant variety of Malay, but a constructed language inspired by Malay.
A lexeme synthesizer is a procedure that synthesizes lexemes according to the phonological and phonotactic rules of a language.
Goal --- to synthesize new lexemes that visually and aurally resemble extant native Malay words.
Significance --- a lexeme synthesizer may be useful for building a lorem ipsum generator localized for Malay. It may also be useful for inventing new names when writing fiction.
Methods
- Analyse the phonology of Malay, then derive a phonology for Neomalay
- Analyse the phonotactic rules of Malay, then derive phonotactic rules for Neomalay
- Write a computer program that uses the rules set out in Steps 1 and 2 to generate new lexemes in Neomalay
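To make the third step concrete, here is a minimal sketch of how such a generator could look in Scala. It is not the code in synth.scala. The phoneme inventories, the two-syllable (C)V + CV(C) word shape, and the names `LexemeSketch`, `syllable` and `lexeme` are all assumptions made for illustration; the actual rules are the ones derived in the paper.

```scala
import scala.util.Random

// Illustrative sketch only: inventories and syllable shapes below are
// assumptions, not the rules from the paper or from synth.scala.
object LexemeSketch {
  // Assumed consonant onsets, vowel nuclei, and syllable-final codas.
  val onsets: Vector[String] =
    Vector("b", "c", "d", "g", "h", "j", "k", "l", "m", "n", "ny", "p", "r", "s", "t", "w", "y")
  val vowels: Vector[String] = Vector("a", "e", "i", "o", "u")
  val codas: Vector[String] = Vector("h", "k", "l", "m", "n", "ng", "p", "r", "s", "t")

  // Pick one element uniformly at random.
  def pick(xs: Vector[String], rng: Random): String = xs(rng.nextInt(xs.length))

  // Build one syllable of shape (C)V(C).
  // The onset is dropped about a quarter of the time when allowed;
  // the coda is added about half of the time when allowed.
  def syllable(rng: Random, allowOnsetless: Boolean, allowCoda: Boolean): String = {
    val onset = if (allowOnsetless && rng.nextInt(4) == 0) "" else pick(onsets, rng)
    val nucleus = pick(vowels, rng)
    val coda = if (allowCoda && rng.nextBoolean()) pick(codas, rng) else ""
    onset + nucleus + coda
  }

  // A lexeme here is two syllables: an open first syllable that may lack
  // an onset, followed by a second syllable that always has an onset and
  // may end in a coda.
  def lexeme(rng: Random): String =
    syllable(rng, allowOnsetless = true, allowCoda = false) +
      syllable(rng, allowOnsetless = false, allowCoda = true)

  def main(args: Array[String]): Unit = {
    val rng = new Random(42) // fixed seed so runs are repeatable
    println(Vector.fill(8)(lexeme(rng)).mkString(" | "))
  }
}
```

This sketch separates the phonology (the three inventories) from the phonotactics (which slots a syllable may fill), mirroring Steps 1 and 2. It cannot produce every shape seen in the sample output, such as closed first syllables or final diphthongs; those would need additional templates.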
For the linguistic details, refer to the PDF paper.
The synthesizer is implemented in the file synth.scala.
Please don't tell me about Prolog.
Sample Output
gahang | manoi | awor | barum | hawik | niso | niwal | togil |
---|---|---|---|---|---|---|---|
lasa | jerau | golap | barah | cingat | cungsoi | ninjak | kido |
acau | angsal | guyok | nayik | jalo | roliu | yucop | sacuh |
cakup | hamba | ajui | bodau | tayu | adiu | gapam | kana |
ajo | locir | gongo | hujip | gahur | anti | akah | lala |
intup | benap | ecal | awau | poyut | taga | sota | laba |
lumo | angau | maca | oho | reka | ihol | yunyol | tahi |
ikam | luna | eso | saging | abal | umap | beca | amap |
saku | anyim | ura | aga | yaja | edap | meca | anyui |
kahar | nanga | idun | upo | palan | guwuh | suwan | pekah |
ada | yayo | hakan | rarum | pendul | gotang | yakip | catin |
yungan | tonah | napa | sana | mobih | aja | lusah | riho |
elat | inya | haba | mupat | pihung | karan | gayi | acal |
dagi | anyom | edo | nuho | nasan | debar | enoh | loti |