Skip to content

Arabic deep-learning based diacritization models (Shakkala, Shakkelha) in the ONNX format.

License

Notifications You must be signed in to change notification settings

nipponjo/arabic_vocalizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Arabic deep-learning based diacritization models (Shakkala, Shakkelha) in the ONNX format.

Install with:

pip install git+https://github.com/nipponjo/arabic_vocalizer.git

Example:

from arabic_vocalizer import vocalize

# %% unvocalized input
input_text = "اللغة العربية هي أكثر اللغات السامية تحدثا، وإحدى أكثر اللغات انتشارا في العالم، يتحدثها أكثر من 467 مليون نسمة"

# %% shakkala output
print(vocalize(input_text, model='shakkala'))
>> اللُّغَةُ الْعَرَبِيَّةُ هِيَ أَكْثَرُ اللُّغَاتِ السَّامِيَةِ تَحَدُّثًا، وَإِحْدَى أَكْثَرِ اللُّغَاتِ انْتِشَارًا فِي الْعَالِمِ، يَتَحَدَّثُهَا أَكْثَرُ مَنْ 467 مُلْيُونُ نُسْمَةَ

# %% shakkelha output
print(vocalize(input_text, model='shakkelha'))
>> اللُّغَةُ الْعَرَبِيَّةُ هِيَ أَكْثَرُ اللُّغَاتِ السَّامِيَةِ تَحَدُّثًا، وَإِحْدَى أَكْثَرِ اللُّغَاتِ انْتِشَارًا فِي الْعَالِمِ، يَتَحَدَّثُهَا أَكْثَرُ مِنْ 467 مَلْيُونٍ نَسَمَةً

About

Arabic deep-learning based diacritization models (Shakkala, Shakkelha) in the ONNX format.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages