The camel_data
tool allows you to diacritize Arabic text.
Below is the usage information that can be generated by running camel_data --help
.
Usage:
camel_data [-d <DIR> | --data-dir=<DIR>] <PACKAGE>
camel_data (-l | --list)
camel_data (-v | --version)
camel_data (-h | --help)
Options:
-l --list
Show a list of packages available for download.
-h --help
Show this screen.
-v --version
Show version.
Below is a table of available packages and a comparison of their contents.
full | light | |
---|---|---|
Size | 1.8 GB | 19 MB |
Morphology |
|
|
Disambiguation |
|
|
Taggers |
|
|
Tokenization |
|
|
Dialect Identification |
|
|
Sentiment Analysis |
|
|
Named Entity Recognition |
|