SinaTools

SinaTools is an open-source toolkit for Arabic NLP and NLU, developed by SinaLab at Birzeit University. It is available through Python APIs, command-line tools, Colab notebooks, and online demos.

See the full list of Available Packages, which includes: (1) Morphology Tagging, (2) Named Entity Recognition (NER), (3) Word Sense Disambiguation (WSD), (4) Semantic Relatedness, (5) Synonymy Extraction and Evaluation, (6) Relation Extraction, (7) Utilities (diacritic-based word matching, Jaccard similarity, parser, tokenizers, corpora processing, transliteration, etc.).
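Jaccard similarity, listed among the utilities, is the size of the intersection of two sets divided by the size of their union. The sketch below is a generic, standalone illustration of that formula and does not use or reproduce the SinaTools API; the function name `jaccard_similarity`, the whitespace tokenization, and the empty-input convention are all assumptions made for the example.

```python
def jaccard_similarity(tokens_a, tokens_b):
    """Return |A ∩ B| / |A ∪ B| for two iterables of tokens.

    Hypothetical helper for illustration; not part of SinaTools.
    """
    set_a, set_b = set(tokens_a), set(tokens_b)
    union = set_a | set_b
    if not union:
        # Convention chosen here: two empty inputs count as identical.
        return 1.0
    return len(set_a & set_b) / len(union)


# Whitespace splitting is a simplification; a real pipeline would
# use a proper Arabic tokenizer.
first = "ذهب الولد إلى المدرسة".split()
second = "ذهب الولد إلى البيت".split()

# The two sentences share 3 of 5 distinct tokens.
score = jaccard_similarity(first, second)
print(score)
```

For the toolkit's own implementation and its options, consult the Available Packages list.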

See Demo Pages.

See the benchmarking results, in which SinaTools outperformed the related toolkits it was compared against.

Installation

To install SinaTools, ensure you are using Python version 3.10.8, then clone the GitHub repository.

Alternatively, install it with pip:

pip install sinatools
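As a quick sanity check after installation, the sketch below compares the running interpreter against the stated Python version (3.10.8) and looks up the installed `sinatools` distribution using only the standard library. The helper names `python_matches` and `installed_version` are hypothetical, and the exact-match comparison is an assumption: this README does not say whether other Python versions also work.

```python
import sys
from importlib import metadata


def python_matches(required=(3, 10, 8), actual=None):
    """Return True if the interpreter version equals `required`.

    `actual` can be passed explicitly for testing; by default the
    running interpreter's (major, minor, micro) tuple is used.
    """
    if actual is None:
        actual = sys.version_info[:3]
    return tuple(actual) == tuple(required)


def installed_version(package="sinatools"):
    """Return the installed version string, or None if not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


print("Python matches 3.10.8:", python_matches())
print("sinatools version:", installed_version())
```

If the second line prints `None`, the package is not visible to the interpreter that ran the script, which usually means pip installed it into a different environment.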

Installing Models and Data Files

Some SinaTools modules require data files and fine-tuned models that must be downloaded separately. To download them, see DataDownload.

Documentation

For more information, please refer to the main page or the online documentation.

Citation

Tymaa Hammouda, Mustafa Jarrar, Mohammed Khalilia: SinaTools: Open Source Toolkit for Arabic Natural Language Understanding. In Proceedings of the 2024 AI in Computational Linguistics (ACLing 2024), Procedia Computer Science, Dubai. ELSEVIER.

License

SinaTools is available under the MIT License. See the LICENSE file for more information.

Reporting Issues

To report any issues or bugs, please contact us at "sina.institute.bzu@gmail.com" or visit SinaTools Issues.
