Disambiguation Study for Arabic Applied on Text Classification
-
Updated
Sep 14, 2024 - Jupyter Notebook
Disambiguation Study for Arabic Applied on Text Classification
An advanced Arabic fake news detection model using LSTM and AraBERT. This project leverages the Arabic Fake News Dataset (AFND) to classify news articles as credible, not credible, or undecided. Includes preprocessing steps, model building, and evaluation using TensorFlow.
Many countries speak Arabic; however, each country has its own dialect, the aim of this project is to build a model that predicts the dialect given the text.
Emotion Prediction in Arabic Text
Diacritics are short vowels with a constant length that are spoken. The same word in the Arabic language can have different meanings and different pronunciations based on how it is diacritized. In this project, we implement a pipeline to predict the diacritic of each character in an Arabic text using Natural Language Processing techniques.
Dialectical Arabic Sentiment Analysis
Mental health diagnosis tool using NLP and ML for Arabic inputs, with a Laravel web application interface
Fine-tune BERT models to classify Arabic text by different dialects.
Used “aubmindlab/bert-base-arabertv2” from Aub-mind AraBERT to create a simple Arabic text tokenizer.
Sentiment analysis with arabert in Tunisian dialect
This is an experiment for Qur'an QA for the shared task at the OSCAT workshop
Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA)
Fine-tuning / pre-training AraElectra on a specific domain for QA system.
Arabic Dialect Sentimenal Analysis
Arabic_Dialect_Identification_NLP-AIM-Task
Arabic Dialect Identification between 18 country-level Arabic dialects using QADI dataset and pretrained language model AraBERT
Easy to use extractive text summarization with AraBERT
After collecting 40 thousand tweets and preprocessing it, I used word embeddings with arabert and tf-idf along with two neural network architectures and 5 machine learning algorithms. Due to the huge size of the dataset, I chose Amazon SageMaker to train the models
Simple Script to undo Farasa Segmentation, compatible with AraBERT pre-segmentation
Add a description, image, and links to the arabert topic page so that developers can more easily learn about it.
To associate your repository with the arabert topic, visit your repo's landing page and select "manage topics."