NLP Arabic Dialect Identification and Next Word Prediction

Project Overview

This project applies natural language processing to Arabic text and offers two main functionalities:

Next Word Prediction (Knowledge-Based): Uses an n-gram model built from the MADAR dataset to predict the next word of a given sentence.
Arabic Dialect Identification (Machine Learning): Uses a BERT model combined with lexicon features, together with the MADAR dataset, to identify the Arabic dialect of a given text.
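
To make the n-gram idea concrete, here is a minimal sketch of next word prediction with simple backoff. It is not the project's actual implementation: the function names `train_ngrams` and `predict_next`, the trigram default, and the three-sentence toy corpus are all assumptions made for illustration, and the real model is built from MADAR rather than from a hand-written list.

```python
from collections import Counter, defaultdict


def train_ngrams(sentences, n=3):
    """Count which word follows each context of 1 to n-1 preceding words."""
    model = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.split()
        for i in range(1, len(tokens)):
            # Record tokens[i] under every context length that fits.
            for k in range(1, n):
                if i - k < 0:
                    break
                context = tuple(tokens[i - k:i])
                model[context][tokens[i]] += 1
    return model


def predict_next(model, text, n=3, top_k=3):
    """Return up to top_k candidate next words for the given text.

    The longest available context is tried first; if it was never seen,
    the function backs off to a shorter one.
    """
    tokens = text.split()
    for k in range(min(n - 1, len(tokens)), 0, -1):
        context = tuple(tokens[-k:])
        if context in model:
            return [word for word, _ in model[context].most_common(top_k)]
    return []


# Hypothetical toy corpus (not taken from MADAR).
corpus = [
    "انا بحب القهوة",
    "انا بحب الشاي",
    "انا بحب القهوة كثير",
]
model = train_ngrams(corpus)
suggestions = predict_next(model, "انا بحب")
```

Backing off to shorter contexts keeps the predictor useful when the exact two-word history never appeared in the training data, at the cost of less specific suggestions.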

✨ Experience the project live on Streamlit! ✨

Features

Next Word Prediction using n-gram model
Arabic Dialect Identification using BERT
Interactive UI with Streamlit
Comprehensive text preprocessing for Arabic
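
The README does not list the preprocessing steps, so the following is only a sketch of normalization steps that are common for Arabic text: stripping diacritics, removing tatweel (the elongation character), unifying alef variants, and collapsing whitespace. The function name `normalize_arabic` and the choice of steps are assumptions, not the project's confirmed pipeline.

```python
import re

# Short-vowel and related marks (U+064B to U+0652) plus dagger alef (U+0670).
DIACRITICS = re.compile(r"[\u064B-\u0652\u0670]")
# Tatweel (kashida), used only to stretch words visually.
TATWEEL = "\u0640"
# Alef with madda, hamza above, and hamza below.
ALEF_VARIANTS = re.compile(r"[\u0622\u0623\u0625]")


def normalize_arabic(text):
    """Apply a few common, lossy normalization steps to Arabic text."""
    text = DIACRITICS.sub("", text)
    text = text.replace(TATWEEL, "")
    text = ALEF_VARIANTS.sub("\u0627", text)  # map all variants to bare alef
    return " ".join(text.split())  # collapse runs of whitespace


cleaned = normalize_arabic("أهلاً   وسهلاً")
```

Normalization of this kind reduces vocabulary sparsity, but it can also erase spelling cues that distinguish dialects, so which steps to apply is a design decision that depends on the task.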

Installation

Prerequisites

Python 3.x
pip
Streamlit

Instructions

Clone the repository:

```
git clone https://github.com/maans2001/UJ-NLP-Project
cd UJ-NLP-Project
```

How to Run

Windows

Open Command Prompt or PowerShell.
Navigate to the project directory and run:
```
run.bat
```

macOS and Linux

Open Terminal.

Navigate to the project directory, make the script executable, and run it. On macOS, use run_macosx.sh instead of run.sh:

```
chmod +x run.sh
./run.sh
```

Usage

Next Word Prediction (Knowledge-Based)

Select "برنامج خمن الكلمة التالية (Knowledge Based)" ("Guess the next word program") from the sidebar.
Enter a sentence in Arabic.
Click "خمن الكلمات الجاية" ("Guess the coming words") to predict the next words.

Arabic Dialect Identification (Machine Learning)

Select "برنامج تحديد اللهجات (Machine Learning)" ("Dialect identification program") from the sidebar.
Enter an Arabic text.
Click "حدد اللهجة" ("Identify the dialect") to identify the dialect.
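
As a rough illustration of what "lexicon features" can mean for dialect identification, the sketch below scores a text against small per-dialect word lists. Everything here is hypothetical: the `LEXICON` contents, the three dialect groups, and the `lexicon_features` function are illustrative stand-ins, not the project's lexicon or code. In a BERT-based classifier, features like these might be concatenated with the sentence embedding before the final classification layer, though the README does not say how the project combines them.

```python
# Hypothetical toy lexicon: a few well-known marker words per dialect group.
LEXICON = {
    "Egyptian": {"ازاي", "عايز", "دلوقتي"},
    "Levantine": {"شو", "بدي", "هلق"},
    "Gulf": {"شلون", "ابي", "وايد"},
}


def lexicon_features(text):
    """Return, per dialect, the fraction of tokens found in its lexicon."""
    tokens = text.split()
    total = len(tokens) or 1  # avoid division by zero on empty input
    return {
        dialect: sum(token in words for token in tokens) / total
        for dialect, words in LEXICON.items()
    }


features = lexicon_features("شو بدي اعمل")
best_guess = max(features, key=features.get)
```

On their own, such counts are a weak signal, since many sentences contain no marker words at all; they are most useful as a supplement to a learned model.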

Roadmap

Add more dialects
Improve the prediction model
Enhance the UI/UX

Contributing

Contributions are welcome! Please fork this repository and submit a pull request. For major changes, please open an issue first to discuss what you would like to change.

Fork the Project
Create your Feature Branch (`git checkout -b feature/AmazingFeature`)
Commit your Changes (`git commit -m 'Add some AmazingFeature'`)
Push to the Branch (`git push origin feature/AmazingFeature`)
Open a Pull Request
