Discover the Beauty of Urdu with Enhanced Search Capabilities
UrduSearchPlus is a powerful search engine designed to facilitate searching and indexing of Urdu text. It offers advanced features such as natural language processing, stopword removal, and indexing algorithms for efficient searching. With UrduSearchPlus, you can explore and analyze Urdu documents with ease.
- Users can perform searches in Urdu.
- Search is powered by TF-IDF scoring and an inverted index.
- Simple and responsive web interface for search.
- Stylish design with Font Awesome icons.
- Extraction of documents from a compressed file on startup.
- Handling of Urdu stopwords for improved search accuracy.
- Flask-based backend for handling search requests.
- Generation of term IDs and document IDs.
- Dynamic display of search results without page reload.
- Results include relevant information like document ID, score, and document name.
- Reading stopwords and finding document IDs.
- Saving term IDs and document IDs to text files.
- Construction of a term ID dictionary, forward index, and inverted index.
- Clone the repository:
git clone https://github.com/AnasMohammad4321/UrduSearchPlus.git
- Install dependencies:
pip install -r requirements.txt
- Run the Flask application:
python app.py
- Open your web browser and go to http://localhost:5000 to access UrduSearchPlus.
This project is licensed under the MIT License.