- Introduction
- Prerequisites
- Installation Steps
- Updating the Environment
- Deactivating and Removing the Environment
- Jupyter Use
- Datasets
- Additional References
- License
TYMBert is our submission for NCIM 2025, a spam classifier that makes use of knowledge distillation to compress the model while preserving accuracy
This repository provides a Conda environment configuration file (environment.yml) for setting up the tymbert environment. Follow the steps below to install and configure it correctly on your system.
-
Clone the Repository
git clone https://github.com/mirzaazwad/TYMBert.git cd TYMBert -
Create the Conda Environment Run the following command to create the
tymbertenvironment from theenvironment.ymlfile:conda env create -f environment.yml
-
Update the Environment Prefix The
environment.ymlfile may contain an absolute path under theprefixfield, which may not match your system's Conda installation directory. To fix this:- Open the
environment.ymlfile in a text editor - Locate the
prefix:field at the bottom of the file (if present) - Change it to your own Conda environment path, which can be found using:
conda info --envs
- Alternatively, create the environment without using the prefix by running:
conda env create --name tymbert --file environment.yml
- Open the
-
Activate the Environment
conda activate tymbert
-
Verify Installation Check that the necessary dependencies are installed:
conda list
If you make changes to environment.yml and need to update the existing environment:
conda env update --name tymbert --file environment.yml --pruneTo deactivate the environment:
conda deactivateTo remove the environment completely:
conda env remove --name tymbertAfter this environment is setup, use this environment as your kernel and you can use it via Jupyter Notebook or VSCode with the Jupyter extension.
| Dataset Name | Description | Link |
|---|---|---|
| SPStudy | A dataset for spam research, containing various studies and data points. | GitHub - SPStudy |
| SMS Spam Collection Dataset | A dataset containing SMS messages labeled as spam or ham. | Kaggle - SMS Spam Collection |
Quantization Logic and Code was used with the help of GitHub - BERT-Quantization by srimoyee1212
This project is licensed under the MIT License. See the LICENSE file for details.