This project evaluates the open-source Mistral 7B model on two benchmark datasets, MMLU and MATH, using several evaluation techniques. You can run evaluations either through a Streamlit UI for an easy, interactive experience or directly from the terminal via the provided scripts for more controlled testing.
Note: This project uses the OpenAI and Hugging Face APIs for text generation, and the Weights & Biases (WandB) API for logging outputs. Set up the .env file with your own API keys; all API key variables are left empty.
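For reference, a minimal .env might look like the sketch below. The variable names here are assumptions for illustration; use whatever names the project's code actually reads:

OPENAI_API_KEY=<your-openai-key>
HUGGINGFACE_API_KEY=<your-huggingface-token>
WANDB_API_KEY=<your-wandb-key>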
Clone the repository and install the required packages:
git clone git@github.com:emirkocer/NLP-test-project.git project
cd project
pip install -r requirements.txt
To start the Streamlit UI, navigate to the project directory and run:
streamlit run app.py
On the UI, you can select a dataset and an evaluation mode to run the desired inference.
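The UI pairs each dataset with its own set of evaluation modes (listed in the command-line section below). The snippet that follows is a minimal sketch of that selection flow, not the project's actual app.py; the `run_evaluation` stub stands in for the real dispatch logic:

```python
import streamlit as st

def run_evaluation(dataset: str, mode: str) -> dict:
    # Stub: the real app dispatches to the project's evaluation code here.
    return {"dataset": dataset, "mode": mode, "status": "stub"}

st.title("Mistral 7B Evaluation")

# Each dataset exposes its own set of evaluation modes.
MODES = {
    "mmlu": ["baseline", "few-shot", "few-shot-and-cot"],
    "math": ["baseline", "few-shot"],
}
dataset = st.selectbox("Dataset", list(MODES))
mode = st.selectbox("Evaluation mode", MODES[dataset])

if st.button("Run evaluation"):
    st.json(run_evaluation(dataset, mode))
```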
For direct command-line use, you can run evaluations with the provided Bash script:
chmod +x run_evaluation.sh
./run_evaluation.sh mmlu <evaluation_mode>
./run_evaluation.sh math <evaluation_mode>
Available evaluation modes for MMLU: 'baseline', 'few-shot', and 'few-shot-and-cot'.
Available evaluation modes for MATH: 'baseline' and 'few-shot'.
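For example, to run the few-shot chain-of-thought evaluation on MMLU:

./run_evaluation.sh mmlu few-shot-and-cot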
In the finetune directory, you will find subfolders for the MATH and MMLU datasets containing the Python files needed to fine-tune the Mistral 7B base model on a GPU, using the Unsloth library for training optimization.
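As a rough sketch of what an Unsloth setup looks like, the snippet below loads a 4-bit Mistral 7B checkpoint and attaches LoRA adapters. The checkpoint name and hyperparameters are illustrative, not the project's exact configuration:

```python
from unsloth import FastLanguageModel

# Load a 4-bit quantized Mistral 7B base model (illustrative checkpoint name).
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/mistral-7b-bnb-4bit",
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters so only the low-rank update matrices are trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)
# `model` and `tokenizer` can then be handed to a trainer (e.g. trl's SFTTrainer).
```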
Two Colab notebooks are provided for inference with both fine-tuned models.
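For reference, inference with an Unsloth fine-tuned checkpoint typically follows the pattern sketched below; the checkpoint path and prompt are placeholders, and the notebooks remain the authoritative examples:

```python
from unsloth import FastLanguageModel

# Load the fine-tuned checkpoint (placeholder path).
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="path/to/finetuned-checkpoint",
    max_seq_length=2048,
    load_in_4bit=True,
)
FastLanguageModel.for_inference(model)  # enable Unsloth's faster generation path

inputs = tokenizer("Question: ...", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```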