Fine-tuning LLMs for classification

Dataset used: https://huggingface.co/datasets/dair-ai/emotion (transformed so that each label is a word rather than an integer; a compressed copy is included in the repo)
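The label transformation mentioned above can be sketched roughly as follows. This is a minimal illustration, not the repo's actual script: the helper name `label_to_word` and the sample rows are made up for the example, and the label order is assumed to match the one published on the dataset card (0 = sadness, 1 = joy, 2 = love, 3 = anger, 4 = fear, 5 = surprise).

```python
# Assumed label order for dair-ai/emotion, taken from the dataset card.
EMOTION_NAMES = ["sadness", "joy", "love", "anger", "fear", "surprise"]


def label_to_word(example: dict) -> dict:
    """Return a copy of the example with the integer label replaced by its name.

    Hypothetical helper for illustration; the repo may do this differently.
    """
    return {**example, "label": EMOTION_NAMES[example["label"]]}


# Made-up rows in the same shape as the dataset ("text" plus integer "label").
rows = [
    {"text": "i feel wonderful today", "label": 1},
    {"text": "i am so scared of the exam", "label": 4},
]

converted = [label_to_word(row) for row in rows]
for row in converted:
    print(row["label"], "|", row["text"])
```

If the dataset is loaded with the Hugging Face `datasets` library, the same function could presumably be applied per example with `Dataset.map`, after which the label column would need to be treated as a string rather than a class index.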

Order of operations:

Some of these Python files need to be edited before you can use them. This code supplements the following workshop (in Slovak): https://www.youtube.com/watch?v=fyydvBcJTn8

Repository files:

data
utlis
.gitignore
balance_dataset.py
create_transformers_dataset.py
custom_trainer.py
evaluate_model.py
generate_data.py
readme.md
requirements.txt
serve.py
show_dataset_class_distribution.py
split_dataset.py
sweep.yml
train.py
