GitHub - wildercb/llm_training: A mini repo for llm training and inference from huggingface on personal datasets

A mini-repo for inferencing and finetuning llms from huggingface from local data

Data -

Contains scripts for example of how to load data into instruction tuning csvs from NLI data Contains scripts to merge datasets for finetuning

Inference -

Scripts to load a model and inference from huggingface, working on adding ollama support - requires models to be stored in quantized format.

for base inference

python inference/hf_inf_base.py --input_csv, --model_path (local or huggingface) , --n (number of prompts from input csv)

default saves to a csv which is named model_input_csv_n.csv (probably could be made better)

Training -

Scripts for finetuning and prompt tuning for choice of dataset and model

For finetuning on huggingface data

python training/hf_finetune.py --base_model (model weights to start with (local or from huggingface)) --data_path (path to csv in instruction tuning format) --model_name (name of output model after finetuning) --checkpoint_dir (directory to save new weights) --batch_size , --num_epochs

Models -

Meant to store our different model training architectures and model argument parameters if needed in the future

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.vscode		.vscode
core		core
data		data
inference		inference
models		models
training		training
.gitattributes		.gitattributes
.gitignore		.gitignore
GPT - qs.txt		GPT - qs.txt
README.md		README.md
__init__.py		__init__.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A mini-repo for inferencing and finetuning llms from huggingface from local data

Data -

Inference -

Training -

Models -

About

Releases

Packages

Languages

wildercb/llm_training

Folders and files

Latest commit

History

Repository files navigation

A mini-repo for inferencing and finetuning llms from huggingface from local data

Data -

Inference -

Training -

Models -

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages