Neuron-Hacking

Finetuning LLMs to act as key-value stores.

This repository contains the base code used in research on fine-tuning Large Language Models (LLMs) to act as key-value stores. It is not a complete project, but it provides a foundation for further exploration and experimentation.

The research report produced from the results of this code can be found here: Neuron Hacking Report on Weights and Biases


The files needed to reproduce the experiments outlined in the report are listed below (illustrative sketches of each script's core logic follow the list):

  1. ai.py: This file contains the AI and FineTuner classes. The AI class is used to interact with OpenAI's API, calculate the cost of completions, and log that cost. The FineTuner class is used to fine-tune models, upload files for fine-tuning, retrieve and cancel fine-tuning jobs, and manage fine-tuned models; it is designed to be reusable in other projects if desired.

  2. finetune_cost_estimator.py: This script estimates the cost of fine-tuning a model. It counts the number of tokens in a dataset and uses the pricing information to estimate the cost. In practice its estimates were unreliable; my actual fine-tuning costs came out significantly higher than it suggested.

  3. numeric_key_value_generator.py: This script generates datasets for fine-tuning. Each dataset consists of unique keys and corresponding values. The keys are alpha-numeric strings of a specified length, and the values are unique integers.

  4. numeric_finetuning_loop.py: This script automates the process of fine-tuning a model on multiple datasets for multiple epochs. It checks if a fine-tuned model already exists before starting a new fine-tuning job, and it waits for a fine-tuning job to complete before starting the next one.

  5. finetune_progress_grid.py: This script visualizes the progress of fine-tuning jobs. It creates a grid where the rows represent different datasets and the columns represent different epochs. The cells in the grid are colored based on whether a fine-tuning job for the corresponding dataset and epoch has been completed.

  6. numeric_recall_testing_loop.py: This script tests the ability of fine-tuned models to recall values. It iterates over each item in a dataset and uses the model to recall the value corresponding to the unique key. The recalled values are then saved to a JSON file.
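
The sketches below illustrate the kind of logic each script implements; they are minimal illustrations under stated assumptions, not the repository's exact code. First, the operations that the FineTuner class in ai.py wraps: uploading a JSONL dataset, starting a fine-tuning job, and polling it until it finishes. This assumes the openai>=1.0 Python SDK and an OPENAI_API_KEY in the environment; the repository may target a different SDK version, so the exact method names are assumptions about the wrapper, not its API.

```python
# Minimal sketch of the calls FineTuner wraps: upload a training file,
# create a fine-tuning job, and wait for it to complete.
import time

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def start_finetune(jsonl_path: str, base_model: str = "gpt-3.5-turbo", n_epochs: int = 3) -> str:
    """Upload a JSONL dataset and create a fine-tuning job; return the job id."""
    training_file = client.files.create(file=open(jsonl_path, "rb"), purpose="fine-tune")
    job = client.fine_tuning.jobs.create(
        training_file=training_file.id,
        model=base_model,
        hyperparameters={"n_epochs": n_epochs},
    )
    return job.id

def wait_for_finetune(job_id: str, poll_seconds: int = 60) -> str:
    """Block until the job succeeds and return the fine-tuned model name."""
    while True:
        job = client.fine_tuning.jobs.retrieve(job_id)
        if job.status == "succeeded":
            return job.fine_tuned_model
        if job.status in ("failed", "cancelled"):
            raise RuntimeError(f"Job {job_id} ended with status {job.status}")
        time.sleep(poll_seconds)
```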
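
A rough cost estimate in the spirit of finetune_cost_estimator.py counts tokens with tiktoken and multiplies by a per-token training price and the epoch count. The price constant below is a placeholder rather than current OpenAI pricing, and the count ignores the per-message formatting overhead included in billing, which may be part of why the script's estimates ran low.

```python
# Rough training-cost estimate: tokens in the dataset x epochs x assumed price.
import json

import tiktoken

PRICE_PER_1K_TOKENS = 0.008  # placeholder training price (USD per 1K tokens)

def estimate_finetune_cost(jsonl_path: str, model: str = "gpt-3.5-turbo", n_epochs: int = 1) -> float:
    """Estimate the cost of fine-tuning `model` on a chat-format JSONL dataset."""
    enc = tiktoken.encoding_for_model(model)
    total_tokens = 0
    with open(jsonl_path) as f:
        for line in f:
            example = json.loads(line)
            for message in example["messages"]:
                total_tokens += len(enc.encode(message["content"]))
    return total_tokens * n_epochs * PRICE_PER_1K_TOKENS / 1000
```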
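
numeric_key_value_generator.py produces the training data itself: unique alphanumeric keys of a fixed length, each mapped to a unique integer, written as chat-format JSONL. The system prompt and message layout below are assumptions; the repository's exact prompt format may differ.

```python
# Generate unique alphanumeric keys mapped to unique integer values and write
# them as chat-format fine-tuning examples (one JSON object per line).
import json
import random
import string

def generate_dataset(n_pairs: int, key_length: int, out_path: str) -> dict:
    """Return the key -> value mapping and write it to `out_path` as JSONL."""
    keys = set()
    while len(keys) < n_pairs:
        keys.add("".join(random.choices(string.ascii_letters + string.digits, k=key_length)))
    pairs = {key: value for value, key in enumerate(sorted(keys))}
    with open(out_path, "w") as f:
        for key, value in pairs.items():
            example = {
                "messages": [
                    {"role": "system", "content": "You are a key-value store."},
                    {"role": "user", "content": key},
                    {"role": "assistant", "content": str(value)},
                ]
            }
            f.write(json.dumps(example) + "\n")
    return pairs
```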
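
numeric_finetuning_loop.py then runs fine-tuning over every dataset/epoch combination sequentially. The sketch below reuses start_finetune and wait_for_finetune from the first sketch; tracking already-completed combinations in a dictionary is an assumption about how the skip check works (the script may instead query the API or a local record).

```python
# Fine-tune sequentially on every (dataset, epochs) pair, skipping pairs that
# already have a fine-tuned model and waiting for each job before the next.
def finetune_all(datasets: list[str], epoch_counts: list[int], completed: dict | None = None) -> dict:
    """Return a mapping of (dataset path, epochs) -> fine-tuned model name."""
    completed = dict(completed or {})
    for dataset_path in datasets:
        for n_epochs in epoch_counts:
            if (dataset_path, n_epochs) in completed:
                continue  # a fine-tuned model already exists for this combination
            job_id = start_finetune(dataset_path, n_epochs=n_epochs)
            completed[(dataset_path, n_epochs)] = wait_for_finetune(job_id)
    return completed
```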
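
finetune_progress_grid.py turns that progress into a grid of datasets against epoch counts. A minimal matplotlib version, assuming the `completed` mapping from the previous sketch, might look like this:

```python
# Plot a grid where rows are datasets, columns are epoch counts, and a cell is
# shaded once the corresponding fine-tuning job has completed.
import matplotlib.pyplot as plt
import numpy as np

def plot_progress(datasets, epoch_counts, completed):
    grid = np.array([[(d, e) in completed for e in epoch_counts] for d in datasets], dtype=float)
    fig, ax = plt.subplots()
    ax.imshow(grid, cmap="Greens", vmin=0, vmax=1)
    ax.set_xticks(range(len(epoch_counts)))
    ax.set_xticklabels([str(e) for e in epoch_counts])
    ax.set_yticks(range(len(datasets)))
    ax.set_yticklabels(datasets)
    ax.set_xlabel("Epochs")
    ax.set_ylabel("Dataset")
    plt.show()
```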
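
Finally, numeric_recall_testing_loop.py queries each fine-tuned model with every key and records what comes back. The system prompt here must match whatever the training data used, so it carries the same assumption as the generator sketch above.

```python
# Ask the fine-tuned model for the value behind each key and save the answers
# alongside the expected values as JSON.
import json

from openai import OpenAI

client = OpenAI()

def test_recall(model_name: str, pairs: dict, out_path: str) -> None:
    """Query `model_name` with every key in `pairs` and write the results."""
    recalled = {}
    for key, expected_value in pairs.items():
        response = client.chat.completions.create(
            model=model_name,
            messages=[
                {"role": "system", "content": "You are a key-value store."},
                {"role": "user", "content": key},
            ],
            temperature=0,
        )
        recalled[key] = {
            "expected": expected_value,
            "recalled": response.choices[0].message.content,
        }
    with open(out_path, "w") as f:
        json.dump(recalled, f, indent=2)
```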

Please note that this code is intended for research purposes and should be adapted to fit your specific needs and constraints.
