LOTR Storyteller: the AI of Sauron 🧙

Write your own Lord Of The Rings story!

Version 1.1 / 23 May 2023

Description

In this project, we have designed an AI assistant that completes your stories in the LOTR style. During the development of the app, we have:

  • Extracted the text from the official book,
  • Prepared the dataset,
  • Trained BLOOM-3B using Low-Rank Adapters (LoRA),
  • Deployed the model on Inference Endpoints from Hugging Face,
  • Built the app using Streamlit,
  • Deployed it to Streamlit Cloud.

Note: given the cost of deploying a model this large, the app is not currently available for testing.
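For illustration, the first two steps above (text extraction and dataset preparation) can be sketched roughly as follows. The file name and the pypdf-based extraction are assumptions made for this sketch; the actual pipeline lives in the linked code and article.

from pypdf import PdfReader
from transformers import AutoTokenizer

# Hypothetical file name; the real source file is part of the project
reader = PdfReader("the_lord_of_the_rings.pdf")
text = "\n".join(page.extract_text() for page in reader.pages)

# Tokenize the whole text, then split it into fixed-size training chunks
tokenizer = AutoTokenizer.from_pretrained("bigscience/bloom-3b")
input_ids = tokenizer(text)["input_ids"]

context_length = 2048
chunks = [
    input_ids[i : i + context_length]
    for i in range(0, len(input_ids), context_length)
]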

⚙️ Model fine-tuning [code]

This LLM is BLOOM-3B fine-tuned with Low-Rank Adapters on text extracted from the book "The Lord of the Rings".

Check the article: Fine-tune your own “GPT-4” on your data: create a “The Lord of the Rings” storyteller

The Hugging Face model card: JeremyArancio/llm-tolkien
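For reference, attaching Low-Rank Adapters to BLOOM-3B with the peft library looks roughly like this. This is a sketch rather than the exact training script; the hyperparameter values mirror the Training parameters section below.

from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Load the base model in 8-bit to fit on a single GPU
model = AutoModelForCausalLM.from_pretrained(
    "bigscience/bloom-3b", load_in_8bit=True, device_map="auto"
)

# LoRA configuration mirroring the values under "Training parameters"
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable

Only the low-rank adapter matrices are updated during training; the frozen 3B base weights stay in 8-bit, which is what makes fine-tuning a model this size tractable on a single GPU.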

🚀 Model deployment and app [code]

The model is deployed on Inference Endpoints from Hugging Face, and the application is built and deployed on Streamlit Cloud.

Load the model

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftConfig, PeftModel

# Load the adapter config to find the base model (bigscience/bloom-3b)
config = PeftConfig.from_pretrained("JeremyArancio/llm-tolkien")
model = AutoModelForCausalLM.from_pretrained(config.base_model_name_or_path, return_dict=True, load_in_8bit=True, device_map='auto')
tokenizer = AutoTokenizer.from_pretrained(config.base_model_name_or_path)
# Attach the LoRA adapter weights on top of the base model
model = PeftModel.from_pretrained(model, "JeremyArancio/llm-tolkien")
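Note that load_in_8bit=True relies on the bitsandbytes library and a CUDA GPU. On a machine without a GPU, dropping load_in_8bit and device_map and loading the weights in full precision also works, at the cost of significantly more memory.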

Run the model

prompt = "The hobbits were so suprised seeing their friend"

inputs = tokenizer(prompt, return_tensors="pt")
tokens = model.generate(
    **inputs,
    max_new_tokens=100,
    temperature=1,
    eos_token_id=tokenizer.eos_token_id,
    early_stopping=True
)

# The hobbits were so suprised seeing their friend again that they did not 
# speak. Aragorn looked at them, and then he turned to the others.</s>

Training parameters

# Dataset
context_length = 2048

# Training
model_name = 'bigscience/bloom-3b'
lora_r = 16          # rank of the low-rank update matrices
lora_alpha = 32      # alpha scaling factor
lora_dropout = 0.05
lora_bias = "none"
lora_task_type = "CAUSAL_LM"

## Trainer config
per_device_train_batch_size = 1
gradient_accumulation_steps = 1
warmup_steps = 100
num_train_epochs = 3
weight_decay = 0.1
learning_rate = 2e-4
fp16 = True
evaluation_strategy = "no"
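Plugged into transformers, these values correspond to a Trainer setup along the following lines. This is a sketch: output_dir and train_dataset are placeholders, and model is the PEFT model from the fine-tuning sketch above.

from transformers import Trainer, TrainingArguments, DataCollatorForLanguageModeling

training_args = TrainingArguments(
    output_dir="llm-tolkien",              # placeholder
    per_device_train_batch_size=1,
    gradient_accumulation_steps=1,
    warmup_steps=100,
    num_train_epochs=3,
    weight_decay=0.1,
    learning_rate=2e-4,
    fp16=True,
    evaluation_strategy="no",
)

trainer = Trainer(
    model=model,                           # the PEFT model from the sketch above
    args=training_args,
    train_dataset=train_dataset,           # placeholder: the chunked dataset
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()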
