# Open source LLM kickstarter

In [None]:
import torch
from transformers import pipeline

## Load open source models

Check out the [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard), or chat with some open source models [here](https://chat.lmsys.org/), for inspiration. We've chosen a GPU runtime so that some of the bigger models can run, but if you want to go really big, you may need tricks like [8-bit quantization](https://colab.research.google.com/github/huggingface/blog/blob/main/notebooks/HuggingFace_int8_demo.ipynb) or model parallelism.
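To get a feel for when such tricks become necessary, here is a rough back-of-the-envelope sketch of how much memory the weights alone take at different precisions. The parameter counts are assumptions read off the model names (738M and roughly 3B), `weight_memory_gib` is a hypothetical helper, and the estimate ignores activations and other runtime overhead, so treat the numbers as a lower bound.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "int8": 1}


def weight_memory_gib(n_params, dtype):
    """Approximate memory for the model weights only, in GiB."""
    return n_params * BYTES_PER_PARAM[dtype] / 1024**3


# Parameter counts are approximations taken from the model names.
models = [("LaMini-T5-738M", 738e6), ("dolly-v2-3b", 3e9)]

for name, n_params in models:
    for dtype in BYTES_PER_PARAM:
        gib = weight_memory_gib(n_params, dtype)
        print(f"{name:>15} {dtype:>9}: {gib:5.1f} GiB")
```

Halving the bytes per parameter halves the weight footprint, which is why going from 32-bit to 16-bit or 8-bit lets a larger model fit on the same GPU.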

In [None]:
# https://huggingface.co/MBZUAI/LaMini-T5-738M
lamini = pipeline(
    'text2text-generation', 
    model="MBZUAI/LaMini-T5-738M", 
    max_length=512, 
    do_sample=True, 
    device=0
)

In [None]:
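# https://huggingface.co/databricks/dolly-v2-3b
# dolly-v2 is a decoder-only (causal) model, so 'text2text-generation' does not apply.
# With no task given, trust_remote_code=True lets the pipeline defined in the model repo be used.
dolly = pipeline(
    model="databricks/dolly-v2-3b",
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
    device=0,
)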

In [None]:
# we've added device=0 to make sure we're using the GPU
# we can double check this as well by running the following
dolly.device, lamini.device

## Prompting

Let's see if they're doing what we expect them to do.
Be creative, play around, have fun!

In [None]:
prompt = "What's so neat about open source large language models?"

In [None]:
lamini(prompt)

In [None]:
dolly(prompt)
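
To make playing around a bit easier, the two calls above can be wrapped in a small helper that runs one prompt through several pipelines and collects the answers side by side. This is only a sketch: `compare` is a hypothetical helper, and it assumes each pipeline returns a list of dicts with a `generated_text` key, which is the usual output shape for text generation pipelines.

```python
def compare(pipes, prompt):
    """Run one prompt through several pipelines and return {name: generated text}.

    Assumes each pipeline returns a list of dicts with a "generated_text" key.
    """
    results = {}
    for name, pipe in pipes.items():
        outputs = pipe(prompt)
        # Keep only the first generated sequence for each model.
        results[name] = outputs[0]["generated_text"]
    return results
```

With the pipelines loaded earlier, `compare({"lamini": lamini, "dolly": dolly}, prompt)` would give one answer per model under its name.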