# Generating Text with a Mistral Model and Hugging Face Transformers

With a few simple lines you can download a Mistral model from the Hugging Face Model Hub and run text generation.

### Install Transformers

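First, make sure you have a recent version of `transformers` installed, for example with `pip install -U transformers accelerate bitsandbytes`. Mistral support was added in release 4.34.0; before that release it was only available in the nightly (source) build. The `accelerate` and `bitsandbytes` packages are needed for the `device_map="auto"` and 4-bit loading options used below.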
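To confirm that the installed release is new enough, a quick check along the following lines can help. This is a minimal sketch: `release_tuple` and `has_mistral_support` are hypothetical helper names, and the parser simply ignores pre-release suffixes such as `.dev0`, so a very early development build of 4.34.0 would be reported as sufficient even if it predates the Mistral merge.

```python
from importlib.metadata import PackageNotFoundError, version


def release_tuple(v):
    """Turn a version string such as '4.35.0.dev0' into (4, 35, 0)."""
    parts = []
    for piece in v.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break  # stop at the first non-digit, e.g. the 'rc' in '0rc1'
            digits += ch
        parts.append(int(digits or 0))
    while len(parts) < 3:
        parts.append(0)  # pad '4.34' to (4, 34, 0)
    return tuple(parts)


def has_mistral_support(installed, minimum="4.34.0"):
    """Return True if `installed` is at least the `minimum` release."""
    return release_tuple(installed) >= release_tuple(minimum)


try:
    installed = version("transformers")
    status = "OK" if has_mistral_support(installed) else "too old for Mistral"
    print(installed, status)
except PackageNotFoundError:
    print("transformers is not installed")
```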

### Download Model, Tokenize, and Generate Output

Then all you need to do is load the model from Hugging Face Hub, tokenize your prompt, and generate the output!

In [1]:
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_path = "mistralai/Mistral-7B-v0.1"

# Load the weights in 4-bit precision (requires bitsandbytes and a CUDA GPU);
# device_map="auto" lets accelerate place the layers on the available devices.
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained(model_path)

input_text = "Describe the solar system."
input_ids = tokenizer(input_text, return_tensors="pt").input_ids.to(model.device)

# max_length counts the prompt tokens too, so the output stops at 100 tokens in total
outputs = model.generate(input_ids, max_length=100)
print(tokenizer.decode(outputs[0]))



Loading checkpoint shards: 100%|██████████| 2/2 [00:08<00:00,  4.06s/it]
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results.
Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


<s> Describe the solar system.

The solar system is a collection of planets, moons, asteroids, comets, and other objects that orbit the Sun.

What is the difference between a planet and a dwarf planet?

A planet is a celestial body that orbits the Sun and is large enough to be rounded by its own gravity. A dwarf planet is a celestial body that orbits the Sun and is large enough to be rounded by its
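Two things stand out in the output above. The log warns that no attention mask or pad token id was set, because only `input_ids` was passed to `generate`. The text also stops mid-sentence, because `max_length=100` limits the prompt and the generated tokens combined. A minimal sketch of a helper that addresses both is shown here; `generate_text` is a hypothetical name, and it assumes the `model` and `tokenizer` objects loaded in the cell above.

```python
def generate_text(model, tokenizer, prompt, max_new_tokens=100):
    """Hypothetical helper: generate a continuation of `prompt`."""
    # Calling the tokenizer returns input_ids and attention_mask together.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(
        **inputs,                             # passes the attention mask as well
        max_new_tokens=max_new_tokens,        # budget for generated tokens only
        pad_token_id=tokenizer.eos_token_id,  # set explicitly, as the log message does
    )
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
```

With the model loaded, `print(generate_text(model, tokenizer, "Describe the solar system."))` should produce a comparable completion without the attention-mask warning. Using `max_new_tokens` rather than `max_length` keeps the length of the completion independent of the length of the prompt.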


In [2]:
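input_ids = tokenizer.encode(
    "Here is what I did during my safari in Africa:", return_tensors="pt"
).to(model.device)  # keep the prompt on the same device as the model

# Sample from the 50 most likely tokens at each step instead of decoding greedily
sample_output = model.generate(input_ids, do_sample=True, max_length=500, top_k=50)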

print("Output:\n" + 100 * "-")
print(tokenizer.decode(sample_output[0], skip_special_tokens=True))

The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results.
Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.


Output:
----------------------------------------------------------------------------------------------------
Here is what I did during my safari in Africa:
I saw a lion.
I saw some baboons.
I saw a bunch of giraffes.
I saw a bunch of bazillion zebras.
I saw buffalos and elephants.
I heard the growl of the lion.
I saw a water buffalo.
I was on a jeep.
I saw some monkeys.
I was trying to take photos of elephants, giraffes, hippos and water buffaloes.
I saw a male cheetah.
I smelled poop (giraffe, elephant).
I saw male lions.
I saw lots of giraffe.
I saw many male lions, too.
I smelled poop again.
I saw a female lion.
I saw male lions, lots of giraffe, and lots of zebra.
I climbed into another jeep and went on safari again.
I saw baby lions.
I did not see any baby lions, but lots of male/female/young lions.
I saw two cheetahs.
I saw my dad and my mom.
I met two Kenyan boys.
I saw an elephant and a rhino.
I saw baby lions.
I saw a crocodile and hippo.
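The second cell samples with `do_sample=True` and `top_k=50`, which is why its output varies from run to run. The idea behind top-k sampling can be illustrated in plain Python. This is a toy sketch with made-up scores for a five-token vocabulary, not the implementation used inside `generate`; `top_k_sample` is a hypothetical name.

```python
import math
import random


def top_k_sample(logits, k, rng):
    """Sample one token index from the k highest-scoring logits."""
    # Keep the indices of the k largest logits.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    # Softmax over the kept logits only (subtract the max for stability).
    m = max(logits[i] for i in top)
    weights = [math.exp(logits[i] - m) for i in top]
    # Draw one index in proportion to its renormalized probability.
    return rng.choices(top, weights=weights, k=1)[0]


logits = [2.0, 0.5, 1.5, -1.0, 0.1]  # toy scores for a 5-token vocabulary
rng = random.Random(0)
draws = [top_k_sample(logits, k=2, rng=rng) for _ in range(1000)]
print(sorted(set(draws)))  # only indices 0 and 2 can appear
```

A smaller `k` restricts each step to fewer candidates and makes the text more predictable, while a larger `k` admits less likely tokens and makes it more varied.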
