1. You are trying to use the T5 model to perform English-to-German translation. The model expects tokenized input.

In [1]:
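# Fixed: request PyTorch tensors (return_tensors='pt') to avoid the TensorFlow conflict
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained('t5-small')
model = AutoModelForSeq2SeqLM.from_pretrained('t5-small')

# T5 selects its task from the text prefix; this prefix requests English-to-German
text = 'translate English to German: Hello world'
inputs = tokenizer(text, return_tensors='pt')  # tokenize into PyTorch tensors
outputs = model.generate(**inputs)
# Decode the generated token IDs back to text, dropping padding and end-of-sequence tokens
print(tokenizer.decode(outputs[0], skip_special_tokens=True))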


Hallo Welt
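
T5 chooses its task from a plain-text prefix on the input, so the prefix has to name the intended language pair. The following is a minimal sketch of building that prefix. `build_t5_prompt` and `SUPPORTED_TARGETS` are hypothetical names introduced for illustration (they are not part of transformers), and the list of target languages is an assumption about which translation pairs the original T5 checkpoints cover.

```python
# Hypothetical helper, not part of transformers: builds the task prefix
# that tells a T5 checkpoint which translation direction to use.
# Assumption: the original T5 checkpoints cover these three targets.
SUPPORTED_TARGETS = ("German", "French", "Romanian")


def build_t5_prompt(text, target_language="German"):
    """Return `text` with the T5 translation prefix for `target_language`."""
    if target_language not in SUPPORTED_TARGETS:
        raise ValueError(f"unsupported target language: {target_language}")
    return f"translate English to {target_language}: {text}"


print(build_t5_prompt("Hello world"))
print(build_t5_prompt("Hello world", "French"))
```

The returned string can be passed to the tokenizer in place of the hard-coded `text` in the cell above.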


2. You wrote a RAG-like prompt flow but skipped the vector retrieval step. Complete it.

In [2]:
from sentence_transformers import SentenceTransformer
import faiss
import numpy as np

model = SentenceTransformer('all-MiniLM-L6-v2')

# Embed the documents; FAISS expects float32 arrays
documents = ["Gemini is powerful", "Vector search is fast"]
doc_embeddings = np.asarray(model.encode(documents), dtype='float32')

# Exact L2 index, sized from the embeddings (384 dimensions for this model)
index = faiss.IndexFlatL2(doc_embeddings.shape[1])
index.add(doc_embeddings)

# Retrieval step: embed the query and look up its nearest document
query = "Tell me about Gemini"
query_embedding = np.asarray(model.encode([query]), dtype='float32')

_, indices = index.search(query_embedding, k=1)
retrieved_doc = documents[indices[0][0]]

print("Answer:", retrieved_doc)



Answer: Gemini is powerful
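
The cell above stops at retrieval and prints the retrieved document as the answer. In a fuller RAG flow the retrieved text would normally be placed into a prompt for a generator model. A minimal sketch of that assembly step follows; `build_rag_prompt` and the prompt wording are hypothetical, chosen only for illustration.

```python
# Hypothetical helper: combines retrieved documents and the user query
# into a single prompt string for a downstream generator model.
def build_rag_prompt(query, retrieved_docs):
    # One bullet per retrieved document so the context stays readable
    context = "\n".join(f"- {doc}" for doc in retrieved_docs)
    return (
        "Answer the question using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {query}\n"
        "Answer:"
    )


prompt = build_rag_prompt("Tell me about Gemini", ["Gemini is powerful"])
print(prompt)
```

The resulting `prompt` could then be passed to any text-generation model. That generation step is outside what the original cell covers.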


In [4]:
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer('all-MiniLM-L6-v2')

documents = ["AI is evolving fast", "FAISS is used for similarity search", "Gemini is a generative model"]
query = "Tell me about FAISS"

# Encode as PyTorch tensors so the similarity can be computed directly
doc_embeddings = model.encode(documents, convert_to_tensor=True)
query_embedding = model.encode(query, convert_to_tensor=True)

# Cosine similarity between the query and every document (shape: 1 x len(documents))
cosine_scores = util.cos_sim(query_embedding, doc_embeddings)
top_result = int(cosine_scores.argmax())

print("Most relevant document:", documents[top_result])


Most relevant document: FAISS is used for similarity search
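
The FAISS cell ranks by L2 distance while this cell ranks by cosine similarity. For unit-length vectors the two give the same ordering, because the squared L2 distance equals 2 - 2 * cosine. The pure-Python sketch below checks that relationship on made-up 3-dimensional vectors. The vectors and helper names are illustrative assumptions, not real model embeddings or library functions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between vectors a and b."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def normalize(v):
    """Scale v to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def squared_l2(a, b):
    """Squared Euclidean distance between a and b."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


# Toy vectors (made up for illustration, not real embeddings)
query_vec = [1.0, 0.2, 0.0]
doc_vecs = [[0.0, 1.0, 0.0], [0.9, 0.1, 0.0], [0.0, 0.0, 1.0]]

cos_scores = [cosine_similarity(query_vec, d) for d in doc_vecs]
l2_dists = [squared_l2(normalize(query_vec), normalize(d)) for d in doc_vecs]

# Highest cosine and lowest normalized L2 distance pick the same document
best_by_cosine = max(range(len(doc_vecs)), key=lambda i: cos_scores[i])
best_by_l2 = min(range(len(doc_vecs)), key=lambda i: l2_dists[i])
print(best_by_cosine, best_by_l2)
```

Without normalization the two rankings can differ, which is one reason to pick the metric deliberately when building an index.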


In [5]:
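# Fixed text generation using PyTorch
from transformers import pipeline

gen = pipeline('text-generation', model='distilgpt2', framework='pt')
# Note: per the warning in the output below, the default max_new_tokens (256)
# takes precedence over max_length=30 in this run
out = gen('AI is changing the world', max_length=30)
print(out[0]['generated_text'])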


Device set to use cpu
Truncation was not explicitly activated but `max_length` is provided a specific value, please use `truncation=True` to explicitly truncate examples to max length. Defaulting to 'longest_first' truncation strategy. If you encode pairs of sequences (GLUE-style) with the tokenizer you can select this strategy more precisely by providing a specific strategy to `truncation`.
Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation.
Both `max_new_tokens` (=256) and `max_length`(=30) seem to have been set. `max_new_tokens` will take precedence. Please refer to the documentation for more information. (https://huggingface.co/docs/transformers/main/en/main_classes/text_generation)


AI is changing the world for the first time. The world needs a real man, and that needs to include a man.
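
The warnings above report that `max_length=30` was overridden by a default `max_new_tokens` of 256. One way to make the length limit explicit is to pass `max_new_tokens` directly, as sketched below. `generate_text` is a hypothetical wrapper introduced for illustration. It reuses the same `pipeline` call as the cell above, and whether the remaining warnings still appear may depend on the installed transformers version.

```python
# Hypothetical wrapper around the pipeline call from the cell above.
def generate_text(prompt, max_new_tokens=30, model_name="distilgpt2"):
    # Imported lazily so that defining this function does not load the
    # library or download model weights.
    from transformers import pipeline

    gen = pipeline("text-generation", model=model_name, framework="pt")
    # max_new_tokens limits only the generated continuation, not the prompt
    out = gen(prompt, max_new_tokens=max_new_tokens)
    return out[0]["generated_text"]
```

Calling `generate_text('AI is changing the world')` would run the same model as above. The output is sampled, so the text will differ from run to run.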