## Handling multiple sequences

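- Transformer models expect a batch of multiple sequences/sentences by default, so even a single sequence has to be wrapped in an extra dimension (here, `[ids]`) before it is passed to the model.
- For example: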

```python
import tensorflow as tf
from transformers import AutoTokenizer, TFAutoModelForSequenceClassification

checkpoint = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = TFAutoModelForSequenceClassification.from_pretrained(checkpoint)

sequence = "I've been waiting for a HuggingFace course my whole life."

tokens = tokenizer.tokenize(sequence)
ids = tokenizer.convert_tokens_to_ids(tokens)

input_ids = tf.constant([ids])
print("Input IDs:", input_ids)

output = model(input_ids)
print("Logits:", output.logits)
```



```
All PyTorch model weights were used when initializing TFDistilBertForSequenceClassification.

All the weights of TFDistilBertForSequenceClassification were initialized from the PyTorch model.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFDistilBertForSequenceClassification for predictions without further training.

Input IDs: tf.Tensor(
[[ 1045  1005  2310  2042  3403  2005  1037 17662 12172  2607  2026  2878
   2166  1012]], shape=(1, 14), dtype=int32)
Logits: tf.Tensor([[-2.7276196  2.8789363]], shape=(1, 2), dtype=float32)
```


Let's explain what is happening in the code above:

`tensorflow` is a machine learning library often used for building and training neural networks. In the code above, it is used to manage data as tensors, which are multi-dimensional arrays.
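To make the idea of a tensor's shape concrete, here is a minimal sketch that uses plain Python lists instead of TensorFlow. The helper `nested_shape` is hypothetical (it is not part of any library); the IDs are copied from the printed output above. It shows why wrapping `ids` in a list turns a flat sequence of 14 IDs into a batch of shape `(1, 14)`.

```python
def nested_shape(value):
    """Return the shape of a rectangular nested list as a tuple.

    Hypothetical helper for illustration only; it inspects just the
    first element at each nesting level.
    """
    shape = []
    while isinstance(value, list):
        shape.append(len(value))
        value = value[0] if value else None
    return tuple(shape)


# Token IDs copied from the "Input IDs" output above.
ids = [1045, 1005, 2310, 2042, 3403, 2005, 1037,
       17662, 12172, 2607, 2026, 2878, 2166, 1012]

# Wrapping the sequence in a list adds the leading batch dimension.
batch = [ids]

print(nested_shape(ids))    # one sequence: a single dimension
print(nested_shape(batch))  # a batch containing one sequence: two dimensions
```

`tf.constant([ids])` in the original code plays the same role: the outer list becomes the batch dimension of the resulting tensor.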

AutoTokenizer and TFAutoModelForSequenceClassification are classes from the transformers library by Hugging Face, which provides tools for using pre-trained language models.

The code above uses a pre-trained DistilBERT model to classify the sentiment of a given sentence. It converts the sentence into token IDs (the format the model understands), runs them through the model, and outputs the raw prediction scores (logits). These scores can then be processed further, for example with a softmax, to determine the predicted sentiment of the input text.
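As a rough sketch of that last step, the snippet below applies a softmax to the logits printed above using only the Python standard library. The label order (`NEGATIVE` at index 0, `POSITIVE` at index 1) is an assumption here; in practice it should be read from `model.config.id2label` rather than hard-coded.

```python
import math


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    highest = max(logits)
    # Subtracting the maximum keeps the exponentials numerically stable.
    exps = [math.exp(x - highest) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Logits copied from the model output above.
logits = [-2.7276196, 2.8789363]

# Assumed label order; check model.config.id2label for the real mapping.
labels = ["NEGATIVE", "POSITIVE"]

probs = softmax(logits)
predicted = labels[probs.index(max(probs))]

print("Probabilities:", probs)
print("Predicted label:", predicted)
```

With the TensorFlow tensors from the original code, the equivalent step would typically be `tf.nn.softmax(output.logits, axis=-1)`.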


