In [13]:
from transformers import pipeline

In [14]:
# PES2UG23CS703 EXPERIMENT 1: PROMPTING
prompt = "The future of Artificial Intelligence is"

# Base checkpoints under test, keyed by display name.
models = {
    "BERT": "bert-base-uncased",      # encoder-only
    "RoBERTa": "roberta-base",        # encoder-only
    "BART": "facebook/bart-base"      # encoder-decoder
}

# Try free-form generation with each checkpoint; report failures instead of stopping.
for name, model in models.items():
    try:
        gen = pipeline("text-generation", model=model)
        print(f"\n{name} output:")
        print(gen(prompt, max_new_tokens=20))
    except Exception as e:
        print(f"\n{name} failed:", e)

If you want to use `BertLMHeadModel` as a standalone, add `is_decoder=True.`
Device set to use cpu



BERT output:


If you want to use `RobertaLMHeadModel` as a standalone, add `is_decoder=True.`


[{'generated_text': 'The future of Artificial Intelligence is....................'}]


Device set to use cpu



RoBERTa output:
[{'generated_text': 'The future of Artificial Intelligence is'}]


Some weights of BartForCausalLM were not initialized from the model checkpoint at facebook/bart-base and are newly initialized: ['lm_head.weight', 'model.decoder.embed_tokens.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
Device set to use cpu



BART output:
[{'generated_text': 'The future of Artificial Intelligence isbeginbeginbegin jur Patricia Patricia 2014 Reaction Information Informationntax edited po Nazis edited Nazis Writer Writer�'}]
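
The three outputs above differ mainly in what was appended after the prompt. A minimal sketch of how that could be labeled programmatically is given here; `describe_continuation` is a hypothetical helper (not part of `transformers`), the strings are copied from the output above, and the BART string is shortened. It only inspects surface form and makes no judgment about text quality.

```python
# Generated texts copied from the Experiment 1 output (BART shortened for brevity).
prompt = "The future of Artificial Intelligence is"
recorded = {
    "BERT": "The future of Artificial Intelligence is....................",
    "RoBERTa": "The future of Artificial Intelligence is",
    "BART": "The future of Artificial Intelligence isbeginbeginbegin jur Patricia Patricia 2014",
}


def describe_continuation(prompt: str, generated_text: str) -> str:
    """Label the text a model appended after the prompt (hypothetical helper)."""
    # The pipeline echoes the prompt, so strip it to isolate the new text.
    if generated_text.startswith(prompt):
        continuation = generated_text[len(prompt):]
    else:
        continuation = generated_text
    continuation = continuation.strip()
    if not continuation:
        return "empty"
    if len(set(continuation)) == 1:
        return "single repeated character"
    return "other text"


for name, text in recorded.items():
    print(f"{name}: {describe_continuation(prompt, text)}")
```

A label of "other text" says nothing about coherence, so the BART output still has to be read by eye.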


In [15]:
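# PES2UG23CS703 EXPERIMENT 2: FILL-MASK
# "[MASK]" is BERT's mask token. RoBERTa and BART expect "<mask>",
# so this sentence is valid fill-mask input for BERT only.
sentence = "The goal of Generative AI is to [MASK] new content."

for name, model in models.items():
    try:
        fm = pipeline("fill-mask", model=model)
        print(f"\n{name} predictions:")
        print(fm(sentence))
    except Exception as e:
        # Raised when the input does not contain the tokenizer's own mask token.
        print(f"\n{name} failed:", e)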

Some weights of the model checkpoint at bert-base-uncased were not used when initializing BertForMaskedLM: ['bert.pooler.dense.bias', 'bert.pooler.dense.weight', 'cls.seq_relationship.bias', 'cls.seq_relationship.weight']
- This IS expected if you are initializing BertForMaskedLM from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing BertForMaskedLM from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
Device set to use cpu



BERT predictions:
[{'score': 0.5396932363510132, 'token': 3443, 'token_str': 'create', 'sequence': 'the goal of generative ai is to create new content.'}, {'score': 0.15575720369815826, 'token': 9699, 'token_str': 'generate', 'sequence': 'the goal of generative ai is to generate new content.'}, {'score': 0.05405500903725624, 'token': 3965, 'token_str': 'produce', 'sequence': 'the goal of generative ai is to produce new content.'}, {'score': 0.04451530799269676, 'token': 4503, 'token_str': 'develop', 'sequence': 'the goal of generative ai is to develop new content.'}, {'score': 0.01757744885981083, 'token': 5587, 'token_str': 'add', 'sequence': 'the goal of generative ai is to add new content.'}]


Device set to use cpu



RoBERTa predictions:

RoBERTa failed: No mask_token (<mask>) found on the input


Device set to use cpu



BART predictions:

BART failed: No mask_token (<mask>) found on the input
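
The RoBERTa and BART failures above come from the input string, which contains `[MASK]` while their tokenizers look for `<mask>`. One possible way to make the comparison fair is sketched below, under the assumption that the sentence is written with a generic placeholder and adapted per model. `adapt_mask` and `run_fill_mask` are hypothetical helper names; `fm.tokenizer.mask_token` is the tokenizer attribute holding the model's mask token.

```python
def adapt_mask(sentence: str, mask_token: str, placeholder: str = "[MASK]") -> str:
    """Swap a generic placeholder for the mask token a tokenizer expects."""
    return sentence.replace(placeholder, mask_token)


def run_fill_mask(model_id: str, sentence: str):
    """Run fill-mask after adapting the sentence to the model's own mask token."""
    # Imported lazily so adapt_mask can be used without transformers installed.
    from transformers import pipeline

    fm = pipeline("fill-mask", model=model_id)
    return fm(adapt_mask(sentence, fm.tokenizer.mask_token))


sentence = "The goal of Generative AI is to [MASK] new content."

# What RoBERTa or BART would receive after adaptation:
print(adapt_mask(sentence, "<mask>"))
```

Calling `run_fill_mask("roberta-base", sentence)` would then pass a sentence containing `<mask>`; the resulting predictions are not shown here because they were not part of the recorded run.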


In [16]:
# PES2UG23CS703 EXPERIMENT 3: QUESTION ANSWERING
question = "What are the risks?"
context = "Generative AI poses significant risks such as hallucinations, bias, and deepfakes."

# These are base checkpoints with no fine-tuned QA head, so the pipeline
# attaches a newly initialized one and the answers are not reliable.
for name, model in models.items():
    try:
        qa = pipeline("question-answering", model=model)
        print(f"\n{name} answer:")
        print(qa(question=question, context=context))
    except Exception as e:
        print(f"\n{name} failed:", e)

Some weights of BertForQuestionAnswering were not initialized from the model checkpoint at bert-base-uncased and are newly initialized: ['qa_outputs.bias', 'qa_outputs.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
Device set to use cpu



BERT answer:
{'score': 0.009666637051850557, 'start': 46, 'end': 82, 'answer': 'hallucinations, bias, and deepfakes.'}


Some weights of RobertaForQuestionAnswering were not initialized from the model checkpoint at roberta-base and are newly initialized: ['qa_outputs.bias', 'qa_outputs.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
Device set to use cpu



RoBERTa answer:
{'score': 0.004378531593829393, 'start': 14, 'end': 67, 'answer': 'poses significant risks such as hallucinations, bias,'}


Some weights of BartForQuestionAnswering were not initialized from the model checkpoint at facebook/bart-base and are newly initialized: ['qa_outputs.bias', 'qa_outputs.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
Device set to use cpu



BART answer:
{'score': 0.03242884669452906, 'start': 14, 'end': 81, 'answer': 'poses significant risks such as hallucinations, bias, and deepfakes'}
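
Each answer above comes with character offsets and a score, which makes the results easy to sanity-check. The sketch below, using values copied from the output (scores rounded to four decimals), confirms that `start` and `end` index into the context and flags low-confidence answers. `span_matches` is a hypothetical helper and the 0.5 threshold is an assumption chosen only for illustration.

```python
context = "Generative AI poses significant risks such as hallucinations, bias, and deepfakes."

# Results copied from the Experiment 3 output (scores rounded to four decimals).
recorded = {
    "BERT": {"score": 0.0097, "start": 46, "end": 82,
             "answer": "hallucinations, bias, and deepfakes."},
    "RoBERTa": {"score": 0.0044, "start": 14, "end": 67,
                "answer": "poses significant risks such as hallucinations, bias,"},
    "BART": {"score": 0.0324, "start": 14, "end": 81,
             "answer": "poses significant risks such as hallucinations, bias, and deepfakes"},
}

CONFIDENCE_THRESHOLD = 0.5  # assumed cut-off, for illustration only


def span_matches(context: str, result: dict) -> bool:
    """Check that the reported offsets reproduce the reported answer text."""
    return context[result["start"]:result["end"]] == result["answer"]


for name, result in recorded.items():
    consistent = span_matches(context, result)
    confident = result["score"] >= CONFIDENCE_THRESHOLD
    print(f"{name}: span consistent={consistent}, confident={confident}")
```

All three spans are internally consistent, yet every score is far below the assumed threshold, which fits the warnings that the `qa_outputs` weights were newly initialized.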


#PES2UG23CS703
## Observation Table – Unit 1 Model Benchmark Results

| Task        | Model    | Outcome | Observation | Architectural Reason |
|------------|----------|---------|-------------|----------------------|
| Generation | BERT     | Failure | Appended only a run of periods after the prompt | Encoder-only model pretrained with masked language modeling, not autoregressive text generation |
| Generation | RoBERTa  | Failure | Returned the prompt unchanged, with no new tokens | Encoder-only model pretrained with masked language modeling, not autoregressive text generation |
| Generation | BART     | Partial | Generated new tokens, but the text was incoherent noise | Encoder–decoder model loaded as `BartForCausalLM`; the warning shows `lm_head.weight` and `model.decoder.embed_tokens.weight` were newly initialized, so the output layer is untrained |
| Fill-Mask  | BERT     | Success | Ranked “create” (0.54) and “generate” (0.16) highest | Pretrained with masked language modeling using the `[MASK]` token |
| Fill-Mask  | RoBERTa  | Failure | Pipeline raised "No mask_token (`<mask>`) found on the input" | Input format issue rather than an architectural limit: RoBERTa is pretrained with masked language modeling, but its tokenizer expects `<mask>`, not `[MASK]` |
| Fill-Mask  | BART     | Failure | Pipeline raised "No mask_token (`<mask>`) found on the input" | Same input format issue: BART's tokenizer also expects `<mask>`, so the model was never actually run on the sentence |
| QA         | BERT     | Partial | Returned “hallucinations, bias, and deepfakes.” with a score of about 0.01 | Base checkpoint is not fine-tuned for QA; the `qa_outputs` head was newly initialized |
| QA         | RoBERTa  | Partial | Returned the truncated span “poses significant risks such as hallucinations, bias,” with a score of about 0.004 | Base checkpoint is not fine-tuned for QA; the `qa_outputs` head was newly initialized |
| QA         | BART     | Partial | Returned the long span “poses significant risks such as hallucinations, bias, and deepfakes” with a score of about 0.03 | Base checkpoint is not fine-tuned for QA; the `qa_outputs` head was newly initialized |
