### About
### Generate answers to questions from a predefined text using an LLM (an extractive question-answering (QA) model).

### I used a pre-trained model directly, without fine-tuning it on the training data, mainly because of my limited experience with NLP QA tasks. I loaded the model and predicted answers for the test data using the transformers inference pipeline.

### Install Necessary Libraries & Packages

In [None]:
! pip install -U git+https://github.com/huggingface/transformers.git
! pip install -U git+https://github.com/huggingface/accelerate.git

In [1]:
import numpy as np # linear algebra
import pandas as pd # data processing, CSV file I/O (e.g. pd.read_csv)
from tqdm.notebook import tqdm as tqdm_notebook  # tqdm.tqdm_notebook is deprecated; keep the same name
import seaborn as sns
from datasets import Dataset
import re
import shutil
from transformers import AutoModelForQuestionAnswering, AutoTokenizer, pipeline



### Load test data

In [2]:
test = pd.read_csv("/kaggle/input/mh-intel-oneapi-hackathon-the-llm-challenge/test.csv")

### Select the deepset/roberta-base-squad2 model

In [9]:
model_name = 'deepset/roberta-base-squad2'

### Transformers Inference pipeline

In [10]:

# device=0 places the model on the first GPU
nlp_qa = pipeline('question-answering', model=model_name, tokenizer=model_name, device=0)
QA_input = {
    'question': test['Question'][1888],
    'context': test['Story'][1888]
}
res = nlp_qa(QA_input)
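
The cell above hard-codes `device=0`, which assumes a GPU is attached to the session. A minimal sketch of a fallback follows, assuming PyTorch is the backend (the notebook does not import it explicitly); `pick_device` is a hypothetical helper, not part of transformers. It relies on the pipeline convention that an integer `device` of `-1` means CPU.

```python
def pick_device() -> int:
    """Return 0 (first CUDA GPU) if PyTorch reports one, otherwise -1 (CPU)."""
    try:
        import torch  # assumed backend; not imported elsewhere in the notebook
    except ImportError:
        return -1
    return 0 if torch.cuda.is_available() else -1


device = pick_device()
print(device)

# The result could then be passed to the pipeline call used in the notebook:
# nlp_qa = pipeline('question-answering', model=model_name,
#                   tokenizer=model_name, device=device)
```

On a CPU-only session this would let the same notebook run, only more slowly.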

### Sample content

In [11]:
test['Story'][1888]

'Bloomberg L.P. is a privately held financial software, data, and media company headquartered in Midtown Manhattan, New York City. Bloomberg L.P. was founded by Michael Bloomberg in 1981 with the help of Thomas Secunda, Duncan MacMillan, Charles Zegar, and a 30% ownership investment by Merrill Lynch. \n\nBloomberg L.P. provides financial software tools such as an analytics and equity trading platform, data services, and news to financial companies and organizations through the Bloomberg Terminal (via its Bloomberg Professional Service), its core revenue-generating product. Bloomberg L.P. also includes a wire service (Bloomberg News), a global television network (Bloomberg Television), digital websites, a radio station (WBBR), subscription-only newsletters, and three magazines: "Bloomberg Businessweek", "Bloomberg Markets", and "Bloomberg Pursuits". In 2014, Bloomberg L.P. launched Bloomberg Politics, a multiplatform media property that merged the company\'s political news teams, and ha

### Sample question

In [12]:
test['Question'][1888]

'What does IMS stand for?'

### Sample answer generated by the pipeline

In [13]:
res

{'score': 0.9113449454307556,
 'start': 1346,
 'end': 1371,
 'answer': 'Innovative Market Systems'}
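
The `start` and `end` values in this output appear to be character offsets into the context: `end - start` is 1371 - 1346 = 25, which is the length of `'Innovative Market Systems'`. The sketch below illustrates that relationship on a short, made-up context (not taken from the test set); `span_matches` is a hypothetical helper, and the result dict is built by hand to mirror the pipeline's output format.

```python
def span_matches(context: str, result: dict) -> bool:
    """Check that slicing the context with start/end reproduces the answer text."""
    return context[result['start']:result['end']] == result['answer']


# Made-up context for illustration only
context = "The company was first called Innovative Market Systems (IMS)."
answer = "Innovative Market Systems"

# Hand-built result in the same shape as the pipeline output (score omitted)
start = context.index(answer)
result = {'start': start, 'end': start + len(answer), 'answer': answer}

print(result['end'] - result['start'])  # 25
print(span_matches(context, result))    # True
```

With the notebook's own objects, the equivalent check would be `span_matches(test['Story'][1888], res)`.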

### Predict test data

In [None]:
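test_ans = []
for i in tqdm_notebook(range(test.shape[0])):
    QA_input = {
        'question': test['Question'][i],
        'context': test['Story'][i]
    }
    res = nlp_qa(QA_input)
    # res is a dict, so test the answer text itself; fall back to "unknown" when it is empty
    if res and res['answer'].strip():
        test_ans.append(res['answer'])
    else:
        test_ans.append("unknown")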
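
The prediction loop can also be factored into a small function, which makes the fallback logic easy to check without loading the model. This is a sketch under assumptions: `predict_answers` and `fake_qa` are hypothetical names, and `qa_fn` is any callable that accepts `question=` and `context=` keyword arguments and returns a dict with an `'answer'` key, as the question-answering pipeline does.

```python
from typing import Callable, Iterable, List


def predict_answers(qa_fn: Callable[..., dict],
                    questions: Iterable[str],
                    contexts: Iterable[str],
                    fallback: str = "unknown") -> List[str]:
    """Run qa_fn on each (question, context) pair; use fallback for empty answers."""
    answers = []
    for question, context in zip(questions, contexts):
        res = qa_fn(question=question, context=context)
        text = (res or {}).get('answer', '').strip()
        answers.append(text if text else fallback)
    return answers


def fake_qa(question: str, context: str) -> dict:
    """Stand-in for the real pipeline: answers with the first word of the context."""
    words = context.split()
    return {'answer': words[0] if words else ''}


demo = predict_answers(fake_qa, ["Which city?", "Which city?"], ["Paris is a city", ""])
print(demo)  # ['Paris', 'unknown']
```

With the notebook's objects, the call would be `predict_answers(nlp_qa, test['Question'], test['Story'])`. Since the model was trained on SQuAD 2.0, which contains unanswerable questions, passing `handle_impossible_answer=True` to the pipeline call may be worth trying so that an empty answer can actually be returned; whether that improves the score here is untested.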

### Test Result

In [None]:
res_final = pd.DataFrame({'Answer': test_ans})


In [None]:
res_final.head(10)

In [None]:
res_final.to_csv("deepset_roberta-base-squad2.csv", index=False)