---

## 🔹 Installing Required Libraries  
To perform text summarization, we install the `transformers` library, which provides access to pre-trained summarization models through a single high-level API. The `-q` flag keeps pip's output quiet.  

---

In [None]:
!pip install transformers -q
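
The `pipeline` API also relies on a deep-learning backend such as PyTorch. Colab ships with one preinstalled, but other environments may not. A minimal sketch for checking that the needed packages are importable, assuming PyTorch is the backend in use; `missing_packages` is a hypothetical helper, not part of `transformers`:

```python
import importlib.util


def missing_packages(names):
    """Return the subset of top-level package names that cannot be imported."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# "transformers" is installed above; "torch" is the backend assumed here.
print(missing_packages(["transformers", "torch"]))
```

An empty list means both packages were found.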


## 🔹 Importing Necessary Modules  
To utilize the summarization model, we import the `pipeline` function from `transformers`. This allows us to easily use pre-trained models for various NLP tasks, including summarization.  

---

In [None]:
from transformers import pipeline

---

## 🔹 Initializing the Summarization Model  
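A summarization model is loaded using the `pipeline` function. Because no model name is given, the library falls back to its default checkpoint, `sshleifer/distilbart-cnn-12-6`, and the first run downloads its weights (about 1.22 GB). This model is pre-trained on large datasets and can generate concise summaries of input text.  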

---

In [None]:
summarizer_model = pipeline("summarization")

No model was supplied, defaulted to sshleifer/distilbart-cnn-12-6 and revision a4f8f3e (https://huggingface.co/sshleifer/distilbart-cnn-12-6).
Using a pipeline without specifying a model name and revision in production is not recommended.

Device set to use cuda:0
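
The log above warns against relying on the default model in production. A minimal sketch of pinning the checkpoint explicitly, using the model name and revision reported in that log; the function name `load_pinned_summarizer` is illustrative:

```python
# Model name and revision are copied from the log message above.
MODEL_NAME = "sshleifer/distilbart-cnn-12-6"
MODEL_REVISION = "a4f8f3e"


def load_pinned_summarizer(model_name=MODEL_NAME, revision=MODEL_REVISION):
    """Load a summarization pipeline for a fixed checkpoint and revision."""
    # Imported lazily so that defining this helper does not load the library.
    from transformers import pipeline

    return pipeline("summarization", model=model_name, revision=revision)


# Usage (downloads the weights on the first call):
# summarizer_model = load_pinned_summarizer()
```

Pinning both values should keep the notebook on the same weights even if the library's default changes later.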


---

## 🔹 Input Text  
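For demonstration, we use a paragraph describing the **Eiffel Tower**. The text covers its height, its base dimensions, and how it held the title of tallest man-made structure in the world for 41 years. The last cell in this section passes the text to the default pipeline with `do_sample=False` to produce a first summary.  

---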

In [None]:
text = """The tower is 324 metres (1,063 ft) tall, about the same height as an 81-storey building, and the tallest structure in Paris. Its base is square, measuring 125 metres (410 ft) on each side. During its construction, the Eiffel Tower surpassed the Washington Monument to become the tallest man-made structure in the world, a title it held for 41 years until the Chrysler Building in New York City was finished in 1930. It was the first structure to reach a height of 300 metres. Due to the addition of a broadcasting aerial at the top of the tower in 1957, it is now taller than the Chrysler Building by 5.2 metres (17 ft). Excluding transmitters, the Eiffel Tower is the second tallest free-standing structure in France after the Millau Viaduct."""

In [None]:
summarized_text = summarizer_model(text, max_length=130, min_length=30, do_sample=False)

---

## 🔹 Checking the Length of the Text  
We first display the summary generated above with `do_sample=False`, then count the words in the original text. The word count gives a baseline for comparing the input with the summarized output and gauging the degree of compression achieved.  

---

In [None]:
summarized_text

[{'summary_text': ' The tower is 324 metres (1,063 ft) tall, about the same height as an 81-storey building . It was the first structure to reach a height of 300 metres . It is now taller than the Chrysler Building by 5.2 metres (17 ft) Excluding transmitters, the Eiffel Tower is the second tallest free-standing structure in France .'}]

In [None]:
len(text.split())

130

---

## 🔹 Performing Summarization  
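The text is passed through the summarization pipeline again, this time with `do_sample=True`, so the output is sampled and may differ between runs. The model generates a shorter version while preserving key details. The `min_length` and `max_length` arguments bound the summary length, measured in tokens rather than words, so that the summary is neither trivially short nor nearly as long as the input.  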

---

In [None]:
summarized_text1 = summarizer_model(text, max_length=130, min_length=30, do_sample=True)

In [None]:
summarized_text1

[{'summary_text': ' The tower is 324 metres (1,063 ft) tall, about the same height as an 81-storey building . It was the first structure to reach a height of 300 metres . It is now taller than the Chrysler Building by 5.2 metres (17 ft) Excluding transmitters, the Eiffel Tower is the second tallest free-standing structure in France after the Millau Viaduct .'}]

In [None]:
len(summarized_text1[0]['summary_text'].split())

61
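
The length limits passed to the pipeline count tokens, while `len(...split())` counts whitespace-separated words, so the two numbers are not directly comparable. A minimal sketch of counting tokens with the pipeline's own tokenizer, which is exposed as its `tokenizer` attribute; `count_tokens` is a hypothetical helper name:

```python
def count_tokens(tokenizer, text):
    """Count the tokens a tokenizer produces for `text`.

    The count includes any special tokens the tokenizer adds by default.
    """
    return len(tokenizer(text)["input_ids"])


# Usage with the pipeline loaded earlier (assumes `summarizer_model` exists):
# count_tokens(summarizer_model.tokenizer, text)
# count_tokens(summarizer_model.tokenizer, summarized_text1[0]["summary_text"])
```

Comparing the token count of a summary against `max_length` shows how close the model came to the limit.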

---

## 🔹 Using an Alternative Summarization Model  
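To compare results, we load a different pre-trained model, `facebook/bart-large-cnn`, which is BART-large fine-tuned on the CNN/DailyMail news summarization dataset. The default `sshleifer/distilbart-cnn-12-6` used above is a smaller, distilled version of this model, so the larger checkpoint (a 1.63 GB download) may produce a more refined summary.  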

---

In [None]:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("summarization", model="facebook/bart-large-cnn")

Device set to use cuda:0


In [None]:
summarized_text2 = pipe(text, max_length=130, min_length=30, do_sample=False)

In [None]:
summarized_text2

[{'summary_text': 'The tower is 324 metres (1,063 ft) tall, about the same height as an 81-storey building. Its base is square, measuring 125 metres (410 ft) on each side. It is the second tallest free-standing structure in France after the Millau Viaduct.'}]

---

## 🔹 Comparing Summaries  
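After generating summaries from both models, we compare them on length and content. Both open with the tower's height and both keep its ranking among free-standing structures in France. The DistilBART summaries also retain the 300-metre milestone and the comparison with the Chrysler Building, whereas `facebook/bart-large-cnn` keeps the base dimensions instead and drops the "Excluding transmitters" qualifier. The DistilBART output also shows formatting artifacts: spaces before periods and a missing period after "(17 ft)".  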
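
A minimal sketch of the length comparison. The summary strings are copied from the outputs above, and 130 is the word count of the original text reported earlier; the dictionary keys and the name `ORIGINAL_WORDS` are illustrative:

```python
ORIGINAL_WORDS = 130  # word count of the input text, from the earlier cell

# Summary strings copied verbatim from the outputs above.
summaries = {
    "distilbart, do_sample=False": (
        " The tower is 324 metres (1,063 ft) tall, about the same height as an "
        "81-storey building . It was the first structure to reach a height of "
        "300 metres . It is now taller than the Chrysler Building by 5.2 metres "
        "(17 ft) Excluding transmitters, the Eiffel Tower is the second tallest "
        "free-standing structure in France ."
    ),
    "distilbart, do_sample=True": (
        " The tower is 324 metres (1,063 ft) tall, about the same height as an "
        "81-storey building . It was the first structure to reach a height of "
        "300 metres . It is now taller than the Chrysler Building by 5.2 metres "
        "(17 ft) Excluding transmitters, the Eiffel Tower is the second tallest "
        "free-standing structure in France after the Millau Viaduct ."
    ),
    "bart-large-cnn": (
        "The tower is 324 metres (1,063 ft) tall, about the same height as an "
        "81-storey building. Its base is square, measuring 125 metres (410 ft) "
        "on each side. It is the second tallest free-standing structure in "
        "France after the Millau Viaduct."
    ),
}

# split() separates on whitespace, so a detached " ." counts as a word.
word_counts = {name: len(summary.split()) for name, summary in summaries.items()}

for name, n_words in word_counts.items():
    print(f"{name}: {n_words} words, {n_words / ORIGINAL_WORDS:.0%} of the original")
```

On these strings the counts come out to 57, 61, and 41 words respectively.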

---

## 🔹 Summary of Findings  
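- The **original text** contains **130 words**.  
- The **summarized text** is significantly shorter while maintaining important details: the sampled DistilBART summary has **61 words**, a count that includes the detached periods `split()` treats as words.  
- The summaries generated by the two models differ in which sentences they keep, but both condense the content effectively.  
- In this example, the larger **BART model** (`facebook/bart-large-cnn`) produces a cleaner, better-punctuated summary, making it a suitable option when summary quality matters more than model size.  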
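
One way to put a number on how much two summaries differ is a similarity ratio over their words. A minimal sketch using Python's standard `difflib`; this measure is an illustrative choice made here, not one used in the notebook, and `summary_similarity` is a hypothetical helper:

```python
from difflib import SequenceMatcher


def summary_similarity(summary_a, summary_b):
    """Return a 0-1 similarity ratio over whitespace-separated, lowercased words."""
    words_a = summary_a.lower().split()
    words_b = summary_b.lower().split()
    return SequenceMatcher(None, words_a, words_b).ratio()


# Usage with the notebook's results (assumes the summaries were generated above):
# summary_similarity(summarized_text1[0]["summary_text"],
#                    summarized_text2[0]["summary_text"])
```

A ratio near 1 means the two summaries share most of their wording in the same order; a ratio near 0 means they share little.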

---