# 1. Introduction
**Text Summarization** is a natural language processing (NLP) technique that *reduces a large body of text to a shorter version while retaining its main ideas and essential information.* The goal is to create a concise summary that accurately reflects the key points of the original content.

There are two primary types of text summarization:
1. Extractive Summarization
2. Abstractive Summarization

## 1. Extractive Summarization
* **Definition:** This method *selects important sentences, phrases, or sections* directly from the original text to form the summary.
* **How it works:** The algorithm identifies the most relevant parts based on factors like *sentence importance, keyword frequency, or similarity to the main topic.*
* **Pros:** Preserves the original wording and is straightforward to implement.
* **Cons:** May result in a less coherent summary, since it only pieces together parts of the original text.

**Example:** Given a paragraph about climate change, an extractive summary might pick out sentences like:
* "Climate change is a significant global challenge."
* "Greenhouse gas emissions contribute to rising temperatures."
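
The keyword-frequency scoring described above can be sketched in a few lines of plain Python. This is a minimal illustration rather than a production method: the `STOP_WORDS` list, the regex-based sentence splitter, and the function names `split_sentences` and `extractive_summary` are all assumptions made for this example.

```python
import re
from collections import Counter

# Small illustrative stop-word list (an assumption; real systems use larger lists).
STOP_WORDS = {
    "a", "an", "and", "are", "as", "be", "been", "by", "from", "have", "in",
    "is", "into", "like", "may", "of", "such", "the", "these", "this", "to",
}

WORD_PATTERN = re.compile(r"[a-z']+")


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    return [s for s in sentences if s]


def content_words(text):
    """Lowercased words with stop words removed."""
    return [w for w in WORD_PATTERN.findall(text.lower()) if w not in STOP_WORDS]


def extractive_summary(text, num_sentences=2):
    """Return the highest-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    freq = Counter(content_words(text))

    def score(sentence):
        tokens = content_words(sentence)
        # Average frequency, so long sentences are not favored automatically.
        return sum(freq[w] for w in tokens) / len(tokens) if tokens else 0.0

    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    chosen = sorted(ranked[:num_sentences])
    return " ".join(sentences[i] for i in chosen)


sample = (
    "Climate change refers to long-term changes in temperatures and weather patterns. "
    "These changes may be natural, such as through variations in the solar cycle. "
    "Human activities have been the main driver of climate change since the 1800s. "
    "Burning fossil fuels releases greenhouse gases into the atmosphere. "
    "These gases trap heat, causing the average temperature to rise."
)

print(extractive_summary(sample, num_sentences=2))
```

Because every sentence in the result is copied verbatim from the input, the summary cannot introduce new claims, but the selected sentences may not flow well when read together.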


## 2. Abstractive Summarization
* **Definition:** This approach generates new sentences that convey the main ideas of the original text, often paraphrasing or rephrasing the content.
* **How it works:** The algorithm uses deep learning models, typically transformer-based text generation models (e.g., BART, T5, GPT), to understand the text's meaning and then creates a new, concise version.
* **Pros:** Can provide more coherent and natural summaries, mimicking human summarization.
* **Cons:** More complex, may require significant computational power, and can generate statements that the original text does not support.


**Example:** The same paragraph on climate change might be summarized as:
* "Rising greenhouse gas emissions are driving global warming, posing serious environmental challenges."

## Applications of Text Summarization
* **News Aggregators:** Providing brief summaries of news articles.
* **Search Engines:** Displaying snippets of relevant information in search results.
* **Research:** Summarizing academic papers or articles for quick review.
* **Customer Support:** Summarizing customer inquiries or support tickets.

# 2. Import libraries

In [1]:
!pip install transformers
from transformers import pipeline



# 3. Load model
**BART (facebook/bart-large-cnn):** A BART-large checkpoint fine-tuned on the CNN/Daily Mail news dataset for summarization. It generally produces fluent, coherent summaries.



In [2]:
# Choose the model ('facebook/bart-large-cnn' for BART, 't5-base' for T5)
model_name = "facebook/bart-large-cnn"
summarizer = pipeline("summarization", model=model_name)

# 4. Define Input Text

In [3]:
# Input text (you can replace this with any long text you want to summarize)
text = """
Climate change refers to long-term changes in temperatures and weather patterns. These changes may be natural,
such as through variations in the solar cycle. However, since the 1800s, human activities have been the main driver
of climate change, primarily due to burning fossil fuels like coal, oil, and gas. Burning these materials releases
what are called greenhouse gases into Earth’s atmosphere. These gases trap heat from the sun, causing Earth’s average
temperature to rise. This rise in temperature is known as global warming.
"""

# 5. Generate the Summary

In [4]:
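# Generate summary; max_length and min_length bound the summary length in tokens (adjust as needed).
# do_sample=False disables sampling, so the same input yields the same summary.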
summary = summarizer(text, max_length=100, min_length=30, do_sample=False)
print("Summary:")
print(summary[0]['summary_text'])

Summary:
Since the 1800s, human activities have been the main driver of climate change. This is due to burning fossil fuels like coal, oil, and gas. Burning these materials releases greenhouse gases into the atmosphere. These gases trap heat from the sun, causing Earth’s average temperature to rise. This rise in temperature is known as global warming.
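
One quick way to judge whether `max_length` and `min_length` need adjusting is to compare the length of the summary with the length of the input. The sketch below does this with whitespace-separated word counts, which only approximate the token counts the model works with. The helper names `word_count` and `compression_ratio` are hypothetical, and the two strings are copied from the cells above so the snippet runs on its own.

```python
def word_count(text):
    """Count whitespace-separated words (a rough stand-in for model tokens)."""
    return len(text.split())


def compression_ratio(original, summary):
    """Summary length divided by original length, measured in words."""
    original_words = word_count(original)
    if original_words == 0:
        raise ValueError("original text is empty")
    return word_count(summary) / original_words


# Input text and generated summary, copied from the cells above.
original = """
Climate change refers to long-term changes in temperatures and weather patterns. These changes may be natural,
such as through variations in the solar cycle. However, since the 1800s, human activities have been the main driver
of climate change, primarily due to burning fossil fuels like coal, oil, and gas. Burning these materials releases
what are called greenhouse gases into Earth’s atmosphere. These gases trap heat from the sun, causing Earth’s average
temperature to rise. This rise in temperature is known as global warming.
"""

summary = (
    "Since the 1800s, human activities have been the main driver of climate change. "
    "This is due to burning fossil fuels like coal, oil, and gas. "
    "Burning these materials releases greenhouse gases into the atmosphere. "
    "These gases trap heat from the sun, causing Earth’s average temperature to rise. "
    "This rise in temperature is known as global warming."
)

ratio = compression_ratio(original, summary)
print(f"Original: {word_count(original)} words")
print(f"Summary:  {word_count(summary)} words")
print(f"Compression ratio: {ratio:.2f}")
```

A ratio close to 1 means the summary is nearly as long as the input. For a short passage like this one, lowering `max_length` (and `min_length` with it) would be one way to force a tighter summary.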
