In [4]:
!pip install transformers torch sentencepiece -q


In [5]:
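import warnings
# Hides Python warnings only; transformers notices (such as the max_length
# message further down) are emitted through logging and still appear.
warnings.filterwarnings('ignore')
print("✓ Warnings suppressed")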




In [6]:
from transformers import pipeline
print("✓ Transformers library imported successfully")


✓ Transformers library imported successfully


In [7]:
import torch
device = 0 if torch.cuda.is_available() else -1
device_name = "GPU" if device == 0 else "CPU"
print(f"✓ Using device: {device_name}")


✓ Using device: CPU


In [8]:
print("Loading DistilBART model... (this may take 30-60 seconds)")
summarizer = pipeline(
    'summarization',
    model='sshleifer/distilbart-cnn-12-6',
    device=device
)
print("✓ Model loaded successfully!")

Loading DistilBART model... (this may take 30-60 seconds)


Device set to use cpu


✓ Model loaded successfully!


In [9]:
test_text = """
Artificial intelligence is transforming the world. Machine learning algorithms
can now recognize images, translate languages, and even drive cars. Deep learning
has enabled breakthroughs in natural language processing.
"""

print("Testing with short sample...")
result = summarizer(test_text, max_length=50, min_length=10)
print(f"\nOriginal ({len(test_text.split())} words):\n{test_text}")
print(f"\nSummary:\n{result[0]['summary_text']}")

Your max_length is set to 50, but your input_length is only 44. Since this is a summarization task, where outputs shorter than the input are typically wanted, you might consider decreasing max_length manually, e.g. summarizer('...', max_length=22)


Testing with short sample...

Original (28 words):

Artificial intelligence is transforming the world. Machine learning algorithms 
can now recognize images, translate languages, and even drive cars. Deep learning 
has enabled breakthroughs in natural language processing.


Summary:
 Machine learning algorithms can now recognize images, translate languages, and even drive cars . Deep learning has enabled breakthroughs in natural language processing .
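
The notice above appears because max_length=50 exceeds the 44-token input; the value it suggests (22) is half the input length. A minimal sketch of a helper that derives both bounds from the input's token count follows. The name `suggest_lengths`, its `ratio` and `floor` parameters, and the one-third rule for min_length are illustrative assumptions, not part of the transformers API.

```python
from typing import Tuple


def suggest_lengths(input_tokens: int, ratio: float = 0.5, floor: int = 5) -> Tuple[int, int]:
    """Return (min_length, max_length) scaled to the input size.

    Hypothetical helper: ratio=0.5 mirrors the halving suggested by the
    pipeline's notice; taking a third of max_length for min_length is an
    arbitrary choice made for illustration.
    """
    max_length = max(floor, int(input_tokens * ratio))
    min_length = max(1, max_length // 3)
    return min_length, max_length


# Token counts taken from the notices printed in this notebook.
for tokens in (44, 89, 17):
    print(tokens, suggest_lengths(tokens))
```

In the notebook, the token count could probably be obtained with len(summarizer.tokenizer(test_text)['input_ids']), and the returned pair passed as min_length and max_length.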


In [10]:
article_tech = """
OpenAI has announced a major breakthrough in artificial intelligence with the
release of their latest language model. The new system demonstrates unprecedented
capabilities in understanding context, generating human-like text, and solving
complex reasoning tasks. Researchers at the company spent over two years developing
the architecture, which uses advanced transformer networks and massive computational
resources. The model was trained on a diverse dataset including books, articles,
and websites, allowing it to generate coherent responses across a wide range of topics.

Industry experts are calling this a watershed moment for AI development. The system
can now engage in multi-turn conversations, remember context from earlier in the
dialogue, and even admit when it doesn't know something. However, the technology
also raises important ethical questions about the future of work, the spread of
misinformation, and the need for robust safety measures.

Several tech giants have already expressed interest in licensing the technology
for their own products. Microsoft, a major investor in OpenAI, plans to integrate
the model into its suite of productivity tools. Meanwhile, competitors like Google
and Meta are racing to develop their own comparable systems. The AI arms race is
heating up, with billions of dollars being invested in research and development.

Critics warn that the rapid pace of AI advancement could outstrip society's ability
to adapt. They call for comprehensive regulations, transparency in AI development,
and public discourse about the implications of these powerful technologies. Some
experts predict that AI will fundamentally reshape education, healthcare, and
creative industries within the next decade.
"""

print(f"Tech article loaded: {len(article_tech.split())} words")

Tech article loaded: 251 words


In [11]:
print("Summarizing tech article with default settings...")
summary_default = summarizer(article_tech, max_length=130, min_length=30)
print(f"\n--- DEFAULT SUMMARY ---")
print(summary_default[0]['summary_text'])
print(f"\nSummary length: {len(summary_default[0]['summary_text'].split())} words")


Summarizing tech article with default settings...

--- DEFAULT SUMMARY ---
 New language model developed by OpenAI uses transformer networks and massive computational resources . System can now engage in multi-turn conversations, remember context from earlier in the conversation, and even admit when it doesn't know something . Microsoft, a major investor in OpenAI, plans to integrate the model into its suite of productivity tools .

Summary length: 55 words


In [12]:
print("Creating an ULTRA-SHORT summary...")
summary_short = summarizer(article_tech, max_length=50, min_length=20)
print(f"\n--- ULTRA-SHORT SUMMARY ---")
print(summary_short[0]['summary_text'])
print(f"\nSummary length: {len(summary_short[0]['summary_text'].split())} words")


Creating an ULTRA-SHORT summary...

--- ULTRA-SHORT SUMMARY ---
 New language model developed by OpenAI uses transformer networks and massive computational resources . System can now engage in multi-turn conversations, remember context from earlier in the conversation, and even admit when it doesn't know something . Technology raises important ethical

Summary length: 41 words
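
The ultra-short summary stops at "Technology raises important ethical" because generation is cut off once max_length tokens have been produced. One possible post-processing step, sketched below, drops the unfinished trailing sentence. `trim_to_last_sentence` is a hypothetical helper; it treats any '.', '!' or '?' as a sentence end, so abbreviations such as "Dr." would fool it.

```python
def trim_to_last_sentence(summary: str) -> str:
    """Drop any trailing fragment that follows the last sentence terminator."""
    text = summary.strip()
    # rfind returns -1 when the character is absent, so cut is -1
    # only if the text contains no terminator at all.
    cut = max(text.rfind(ch) for ch in ".!?")
    if cut == -1:
        return text  # no complete sentence found; leave the text as it is
    return text[: cut + 1]


truncated = (
    " System can now engage in multi-turn conversations . "
    "Technology raises important ethical"
)
print(trim_to_last_sentence(truncated))
```

Applied to summary_short[0]['summary_text'], this would keep the first two sentences and discard the fragment.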


In [13]:
print("Creating a DETAILED summary...")
summary_long = summarizer(article_tech, max_length=200, min_length=100)
print(f"\n--- DETAILED SUMMARY ---")
print(summary_long[0]['summary_text'])
print(f"\nSummary length: {len(summary_long[0]['summary_text'].split())} words")


Creating a DETAILED summary...

--- DETAILED SUMMARY ---
 New language model developed by OpenAI uses transformer networks and massive computational resources . System can now engage in multi-turn conversations, remember context from earlier in the conversation, and even admit when it doesn't know something . Technology raises important ethical questions about the future of work, the spread of misinformation, and the need for robust safety measures . Microsoft, a major investor in OpenAI, plans to integrate the model into its suite of productivity tools . Meanwhile, competitors like Google and Meta are racing to develop their own comparable systems .

Summary length: 92 words
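
The detailed summary has 92 words even though min_length=100, because both limits count generated tokens rather than words. To compare runs on a word basis, a small report like the sketch below may help. `length_report` is a hypothetical helper, and the whitespace-based word count matches the len(text.split()) measure used in the cells above.

```python
def length_report(original: str, summary: str) -> dict:
    """Compare whitespace-delimited word counts of a text and its summary."""
    original_words = len(original.split())
    summary_words = len(summary.split())
    # Guard against an empty original to avoid dividing by zero.
    ratio = summary_words / original_words if original_words else 0.0
    return {
        "original_words": original_words,
        "summary_words": summary_words,
        "ratio": round(ratio, 3),
    }


# Stand-in strings with the word counts reported above (251 and 92).
report = length_report("word " * 251, "word " * 92)
print(report)
```

In the notebook this could be called as length_report(article_tech, summary_long[0]['summary_text']).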


In [14]:
article_science = """
Scientists at CERN have made a groundbreaking discovery that could revolutionize
our understanding of particle physics. Using the Large Hadron Collider, researchers
detected unusual quantum behavior in subatomic particles that defies current
theoretical models. The experiment involved colliding protons at nearly the speed
of light and analyzing the resulting particle decay patterns.

The data suggests the existence of a previously unknown force that operates at
extremely small scales. This fifth fundamental force could help explain dark matter,
the mysterious substance that makes up 85% of the universe's mass. Lead physicist
Dr. Elena Martinez stated that the findings have been replicated multiple times
and peer-reviewed by independent teams around the world.

If confirmed, this discovery would be the most significant in physics since the
detection of the Higgs boson in 2012. It could lead to new technologies we can
barely imagine today, from quantum computers to revolutionary energy sources.
However, more experiments are needed to rule out alternative explanations and
validate the extraordinary claims.

The scientific community is cautiously optimistic. While some researchers celebrate
the potential paradigm shift, others urge patience and rigorous testing. The next
phase of research will involve building more sensitive detectors and conducting
experiments at even higher energy levels. Funding agencies have already committed
hundreds of millions of dollars to support this work.
"""

print(f"Science article loaded: {len(article_science.split())} words")

Science article loaded: 216 words


In [15]:
print("Summarizing science article...")
summary_science = summarizer(article_science, max_length=130, min_length=30)
print(f"\n--- SCIENCE ARTICLE SUMMARY ---")
print(summary_science[0]['summary_text'])


Summarizing science article...

--- SCIENCE ARTICLE SUMMARY ---
 Researchers at CERN have made a groundbreaking discovery that could revolutionize understanding of particle physics . Using the Large Hadron Collider, researchers detected unusual quantum behavior in subatomic particles that defies current theoretical models . The data suggests the existence of a previously unknown force that operates at extremely small scales .


In [16]:
def compare_summaries(article, title):
    """Print short, medium, and long summaries of the same article.

    max_length and min_length count generated tokens, not words.
    """
    print(f"\n{'='*60}")
    print(f"ARTICLE: {title}")
    print(f"{'='*60}")
    print(f"Original length: {len(article.split())} words\n")

    # Short
    short = summarizer(article, max_length=50, min_length=20)
    print(f"SHORT (max 50 tokens):\n{short[0]['summary_text']}\n")

    # Medium
    medium = summarizer(article, max_length=100, min_length=50)
    print(f"MEDIUM (max 100 tokens):\n{medium[0]['summary_text']}\n")

    # Long
    long = summarizer(article, max_length=150, min_length=80)
    print(f"LONG (max 150 tokens):\n{long[0]['summary_text']}\n")

In [17]:
compare_summaries(article_tech, "Tech News: AI Breakthrough")


ARTICLE: Tech News: AI Breakthrough
Original length: 251 words

SHORT (max 50 tokens):
 New language model developed by OpenAI uses transformer networks and massive computational resources . System can now engage in multi-turn conversations, remember context from earlier in the conversation, and even admit when it doesn't know something . Technology raises important ethical

MEDIUM (max 100 tokens):
 New language model developed by OpenAI uses transformer networks and massive computational resources . System can now engage in multi-turn conversations, remember context from earlier in the conversation, and even admit when it doesn't know something . Technology raises important ethical questions about the future of work, the spread of misinformation, and the need for robust safety measures .

LONG (max 150 tokens):
 New language model developed by OpenAI uses transformer networks and massive computational resources . System can now engage in multi-turn conversations, remember context from 
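
compare_summaries prints its results, which makes them hard to reuse. A variant that returns the summaries keyed by preset name might look like the sketch below. `LENGTH_PRESETS`, `summarize_at_lengths`, and `fake_summarizer` are hypothetical names; the stand-in summarizer only exists so the sketch runs without downloading a model.

```python
from typing import Callable, Dict, List, Optional, Tuple

# (min_length, max_length) pairs copied from compare_summaries above.
LENGTH_PRESETS: Dict[str, Tuple[int, int]] = {
    "short": (20, 50),
    "medium": (50, 100),
    "long": (80, 150),
}


def summarize_at_lengths(
    summarizer: Callable[..., List[dict]],
    article: str,
    presets: Optional[Dict[str, Tuple[int, int]]] = None,
) -> Dict[str, str]:
    """Run the summarizer once per preset and return the texts by name."""
    presets = presets or LENGTH_PRESETS
    results = {}
    for name, (min_len, max_len) in presets.items():
        output = summarizer(article, max_length=max_len, min_length=min_len)
        results[name] = output[0]["summary_text"].strip()
    return results


def fake_summarizer(text, max_length, min_length):
    """Stand-in for the pipeline: keeps the first max_length words."""
    return [{"summary_text": " ".join(text.split()[:max_length])}]


demo = summarize_at_lengths(fake_summarizer, "one two three four five " * 40)
for name, text in demo.items():
    print(name, len(text.split()))
```

With the real pipeline, the call would be summarize_at_lengths(summarizer, article_tech).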

In [18]:
your_article = """
Paste your own news article here! It can be from any news website.
The longer the article, the better the summarization will work.
Aim for at least 200-300 words for best results.

This is a placeholder text that you should replace with real content.
Go to any news website (BBC, CNN, TechCrunch, etc.) and copy an article
you want to summarize.
"""

print("Ready to summarize your custom article!")
print(f"Current article length: {len(your_article.split())} words")


Ready to summarize your custom article!
Current article length: 61 words


In [19]:
print("\n🎯 Summarizing YOUR article...")
your_summary = summarizer(your_article, max_length=130, min_length=30)
print(f"\n--- YOUR ARTICLE SUMMARY ---")
print(your_summary[0]['summary_text'])

Your max_length is set to 130, but your input_length is only 89. Since this is a summarization task, where outputs shorter than the input are typically wanted, you might consider decreasing max_length manually, e.g. summarizer('...', max_length=44)



🎯 Summarizing YOUR article...

--- YOUR ARTICLE SUMMARY ---
 The longer the article, the better the summarization will work . Aim for at least 200-300 words for best results . This is a placeholder text that you should replace with real content .
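
The placeholder's advice that longer articles work better holds only up to the model's input limit, which for BART-based checkpoints such as this one is, to my understanding, 1,024 tokens. A rough sketch for splitting a long article into pieces that stay under that limit is shown below. `chunk_by_words` is a hypothetical helper, and the 400-word default is an assumed conservative proxy for the token limit, not a measured value.

```python
from typing import List


def chunk_by_words(text: str, max_words: int = 400) -> List[str]:
    """Split text into consecutive chunks of at most max_words words."""
    if max_words <= 0:
        raise ValueError("max_words must be positive")
    words = text.split()
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]


# A 1000-word stand-in for a long article.
long_article = "word " * 1000
chunks = chunk_by_words(long_article)
print([len(chunk.split()) for chunk in chunks])
```

Each chunk could then be summarized separately and the partial summaries joined. Splitting on sentence or paragraph boundaries would likely give cleaner results than a fixed word count.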


In [20]:
print("📚 Batch processing multiple articles...\n")

articles_batch = [
    "Climate change continues to impact global weather patterns. Scientists warn of increasing temperatures.",
    "The stock market reached new highs today as investors remain optimistic about economic recovery.",
    "A new study shows that exercise can significantly improve mental health and reduce anxiety symptoms."
]

for i, article in enumerate(articles_batch, 1):
    summary = summarizer(article, max_length=30, min_length=10)
    print(f"Article {i}: {summary[0]['summary_text']}")
    print()


Your max_length is set to 30, but your input_length is only 17. Since this is a summarization task, where outputs shorter than the input are typically wanted, you might consider decreasing max_length manually, e.g. summarizer('...', max_length=8)


📚 Batch processing multiple articles...



Your max_length is set to 30, but your input_length is only 17. Since this is a summarization task, where outputs shorter than the input are typically wanted, you might consider decreasing max_length manually, e.g. summarizer('...', max_length=8)


Article 1:  Climate change continues to impact global weather patterns . Scientists warn of increasing temperatures .



Your max_length is set to 30, but your input_length is only 18. Since this is a summarization task, where outputs shorter than the input are typically wanted, you might consider decreasing max_length manually, e.g. summarizer('...', max_length=9)


Article 2:  The stock market reached new highs today as investors remain optimistic about economic recovery .

Article 3:  A new study shows that exercise can significantly improve mental health and reduce anxiety symptoms . Exercise can significantly reduce anxiety and mental health, according to
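
The loop above calls the model once per article. The transformers pipeline also accepts a list of strings and returns one result dict per input, so the three articles could be passed in a single call. The wrapper below is a sketch built on that behavior; `summarize_many` and `fake_summarizer` are hypothetical names, and the stand-in summarizer only imitates the pipeline's return shape.

```python
from typing import Callable, List


def summarize_many(
    summarizer: Callable[..., List[dict]],
    articles: List[str],
    **generate_kwargs,
) -> List[str]:
    """Summarize a list of texts in one call and return plain strings."""
    outputs = summarizer(articles, **generate_kwargs)
    return [item["summary_text"].strip() for item in outputs]


def fake_summarizer(texts, **kwargs):
    """Stand-in for the pipeline: keeps the first sentence of each text."""
    return [{"summary_text": " " + t.split(".")[0] + " ."} for t in texts]


summaries = summarize_many(
    fake_summarizer,
    ["Markets rose today. Analysts were surprised.", "Rain is expected."],
    max_length=30,
    min_length=10,
)
for i, text in enumerate(summaries, 1):
    print(f"Article {i}: {text}")
```

With the real pipeline the call would be summarize_many(summarizer, articles_batch, max_length=30, min_length=10). Passing a batch_size argument as well may improve throughput on a GPU.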



In [21]:
import time

print("⏱️ Testing summarization speed...\n")

test_articles = [article_tech, article_science]

for i, article in enumerate(test_articles, 1):
    start_time = time.perf_counter()  # monotonic clock, better suited to timing than time.time()
    summary = summarizer(article, max_length=130, min_length=30)
    end_time = time.perf_counter()

    print(f"Article {i}:")
    print(f"  - Words: {len(article.split())}")
    print(f"  - Time: {end_time - start_time:.2f} seconds")
    print(f"  - Summary: {summary[0]['summary_text'][:100]}...")
    print()

print("✓ All cells completed! Your TL;DR News Summarizer is ready to use.")

⏱️ Testing summarization speed...

Article 1:
  - Words: 251
  - Time: 9.41 seconds
  - Summary:  New language model developed by OpenAI uses transformer networks and massive computational resource...

Article 2:
  - Words: 216
  - Time: 8.24 seconds
  - Summary:  Researchers at CERN have made a groundbreaking discovery that could revolutionize understanding of ...

✓ All cells completed! Your TL;DR News Summarizer is ready to use.
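
The timings above come from a single run per article, so they include any one-off overhead and run-to-run noise. A slightly more robust measurement, sketched below, performs a warm-up call and averages several runs. `time_call` and its `repeats` and `warmup` parameters are hypothetical, and the workload here is a cheap stand-in for the summarizer.

```python
import time
from statistics import mean
from typing import Callable, List


def time_call(fn: Callable[[], object], repeats: int = 3, warmup: int = 1) -> dict:
    """Time a zero-argument callable over several runs after a warm-up."""
    for _ in range(warmup):
        fn()  # warm-up runs are executed but not timed
    durations: List[float] = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        durations.append(time.perf_counter() - start)
    return {"mean_s": mean(durations), "min_s": min(durations), "runs": repeats}


stats = time_call(lambda: sum(range(100_000)))
print(f"mean {stats['mean_s']:.4f}s, best {stats['min_s']:.4f}s over {stats['runs']} runs")
```

For the notebook's model, the callable could be lambda: summarizer(article_tech, max_length=130, min_length=30).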
