In [1]:
# Part of Niels Bohr's speech on receiving the Nobel Prize
speech="""
Your Royal Highnesses, Ladies and Gentlemen.

In attempting to give expression to my deep and heartfelt gratitude for the great honour that the Royal Swedish Academy of Sciences has bestowed upon me by awarding me the Nobel Prize for Physics for this year, I am naturally forcibly reminded of Alfred Nobel’s insistence upon the international character of science, which indeed forms the very basis of his most munificent bequest.

That point of view – the international character of science – suggests itself all the more readily to myself, as the contributions that I may have had the good fortune to make to the development of physical science consist in a combination of the results arrived at by a number of fellow-investigators, belonging to a variety of nations, on the basis of study carried on under widely differing scientific traditions.

The grand discoveries which scientific experiment yielded at and about the turn of the century, in which investigators in many countries took an eminent part and which were destined all unexpectedly to give us a fresh insight into the structure of atoms, were due in the first instance, as all are aware, to the work of the great investigators of the English school, Sir Joseph Thomson and Sir Ernest Rutherford, who have inscribed their names on the tablets of the history of scientific research as distinguished witnesses to the truth that imagination and acumen are capable of penetrating the crowded mass of registered experience and of revealing Nature’s simplicity to our gaze.

On the other hand, abstract thinking, which throughout the ages has been one of the most powerful of man’s aids in lifting the veil that shrouds the laws of Nature from the eyes of the uninitiated observer, has proved of the utmost importance for enabling the insight into the structure of atoms so obtained to be applied practically in elucidating the properties of those elements that are immediately accessible to our perceptions.

To this branch of the work too, men of many nations have made important contributions; but it was the great German investigators, Planck and Einstein, who, as a result of their systematic abstract investigations, were to show us for the first time that the laws of the movements of atomic particles, which determine the properties of the elements, are of an essentially different character from those laws by the aid of which science has hitherto sought to bring order into the mass of our observations of natural phenomena.

"""

In [2]:
len(speech)

2496

In [3]:
import spacy
from spacy.lang.en.stop_words import STOP_WORDS
from string import punctuation

In [4]:
nlp = spacy.load('en_core_web_sm')

In [5]:
doc = nlp(speech)

In [6]:
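# Lowercased tokens, excluding stop words, punctuation, and single newlines.
# Note: only a lone '\n' is excluded; '\n\n' paragraph breaks still pass this filter.
tokens = [token.text.lower() for token in doc
          if not token.is_stop and
          not token.is_punct and
          token.text != '\n']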

In [7]:
tokens

['royal',
 'highnesses',
 'ladies',
 'gentlemen',
 '\n\n',
 'attempting',
 'expression',
 'deep',
 'heartfelt',
 'gratitude',
 'great',
 'honour',
 'royal',
 'swedish',
 'academy',
 'sciences',
 'bestowed',
 'awarding',
 'nobel',
 'prize',
 'physics',
 'year',
 'naturally',
 'forcibly',
 'reminded',
 'alfred',
 'nobel',
 'insistence',
 'international',
 'character',
 'science',
 'forms',
 'basis',
 'munificent',
 'bequest',
 '\n\n',
 'point',
 'view',
 'international',
 'character',
 'science',
 'suggests',
 'readily',
 'contributions',
 'good',
 'fortune',
 'development',
 'physical',
 'science',
 'consist',
 'combination',
 'results',
 'arrived',
 'number',
 'fellow',
 'investigators',
 'belonging',
 'variety',
 'nations',
 'basis',
 'study',
 'carried',
 'widely',
 'differing',
 'scientific',
 'traditions',
 '\n\n',
 'grand',
 'discoveries',
 'scientific',
 'experiment',
 'yielded',
 'turn',
 'century',
 'investigators',
 'countries',
 'took',
 'eminent',
 'destined',
 'unexpectedly

In [8]:
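# Alternative filter: keep only content words, selected by part-of-speech tag.
tokens1 = []
stopwords = list(STOP_WORDS)
allowed_pos = ['ADJ', 'PROPN', 'VERB', 'NOUN']
for token in doc:
    # Skip stop words (case-sensitive match) and punctuation characters.
    if token.text in stopwords or token.text in punctuation:
        continue
    if token.pos_ in allowed_pos:
        tokens1.append(token.text)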

In [9]:
tokens1

['Royal',
 'Highnesses',
 'Ladies',
 'Gentlemen',
 'attempting',
 'expression',
 'deep',
 'heartfelt',
 'gratitude',
 'great',
 'honour',
 'Royal',
 'Swedish',
 'Academy',
 'Sciences',
 'bestowed',
 'awarding',
 'Nobel',
 'Prize',
 'Physics',
 'year',
 'reminded',
 'Alfred',
 'Nobel',
 'insistence',
 'international',
 'character',
 'science',
 'forms',
 'basis',
 'munificent',
 'bequest',
 'point',
 'view',
 'international',
 'character',
 'science',
 'suggests',
 'contributions',
 'good',
 'fortune',
 'development',
 'physical',
 'science',
 'consist',
 'combination',
 'results',
 'arrived',
 'number',
 'fellow',
 'investigators',
 'belonging',
 'variety',
 'nations',
 'basis',
 'study',
 'carried',
 'differing',
 'scientific',
 'traditions',
 'grand',
 'discoveries',
 'scientific',
 'experiment',
 'yielded',
 'turn',
 'century',
 'investigators',
 'countries',
 'took',
 'eminent',
 'destined',
 'fresh',
 'insight',
 'structure',
 'atoms',
 'instance',
 'aware',
 'work',
 'great',
 'i
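
Two details of the filter in cell 8 are easy to miss: `token.text in stopwords` is a case-sensitive lookup in a list, and `token.text in punctuation` is a substring test against a string rather than a membership test. A minimal sketch of the same filter with sets and a lowercased stop-word check follows. The names `toy_stop_words` and `toy_tagged` are hypothetical stand-ins for `STOP_WORDS` and the spaCy tokens, so that the example runs without a model.

```python
from string import punctuation

# Hypothetical stand-ins: a tiny stop-word set and (text, pos) pairs instead of spaCy tokens.
toy_stop_words = {'the', 'of', 'in'}
toy_tagged = [('In', 'ADP'), ('Science', 'NOUN'), (',', 'PUNCT'), ('the', 'DET'), ('great', 'ADJ')]

punct_set = set(punctuation)                    # exact single-character matches only
allowed_pos = {'ADJ', 'PROPN', 'VERB', 'NOUN'}  # a set gives constant-time lookup

kept = [
    text for text, pos in toy_tagged
    if text.lower() not in toy_stop_words       # case-insensitive stop-word check
    and text not in punct_set
    and pos in allowed_pos
]
print(kept)
```

With real spaCy tokens, `token.is_stop` and `token.is_punct` (already used in cell 6) cover the first two conditions.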

In [10]:
from collections import Counter

In [11]:
word_freq = Counter(tokens)

In [12]:
word_freq

Counter({'royal': 2,
         'highnesses': 1,
         'ladies': 1,
         'gentlemen': 1,
         '\n\n': 6,
         'attempting': 1,
         'expression': 1,
         'deep': 1,
         'heartfelt': 1,
         'gratitude': 1,
         'great': 3,
         'honour': 1,
         'swedish': 1,
         'academy': 1,
         'sciences': 1,
         'bestowed': 1,
         'awarding': 1,
         'nobel': 2,
         'prize': 1,
         'physics': 1,
         'year': 1,
         'naturally': 1,
         'forcibly': 1,
         'reminded': 1,
         'alfred': 1,
         'insistence': 1,
         'international': 2,
         'character': 3,
         'science': 4,
         'forms': 1,
         'basis': 2,
         'munificent': 1,
         'bequest': 1,
         'point': 1,
         'view': 1,
         'suggests': 1,
         'readily': 1,
         'contributions': 2,
         'good': 1,
         'fortune': 1,
         'development': 1,
         'physical': 1,
         'consist'

In [13]:
max_freq = max(word_freq.values())

In [14]:
max_freq

6
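
The maximum of 6 belongs to the `'\n\n'` paragraph-break token, which the filter in cell 6 lets through because it only compares against a single `'\n'`. Every real word is therefore divided by the count of a whitespace token. A minimal sketch of one way to normalise over real words only, using a small hypothetical `sample_tokens` list in place of the full `tokens` output:

```python
from collections import Counter

# Hypothetical sample standing in for `tokens`; it includes whitespace-only entries.
sample_tokens = ['science', '\n\n', 'science', 'character', '\n\n', '\n\n', 'great']

# Keep only tokens that contain at least one non-whitespace character.
clean_tokens = [tok for tok in sample_tokens if tok.strip()]

clean_freq = Counter(clean_tokens)
clean_max = max(clean_freq.values())

# Build a new dict rather than overwriting the counts in place.
normalized_freq = {word: count / clean_max for word, count in clean_freq.items()}
print(normalized_freq)
```

In the spaCy comprehension itself, adding `not token.is_space` to the conditions would have the same effect.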

In [15]:
for word in word_freq.keys():
    word_freq[word] = word_freq[word]/max_freq

In [16]:
word_freq

Counter({'royal': 0.3333333333333333,
         'highnesses': 0.16666666666666666,
         'ladies': 0.16666666666666666,
         'gentlemen': 0.16666666666666666,
         '\n\n': 1.0,
         'attempting': 0.16666666666666666,
         'expression': 0.16666666666666666,
         'deep': 0.16666666666666666,
         'heartfelt': 0.16666666666666666,
         'gratitude': 0.16666666666666666,
         'great': 0.5,
         'honour': 0.16666666666666666,
         'swedish': 0.16666666666666666,
         'academy': 0.16666666666666666,
         'sciences': 0.16666666666666666,
         'bestowed': 0.16666666666666666,
         'awarding': 0.16666666666666666,
         'nobel': 0.3333333333333333,
         'prize': 0.16666666666666666,
         'physics': 0.16666666666666666,
         'year': 0.16666666666666666,
         'naturally': 0.16666666666666666,
         'forcibly': 0.16666666666666666,
         'reminded': 0.16666666666666666,
         'alfred': 0.16666666666666666,
       

In [17]:
sent_token = [sent.text for sent in doc.sents]
print(sent_token)

['\nYour Royal Highnesses, Ladies and Gentlemen.\n\n', 'In attempting to give expression to my deep and heartfelt gratitude for the great honour that the Royal Swedish Academy of Sciences has bestowed upon me by awarding me the Nobel Prize for Physics for this year, I am naturally forcibly reminded of Alfred Nobel’s insistence upon the international character of science, which indeed forms the very basis of his most munificent bequest.\n\n', 'That point of view – the international character of science – suggests itself all the more readily to myself, as the contributions that I may have had the good fortune to make to the development of physical science consist in a combination of the results arrived at by a number of fellow-investigators, belonging to a variety of nations, on the basis of study carried on under widely differing scientific traditions.\n\n', 'The grand discoveries which scientific experiment yielded at and about the turn of the century, in which investigators in many co

### Extractive Text Summarization

In [18]:
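# Score each sentence by summing the normalised frequencies of its words.
# Note: membership is tested on word.lower(), but the lookup uses the original
# `word`; a Counter returns 0 for a missing key, so capitalised words add 0.
sent_score = {}
for sent in sent_token:
    for word in sent.split():
        if word.lower() in word_freq.keys():
            if sent not in sent_score.keys():
                sent_score[sent] = word_freq[word]
            else:
                sent_score[sent] += word_freq[word]
        print(word)  # debug output: every whitespace-separated word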

Your
Royal
Highnesses,
Ladies
and
Gentlemen.
In
attempting
to
give
expression
to
my
deep
and
heartfelt
gratitude
for
the
great
honour
that
the
Royal
Swedish
Academy
of
Sciences
has
bestowed
upon
me
by
awarding
me
the
Nobel
Prize
for
Physics
for
this
year,
I
am
naturally
forcibly
reminded
of
Alfred
Nobel’s
insistence
upon
the
international
character
of
science,
which
indeed
forms
the
very
basis
of
his
most
munificent
bequest.
That
point
of
view
–
the
international
character
of
science
–
suggests
itself
all
the
more
readily
to
myself,
as
the
contributions
that
I
may
have
had
the
good
fortune
to
make
to
the
development
of
physical
science
consist
in
a
combination
of
the
results
arrived
at
by
a
number
of
fellow-investigators,
belonging
to
a
variety
of
nations,
on
the
basis
of
study
carried
on
under
widely
differing
scientific
traditions.
The
grand
discoveries
which
scientific
experiment
yielded
at
and
about
the
turn
of
the
century,
in
which
investigators
in
many
countries
took
an
eminent
p

In [19]:
import pandas as pd
from heapq import nlargest

In [20]:
pd.DataFrame(list(sent_score.items()),columns=['Sentence','Score'])

                                            Sentence     Score
0  \nYour Royal Highnesses, Ladies and Gentlemen....  0.000000
1  In attempting to give expression to my deep an...  4.000000
2  That point of view – the international charact...  6.500000
3  The grand discoveries which scientific experim...  8.833333
4  On the other hand, abstract thinking, which th...  5.500000
5  To this branch of the work too, men of many na...  7.333333
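
The score of 0.0 for the first sentence follows from how cell 18 looks words up. Membership is tested with `word.lower()`, but the value is read with `word_freq[word]`, and a `Counter` returns 0 for a missing key, so capitalised words contribute nothing. Words produced by `str.split()` also keep trailing punctuation (`'Highnesses,'`), so they never match at all. Below is a sketch of a scoring helper that normalises each word before the lookup; `score_sentences` and `toy_freq` are illustrative names, not part of the notebook.

```python
from string import punctuation

def score_sentences(sentences, word_freq):
    """Sum normalised word frequencies per sentence (illustrative helper)."""
    # Curly quotes and en-dashes are not in string.punctuation, so add them explicitly.
    strip_chars = punctuation + '’‘“”–'
    scores = {}
    for sent in sentences:
        total = 0.0
        for raw in sent.split():
            # Strip punctuation from both ends, then lowercase to match the keys.
            word = raw.strip(strip_chars).lower()
            # dict.get treats unknown words as 0 and works for dict and Counter alike.
            total += word_freq.get(word, 0.0)
        scores[sent] = total
    return scores

toy_freq = {'royal': 0.5, 'highnesses': 0.25, 'ladies': 0.25, 'gentlemen': 0.25}
toy_sentence = 'Your Royal Highnesses, Ladies and Gentlemen.'
toy_scores = score_sentences([toy_sentence], toy_freq)
print(toy_scores)
```

Possessives such as `Nobel’s` would still miss the `nobel` key under this approach; scoring over spaCy tokens instead of `str.split()` output is one way around that.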


In [21]:
num_sentences = 3
# Pick the highest-scoring sentences and join them into the summary.
n = nlargest(num_sentences, sent_score, key=sent_score.get)
" ".join(n)

'The grand discoveries which scientific experiment yielded at and about the turn of the century, in which investigators in many countries took an eminent part and which were destined all unexpectedly to give us a fresh insight into the structure of atoms, were due in the first instance, as all are aware, to the work of the great investigators of the English school, Sir Joseph Thomson and Sir Ernest Rutherford, who have inscribed their names on the tablets of the history of scientific research as distinguished witnesses to the truth that imagination and acumen are capable of penetrating the crowded mass of registered experience and of revealing Nature’s simplicity to our gaze.\n\n To this branch of the work too, men of many nations have made important contributions; but it was the great German investigators, Planck and Einstein, who, as a result of their systematic abstract investigations, were to show us for the first time that the laws of the movements of atomic particles, which deter
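
`nlargest` returns the sentences ranked by score, so the summary above opens with the fourth sentence of the speech and only then moves to later and earlier ones. If reading order matters, the selected sentences can be re-sorted by their original position. A small sketch, using the scores from the table above with shortened hypothetical labels (`s0` to `s5`) in place of the full sentences:

```python
from heapq import nlargest

# Scores copied from the table above; the keys are shortened labels, not the full sentences.
toy_scores = {'s0': 0.0, 's1': 4.0, 's2': 6.5, 's3': 8.833333, 's4': 5.5, 's5': 7.333333}
order = list(toy_scores)  # dicts preserve insertion order, i.e. document order

top = nlargest(3, toy_scores, key=toy_scores.get)  # ranked by score
in_doc_order = sorted(top, key=order.index)        # restored to reading order
print(top)
print(in_doc_order)
```

Calling `.strip()` on each sentence before joining would also remove the `\n\n` left at the end of each one.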

### Abstractive Text Summarization

In [22]:
from transformers import pipeline

In [23]:
summarizer = pipeline("summarization", model='t5-base', tokenizer='t5-base', framework='pt')

In [27]:
summary = summarizer(speech,max_length=200,min_length=10,do_sample=False)

In [28]:
summary

[{'summary_text': 'the great investigators of the english school, Sir Joseph Thomson and Sir Ernest Rutherford, were destined all unexpectedly to give us a fresh insight into the structure of atoms . abstract thinking has been one of the most powerful of man’s aids in lifting the veil that shrouds the laws of Nature from the eyes of the uninitiated observer . it was the great german investigators, Planck and Einstein, who, as a result of their systematic abstract investigations, were to show'}]

In [29]:
print(summary[0]['summary_text'])

the great investigators of the english school, Sir Joseph Thomson and Sir Ernest Rutherford, were destined all unexpectedly to give us a fresh insight into the structure of atoms . abstract thinking has been one of the most powerful of man’s aids in lifting the veil that shrouds the laws of Nature from the eyes of the uninitiated observer . it was the great german investigators, Planck and Einstein, who, as a result of their systematic abstract investigations, were to show
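
The generated summary stops mid-clause ("were to show"), which may reflect a length limit on the model's input or output. One simple post-processing option is to drop the trailing fragment after the last full stop. The helper below is a hypothetical sketch, not part of the `transformers` API, and the sample string is shortened for illustration.

```python
def trim_to_last_sentence(text):
    """Drop a trailing fragment after the last full stop (illustrative helper)."""
    cut = text.rfind('.')
    # If there is no full stop at all, return the text unchanged.
    return text if cut == -1 else text[:cut + 1].strip()

sample = ('abstract thinking has been a powerful aid . '
          'it was the great german investigators who were to show')
print(trim_to_last_sentence(sample))
```

This is a blunt rule: it treats every full stop as a sentence end, so abbreviations such as "Dr." could cause it to cut too early.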
