# StopWords

In [1]:
paragraph = """
Dr. APJ Abdul Kalam’s life story reads like a masterclass in turning adversity into opportunity. Born in 1931 in the pilgrim town of Rameswaram, his early life was marked by financial hardship, but also by the values of harmony and dedication that would shape his future.

As a young boy, Kalam would wake up early to study mathematics, and then distribute newspapers to help his family. His father, a boat owner and imam, and his mother, a housewife, instilled in him the values of honesty, hard work, and spiritual wealth over material success.

His scientific career began at the Aeronautical Development Establishment of DRDO. However, it was at ISRO that he truly found his calling. Under the mentorship of Vikram Sarabhai, Kalam led the development of India’s first satellite launch vehicle, SLV-III, which successfully deployed the Rohini satellite in 1980.

But perhaps his most significant contribution came through the Integrated Guided Missile Development Programme. Under his leadership, India developed strategic missiles like Agni and Prithvi, establishing itself as a military power. Yet, Kalam’s vision went beyond defense – he consistently advocated for India’s self-reliance in critical technologies.
"""

In [2]:
print(paragraph)


Dr. APJ Abdul Kalam’s life story reads like a masterclass in turning adversity into opportunity. Born in 1931 in the pilgrim town of Rameswaram, his early life was marked by financial hardship, but also by the values of harmony and dedication that would shape his future.

As a young boy, Kalam would wake up early to study mathematics, and then distribute newspapers to help his family. His father, a boat owner and imam, and his mother, a housewife, instilled in him the values of honesty, hard work, and spiritual wealth over material success.

His scientific career began at the Aeronautical Development Establishment of DRDO. However, it was at ISRO that he truly found his calling. Under the mentorship of Vikram Sarabhai, Kalam led the development of India’s first satellite launch vehicle, SLV-III, which successfully deployed the Rohini satellite in 1980.

But perhaps his most significant contribution came through the Integrated Guided Missile Development Programme. Under his leadership,

In [3]:
from nltk.stem import PorterStemmer
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize

In [4]:
import nltk
nltk.download('stopwords')

[nltk_data] Downloading package stopwords to /root/nltk_data...
[nltk_data]   Unzipping corpora/stopwords.zip.


True

In [5]:
stopwords.words('english')[:10]

['a', 'about', 'above', 'after', 'again', 'against', 'ain', 'all', 'am', 'an']

In [6]:
stopwords.words('french')[:10]

['au', 'aux', 'avec', 'ce', 'ces', 'dans', 'de', 'des', 'du', 'elle']

In [7]:
stopwords.words('arabic')[:10]

['إذ', 'إذا', 'إذما', 'إذن', 'أف', 'أقل', 'أكثر', 'ألا', 'إلا', 'التي']
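The English entries shown above are all lowercase, so a plain membership test will miss capitalized tokens such as "His". A minimal sketch of a case-insensitive filter follows. `SAMPLE_STOP_WORDS` is a small hand-picked stand-in for `stopwords.words('english')` (an assumption, so the snippet runs without downloading any corpus), and `remove_stop_words` is a hypothetical helper name.

```python
# Hand-picked stand-in for set(stopwords.words('english')); not the real NLTK list.
SAMPLE_STOP_WORDS = {"his", "a", "and", "the", "in", "of", "him"}


def remove_stop_words(tokens, stop_words, ignore_case=True):
    """Return the tokens that are not stop words, preserving order."""
    kept = []
    for token in tokens:
        key = token.lower() if ignore_case else token
        if key not in stop_words:
            kept.append(token)
    return kept


tokens = "His father , a boat owner and imam".split()

# Case-sensitive check: "His" is not in the lowercase set, so it is kept.
case_sensitive = remove_stop_words(tokens, SAMPLE_STOP_WORDS, ignore_case=False)

# Case-insensitive check: "His" is lowercased before the lookup and removed.
case_insensitive = remove_stop_words(tokens, SAMPLE_STOP_WORDS)

print(case_sensitive)
print(case_insensitive)
```

Lowercasing only the lookup key keeps the original casing of the surviving tokens, which can matter if a later step relies on capitalization.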

In [8]:
stemmer = PorterStemmer()

In [9]:
import nltk
nltk.download('punkt_tab')

[nltk_data] Downloading package punkt_tab to /root/nltk_data...
[nltk_data]   Unzipping tokenizers/punkt_tab.zip.


True

In [10]:
documents = nltk.sent_tokenize(paragraph)

In [12]:
type(documents)

list

In [11]:
for sentence in documents:
    print(f"{sentence}\n")


Dr. APJ Abdul Kalam’s life story reads like a masterclass in turning adversity into opportunity.

Born in 1931 in the pilgrim town of Rameswaram, his early life was marked by financial hardship, but also by the values of harmony and dedication that would shape his future.

As a young boy, Kalam would wake up early to study mathematics, and then distribute newspapers to help his family.

His father, a boat owner and imam, and his mother, a housewife, instilled in him the values of honesty, hard work, and spiritual wealth over material success.

His scientific career began at the Aeronautical Development Establishment of DRDO.

However, it was at ISRO that he truly found his calling.

Under the mentorship of Vikram Sarabhai, Kalam led the development of India’s first satellite launch vehicle, SLV-III, which successfully deployed the Rohini satellite in 1980.

But perhaps his most significant contribution came through the Integrated Guided Missile Development Programme.

Under his leader

## Remove stopwords, then apply PorterStemmer

In [36]:
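stop_words = set(stopwords.words('english'))  # build the set once, not once per token

# Note: documents[i] is overwritten in place, so re-running this cell stems already-stemmed text.
for i in range(len(documents)):
    words = nltk.word_tokenize(documents[i])
    # The stopword check is case-sensitive: capitalized tokens such as "His" are not filtered.
    new_words = [stemmer.stem(word) for word in words if word not in stop_words]
    documents[i] = ' '.join(new_words)  # join the remaining stems back into one string

# Token counts for the last sentence, after and before filtering
len(new_words), len(words)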

(17, 17)

In [37]:
for sentence in documents:
    print(f"{sentence}\n")

dr. apj abdul kalam ’ life stori read like masterclass turn adver opportun .

bear 1931 pilgrim town rameswaram , ear life mark financ hardship , also valu harmoni dedic would shape futur .

young boy , kalam would wake ear studi mathemat , distribut newspap help famili .

hi father , boat owner imam , mother , housewif , instil valu honesti , hard work , spiritu wealth materi success .

hi scientif career begin aeronaut develop establish drdo .

howev , isro truli find call .

mentorship vikram sarabhai , kalam lead develop india ’ first satellit launch vehicl , slv-iii , success deploy rohini satellit 1980 .

perhap signif contribut come integr guid missil develop programm .

leadership , india develop strateg missil like agni prithvi , establish militari power .

yet , kalam ’ vision go beyond defen – consist advoc india ’ self-r critic technolog .
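
Because the loop writes its results back into `documents`, any cell that runs afterwards starts from already-processed text. One way to avoid this is to leave the raw sentences untouched and build a new list per normalizer. The sketch below is an illustration under stated assumptions, not the notebook's method: `preprocess` is a hypothetical helper, tokens are split on whitespace instead of using `nltk.word_tokenize`, and `toy_stem` is a deliberately crude stand-in for `stemmer.stem`.

```python
def preprocess(sentences, normalize, stop_words):
    """Return new strings with stop words dropped and `normalize` applied per token.

    The input list is never modified, so each normalizer can start from raw text.
    """
    processed = []
    for sentence in sentences:
        tokens = sentence.split()  # whitespace split; a stand-in for a real tokenizer
        kept = [normalize(t) for t in tokens if t.lower() not in stop_words]
        processed.append(" ".join(kept))
    return processed


def toy_stem(word):
    """Crude illustrative stemmer: lowercase and strip one trailing 's'."""
    word = word.lower()
    return word[:-1] if word.endswith("s") and len(word) > 3 else word


raw_sentences = ["His missiles were strategic", "The values of harmony"]
stop_words = {"his", "were", "the", "of"}  # hand-picked, not the NLTK list

stemmed = preprocess(raw_sentences, toy_stem, stop_words)
lowered = preprocess(raw_sentences, str.lower, stop_words)

print(raw_sentences)  # the raw input is unchanged
print(stemmed)
print(lowered)
```

With NLTK available, `stemmer.stem` or `snowballstemmer.stem` could be passed as `normalize` in the same way, each producing its own list from the same raw sentences.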



## Remove stopwords, then apply SnowballStemmer

In [22]:
from nltk.stem import SnowballStemmer
snowballstemmer = SnowballStemmer('english')

In [38]:
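stop_words = set(stopwords.words('english'))  # build the set once, not once per token

# Note: documents[i] is overwritten in place; if an earlier cell already processed it,
# this loop stems previously stemmed text.
for i in range(len(documents)):
    words = nltk.word_tokenize(documents[i])
    new_words = [snowballstemmer.stem(word) for word in words if word not in stop_words]
    documents[i] = ' '.join(new_words)  # join the remaining stems back into one string

# Token counts for the last sentence, after and before filtering
len(new_words), len(words)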

(17, 17)

In [39]:
documents

['dr. apj abdul kalam ’ life stori read like masterclass turn adver opportun .',
 'bear 1931 pilgrim town rameswaram , ear life mark financ hardship , also valu harmoni dedic would shape futur .',
 'young boy , kalam would wake ear studi mathemat , distribut newspap help famili .',
 'hi father , boat owner imam , mother , housewif , instil valu honesti , hard work , spiritu wealth materi success .',
 'hi scientif career begin aeronaut develop establish drdo .',
 'howev , isro truli find call .',
 'mentorship vikram sarabhai , kalam lead develop india ’ first satellit launch vehicl , slv-iii , success deploy rohini satellit 1980 .',
 'perhap signif contribut come integr guid missil develop programm .',
 'leadership , india develop strateg missil like agni prithvi , establish militari power .',
 'yet , kalam ’ vision go beyond defen – consist advoc india ’ self-r critic technolog .']

## Remove stopwords, then apply lemmatization

In [25]:
from nltk.stem import WordNetLemmatizer

In [29]:
import nltk

nltk.download('wordnet')

[nltk_data] Downloading package wordnet to /root/nltk_data...


True

In [26]:
lemmatizer = WordNetLemmatizer()

In [47]:
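stop_words = set(stopwords.words('english'))  # build the set once, not once per token

for i in range(len(documents)):
    words = nltk.word_tokenize(documents[i])
    # pos='v' treats every token as a verb; forms not found in WordNet are returned unchanged
    new_words = [lemmatizer.lemmatize(word.lower(), pos='v') for word in words if word not in stop_words]
    documents[i] = ' '.join(new_words)  # join the lemmas back into one string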


In [49]:
documents

['dr. apj abdul kalam ’ life stori read like masterclass turn adver opportun .',
 'bear 1931 pilgrim town rameswaram , ear life mark financ hardship , also valu harmoni dedic would shape futur .',
 'young boy , kalam would wake ear studi mathemat , distribut newspap help famili .',
 'hi father , boat owner imam , mother , housewif , instil valu honesti , hard work , spiritu wealth materi success .',
 'hi scientif career begin aeronaut develop establish drdo .',
 'howev , isro truli find call .',
 'mentorship vikram sarabhai , kalam lead develop india ’ first satellit launch vehicl , slv-iii , success deploy rohini satellit 1980 .',
 'perhap signif contribut come integr guid missil develop programm .',
 'leadership , india develop strateg missil like agni prithvi , establish militari power .',
 'yet , kalam ’ vision go beyond defen – consist advoc india ’ self-r critic technolog .']

In [35]:
for sentence in documents:
    print(f"{sentence}\n")

dr. apj abdul kalam ’ life stori read like masterclass turn adver opportun .

bear 1931 pilgrim town rameswaram , ear life mark financ hardship , also valu harmoni dedic would shape futur .

young boy , kalam would wake ear studi mathemat , distribut newspap help famili .

hi father , boat owner imam , mother , housewif , instil valu honesti , hard work , spiritu wealth materi success .

hi scientif career begin aeronaut develop establish drdo .

howev , isro truli find call .

mentorship vikram sarabhai , kalam lead develop india ’ first satellit launch vehicl , slv-iii , success deploy rohini satellit 1980 .

perhap signif contribut come integr guid missil develop programm .

leadership , india develop strateg missil like agni prithvi , establish militari power .

yet , kalam ’ vision go beyond defen – consist advoc india ’ self-r critic technolog .
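
A dictionary-based lemmatizer can only map forms it knows, and `WordNetLemmatizer.lemmatize` returns the input unchanged when no lemma is found. That is consistent with stems such as `stori` passing through the lemmatization step untouched. The sketch below imitates this fallback with a tiny hand-written table; `VERB_LEMMAS` and `toy_lemmatize` are hypothetical stand-ins, not WordNet data.

```python
# Hand-written table of irregular verb forms; a stand-in for a real lexicon.
VERB_LEMMAS = {"born": "bear", "began": "begin", "found": "find", "led": "lead"}


def toy_lemmatize(word):
    """Look the lowercased word up; fall back to the lowercased word if unknown."""
    key = word.lower()
    return VERB_LEMMAS.get(key, key)


known_form = toy_lemmatize("Found")    # present in the table, mapped to its lemma
unknown_stem = toy_lemmatize("stori")  # a stem, not a dictionary word: returned as-is

print(known_form)
print(unknown_stem)
```

This suggests running the lemmatizer on the raw tokens rather than on stemmer output, since stems are often not dictionary words.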

