# NATURAL LANGUAGE PROCESSING

Natural language processing (NLP) enables computers and digital devices to recognize, understand, and generate text and speech by combining computational linguistics (the rule-based modeling of human language) with statistical modeling, machine learning, and deep learning. [source](https://www.ibm.com/topics/natural-language-processing)

## TOKENIZATION

Tokenization breaks a document or body of text into small units called tokens, such as sentences, words, or punctuation marks. [source](https://www.coursera.org/articles/tokenization-nlp#:~:text=Tokenization%20is%20a%20term%20that,machine%20to%20process%20the%20text.)
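
As a rough illustration of the idea, independent of any library, the sketch below splits text into sentences and then into word and punctuation tokens using regular expressions. The function names `split_sentences` and `split_words` are hypothetical, and the rules are far cruder than NLTK's tokenizers used in the cells that follow (for example, the possessive `'s` comes out as two tokens here).

```python
import re

def split_sentences(text):
    # Assumption: a sentence ends at '.', '!' or '?' followed by whitespace.
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]

def split_words(sentence):
    # A token is either a run of word characters or a single punctuation mark.
    return re.findall(r"\w+|[^\w\s]", sentence)

text = ("Hello, Welcome to Dhruv's Tokenization NLP Tutorials. "
        "Please do watch the whole code to understand NLP.")
sentences = split_sentences(text)
tokens = split_words(sentences[0])
print(sentences)
print(tokens)
```

Real text contains abbreviations, decimals, and quotations that break such simple rules, which is one reason to prefer a trained tokenizer.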

In [None]:
!pip install nltk
import nltk



In [None]:
nltk.download('punkt')

[nltk_data] Downloading package punkt to /root/nltk_data...
[nltk_data]   Package punkt is already up-to-date!


True

In [None]:
corpus=""" Hello, Welcome to Dhruv's Tokenization NLP Tutorials.
Please do watch the whole code to understand NLP.
"""

In [None]:
print(corpus)

 Hello, Welcome to Dhruv's Tokenization NLP Tutorials.
Please do watch the whole code to understand NLP.



In [None]:
## Tokenization
## Paragraph --> sentences
from nltk.tokenize import sent_tokenize

In [None]:
documents = sent_tokenize(corpus)

In [None]:
## Tokenization
## Paragraph --> words
## Sentence --> words
from nltk.tokenize import word_tokenize

In [None]:
word_tokenize(corpus)

['Hello',
 ',',
 'Welcome',
 'to',
 'Dhruv',
 "'s",
 'Tokenization',
 'NLP',
 'Tutorials',
 '.',
 'Please',
 'do',
 'watch',
 'the',
 'whole',
 'code',
 'to',
 'understand',
 'NLP',
 '.']

In [None]:
for sentence in documents:
  print(word_tokenize(sentence))

['Hello', ',', 'Welcome', 'to', 'Dhruv', "'s", 'Tokenization', 'NLP', 'Tutorials', '.']
['Please', 'do', 'watch', 'the', 'whole', 'code', 'to', 'understand', 'NLP', '.']


# **STEMMING**

Stemming reduces the morphological variants of a word to a common root or base form (the stem), which need not be a valid dictionary word. Programs that do this are called stemming algorithms or stemmers. [source](https://www.geeksforgeeks.org/python-stemming-words-with-nltk/)

In [None]:
words = ["eating", "eat", "eaten", "writing", "writes", "programming", "programs", "history", "finally", "finalized"]

## PorterStemmer

In [None]:
from nltk.stem import PorterStemmer

In [None]:
stemming = PorterStemmer()

In [None]:
for word in words:
  print(word+"--->"+stemming.stem(word))

eating--->eat
eat--->eat
eaten--->eaten
writing--->write
writes--->write
programming--->program
programs--->program
history--->histori
finally--->final
finalized--->final


In [None]:
stemming.stem("congratulations")

'congratul'

## Regexp Stemmer

In [None]:
from nltk.stem import RegexpStemmer

In [None]:
reg_stemmer = RegexpStemmer('ing$|s$|e$|able$', min=4)

In [None]:
reg_stemmer.stem('eating')

'eat'

In [None]:
reg_stemmer.stem('ingeating')

'ingeat'
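
The two results above follow from the `$` anchor in the pattern: only a suffix at the very end of the word is removed, so the leading `ing` in `ingeating` survives. The sketch below reproduces that idea with the standard `re` module. The function `regexp_stem` and its parameter names are hypothetical, and this is an approximation of the behavior shown, not NLTK's own code.

```python
import re

def regexp_stem(word, pattern=r"ing$|s$|e$|able$", min_length=4):
    # Words shorter than min_length are returned unchanged.
    if len(word) < min_length:
        return word
    # Remove one suffix match; '$' anchors every alternative to the end of the word.
    return re.sub(pattern, "", word, count=1)

for w in ["eating", "ingeating", "eat", "readable"]:
    print(w, "--->", regexp_stem(w))
```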

## Snowball Stemmer

In [None]:
from nltk.stem import SnowballStemmer

In [None]:
snowballsstemmer = SnowballStemmer('english')

In [None]:
for word in words:
  print(word+"--->"+snowballsstemmer.stem(word))

eating--->eat
eat--->eat
eaten--->eaten
writing--->write
writes--->write
programming--->program
programs--->program
history--->histori
finally--->final
finalized--->final


In [None]:
stemming.stem('fairly'),stemming.stem('sportingly')

('fairli', 'sportingli')

In [None]:
snowballsstemmer.stem('fairly'),snowballsstemmer.stem('sportingly')

('fair', 'sport')

# **Lemmatizer**

Lemmatization is a text pre-processing technique widely used in natural language processing (NLP) and machine learning. Like stemming, it reduces words to a base form, but the result (the lemma) is a valid dictionary word. [source](https://www.geeksforgeeks.org/python-lemmatization-with-nltk/?ref=ml_lbp)
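
To make the role of the part of speech concrete, here is a toy lemmatizer built on a lookup table. The table `LEMMA_TABLE` and the function `toy_lemmatize` are hypothetical stand-ins for a real dictionary such as WordNet. The point is only that the same word can map to different lemmas, or stay unchanged, depending on the part of speech supplied.

```python
# Hypothetical mini-dictionary: (word, part of speech) -> lemma.
LEMMA_TABLE = {
    ("eating", "v"): "eat",
    ("eaten", "v"): "eat",
    ("going", "v"): "go",
    ("programs", "n"): "program",
    ("programs", "v"): "program",
}

def toy_lemmatize(word, pos="n"):
    # Unknown (word, pos) pairs are returned unchanged.
    return LEMMA_TABLE.get((word, pos), word)

print(toy_lemmatize("eaten", pos="v"))
print(toy_lemmatize("eaten", pos="n"))
```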

## Wordnet Lemmatizer

In [None]:
# Lemmatization is commonly used in Q&A systems, chatbots, and text summarization
nltk.download('wordnet')

[nltk_data] Downloading package wordnet to /root/nltk_data...
[nltk_data]   Package wordnet is already up-to-date!


True

In [None]:
from nltk import WordNetLemmatizer

In [None]:
lemmatizer = WordNetLemmatizer()

In [None]:
'''
POS - NOUN-n
POS - VERB-v
POS - ADJECTIVE-a
POS - ADVERB-r
'''

lemmatizer.lemmatize("going", pos='v')

'go'

In [None]:
words = ["eating", "eat", "eaten", "writing", "writes", "programming", "programs", "history", "finally", "finalized"]

In [None]:
for word in words:
  print(word+"--->"+lemmatizer.lemmatize(word,pos='n'))

eating--->eating
eat--->eat
eaten--->eaten
writing--->writing
writes--->writes
programming--->programming
programs--->program
history--->history
finally--->finally
finalized--->finalized


In [None]:
for word in words:
  print(word+"--->"+lemmatizer.lemmatize(word,pos='v'))

eating--->eat
eat--->eat
eaten--->eat
writing--->write
writes--->write
programming--->program
programs--->program
history--->history
finally--->finally
finalized--->finalize


In [None]:
lemmatizer.lemmatize('goes')

'go'

In [None]:
lemmatizer.lemmatize("fairly", pos='v'),lemmatizer.lemmatize("sportingly", pos='v')

('fairly', 'sportingly')

# StopWords with NLTK

In [None]:
paragraph ="""As the late afternoon sun dipped lower in the sky, casting a golden hue over the landscape,
              the small village nestled at the foot of the mountains came alive with a quiet, comforting
              energy. Children played in the open fields, their laughter carrying on the wind, while
              farmers returned from the fields, their faces weathered but content after a long day’s
              work. The scent of freshly baked bread wafted from the homes, mingling with the earthy smell
              of the soil, damp from the morning’s light rain. Trees, towering and ancient, stood like silent
              sentinels watching over the village, their leaves shimmering with the fading light. The sound of
              the nearby river, flowing steadily over smooth rocks, added a gentle rhythm to the evening’s symphony.
              In the distance, the mountains loomed, their peaks kissed by the last rays of the sun, casting long
              shadows over the valley below. The air was cool and crisp, promising a quiet night, with only the
              occasional flicker of candlelight visible through the small windows of the cottages. It was a moment
              suspended in time, where nature and humanity seemed to exist in perfect harmony, as though the world
              had briefly forgotten its rush and chaos, offering instead a fleeting glimpse of serenity."""

In [None]:
from nltk.stem import PorterStemmer

In [None]:
from nltk.corpus import stopwords

In [None]:
import nltk
nltk.download('stopwords')

[nltk_data] Downloading package stopwords to /root/nltk_data...
[nltk_data]   Package stopwords is already up-to-date!


True

In [None]:
stopwords.words('english')

['i',
 'me',
 'my',
 'myself',
 'we',
 'our',
 'ours',
 'ourselves',
 'you',
 "you're",
 "you've",
 "you'll",
 "you'd",
 'your',
 'yours',
 'yourself',
 'yourselves',
 'he',
 'him',
 'his',
 'himself',
 'she',
 "she's",
 'her',
 'hers',
 'herself',
 'it',
 "it's",
 'its',
 'itself',
 'they',
 'them',
 'their',
 'theirs',
 'themselves',
 'what',
 'which',
 'who',
 'whom',
 'this',
 'that',
 "that'll",
 'these',
 'those',
 'am',
 'is',
 'are',
 'was',
 'were',
 'be',
 'been',
 'being',
 'have',
 'has',
 'had',
 'having',
 'do',
 'does',
 'did',
 'doing',
 'a',
 'an',
 'the',
 'and',
 'but',
 'if',
 'or',
 'because',
 'as',
 'until',
 'while',
 'of',
 'at',
 'by',
 'for',
 'with',
 'about',
 'against',
 'between',
 'into',
 'through',
 'during',
 'before',
 'after',
 'above',
 'below',
 'to',
 'from',
 'up',
 'down',
 'in',
 'out',
 'on',
 'off',
 'over',
 'under',
 'again',
 'further',
 'then',
 'once',
 'here',
 'there',
 'when',
 'where',
 'why',
 'how',
 'all',
 'any',
 'both',
 'each

In [None]:
from nltk import PorterStemmer

In [None]:
stemmer = PorterStemmer()

In [None]:
sentence = nltk.sent_tokenize(paragraph)

In [None]:
type(sentence)

list

In [None]:
## Remove stop words, then apply Porter stemming
stop_words = set(stopwords.words('english'))  # build the set once, not once per word
for i in range(len(sentence)):
  words = nltk.word_tokenize(sentence[i])
  words = [stemmer.stem(word) for word in words if word not in stop_words]
  sentence[i] = ' '.join(words)  # join the stemmed words back into a sentence


In [None]:
sentence

['as late afternoon sun dip lower sky , cast golden hue landscap , small villag nestl foot mountain came aliv quiet , comfort energi .',
 'children play open field , laughter carri wind , farmer return field , face weather content long day ’ work .',
 'the scent freshli bake bread waft home , mingl earthi smell soil , damp morn ’ light rain .',
 'tree , tower ancient , stood like silent sentinel watch villag , leav shimmer fade light .',
 'the sound nearbi river , flow steadili smooth rock , ad gentl rhythm even ’ symphoni .',
 'in distanc , mountain loom , peak kiss last ray sun , cast long shadow valley .',
 'the air cool crisp , promis quiet night , occasion flicker candlelight visibl small window cottag .',
 'it moment suspend time , natur human seem exist perfect harmoni , though world briefli forgotten rush chao , offer instead fleet glimps seren .']
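
The stemmed sentences above still begin with `as`, `the`, `in`, and `it`. The stop-word test compares the original tokens (`As`, `The`, `In`, `It`) against an all-lowercase list, so capitalized words pass the filter and are lowercased only afterwards by the stemmer. A minimal sketch of a case-insensitive filter follows. The stop-word set `STOP_WORDS` is a tiny hypothetical sample and the regex tokenizer is a stand-in for `nltk.word_tokenize`.

```python
import re

# Tiny hypothetical stop-word set; NLTK's English list is much longer.
STOP_WORDS = {"as", "the", "in", "it", "a", "of", "was", "over"}

def remove_stop_words(sentence):
    tokens = re.findall(r"\w+|[^\w\s]", sentence)
    # Compare the lowercased token so that 'The' and 'the' are both dropped.
    return [t for t in tokens if t.lower() not in STOP_WORDS]

filtered = remove_stop_words("The air was cool and crisp.")
print(filtered)
```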

In [None]:
from nltk.stem import SnowballStemmer
snowballstemmer = SnowballStemmer('english')

In [None]:
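## Remove stop words, then apply Snowball stemming
## Note: `sentence` already holds the Porter-stemmed text from the previous loop;
## re-run nltk.sent_tokenize(paragraph) first to stem the raw text instead
stop_words = set(stopwords.words('english'))  # build the set once, not once per word
for i in range(len(sentence)):
  words = nltk.word_tokenize(sentence[i])
  words = [snowballstemmer.stem(word) for word in words if word not in stop_words]
  sentence[i] = ' '.join(words)  # join the stemmed words back into a sentence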


In [None]:
sentence

['late afternoon sun dip lower sky , cast golden hue landscap , small villag nestl foot mountain came aliv quiet , comfort energi .',
 'children play open field , laughter carri wind , farmer return field , face weather content long day ’ work .',
 'scent fresh bake bread waft home , mingl earthi smell soil , damp morn ’ light rain .',
 'tree , tower ancient , stood like silent sentinel watch villag , leav shimmer fade light .',
 'sound nearbi river , flow steadili smooth rock , ad gentl rhythm even ’ symphoni .',
 'distanc , mountain loom , peak kiss last ray sun , cast long shadow valley .',
 'air cool crisp , promi quiet night , occas flicker candlelight visibl small window cottag .',
 'moment suspend time , natur human seem exist perfect harmoni , though world briefli forgotten rush chao , offer instead fleet glimp seren .']

In [None]:
from nltk import WordNetLemmatizer
lemmatizer = WordNetLemmatizer()

In [None]:
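## Remove stop words, then apply the WordNet lemmatizer (treating every word as a verb)
## Note: `sentence` was already stemmed by the loops above, so the lemmatizer
## sees stems rather than the raw words
stop_words = set(stopwords.words('english'))  # build the set once, not once per word
for i in range(len(sentence)):
  words = nltk.word_tokenize(sentence[i])
  words = [lemmatizer.lemmatize(word, pos='v') for word in words if word not in stop_words]
  sentence[i] = ' '.join(words)  # join the lemmatized words back into a sentence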


In [None]:
sentence

['late afternoon sun dip lower sky , cast golden hue landscap , small villag nestl foot mountain come aliv quiet , comfort energi .',
 'children play open field , laughter carri wind , farmer return field , face weather content long day ’ work .',
 'scent fresh bake bread waft home , mingl earthi smell soil , damp morn ’ light rain .',
 'tree , tower ancient , stand like silent sentinel watch villag , leav shimmer fade light .',
 'sound nearbi river , flow steadili smooth rock , ad gentl rhythm even ’ symphoni .',
 'distanc , mountain loom , peak kiss last ray sun , cast long shadow valley .',
 'air cool crisp , promi quiet night , occas flicker candlelight visibl small window cottag .',
 'moment suspend time , natur human seem exist perfect harmoni , though world briefli forget rush chao , offer instead fleet glimp seren .']

# Part of Speech Tags

Part-of-speech (POS) tagging is an NLP task in which each word in a document is assigned a grammatical category such as noun, verb, adjective, or adverb. [source](https://www.geeksforgeeks.org/nlp-part-of-speech-default-tagging/)
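
The simplest possible tagger is a lookup with a default, sketched below. The lexicon `LEXICON` and the function `lookup_tag` are hypothetical. The tags follow the Penn Treebank convention (`NN` noun, `VBZ` verb in third person singular, `DT` determiner, `JJ` adjective), and this toy ignores context entirely, unlike the trained tagger used later in this section.

```python
# Hypothetical lexicon mapping lowercase words to Penn Treebank tags.
LEXICON = {"is": "VBZ", "a": "DT", "the": "DT", "beautiful": "JJ"}

def lookup_tag(tokens, default="NN"):
    # Words missing from the lexicon fall back to the default tag.
    return [(tok, LEXICON.get(tok.lower(), default)) for tok in tokens]

tagged = lookup_tag("Taj Mahal is a beautiful Monument".split())
print(tagged)
```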

In [None]:
paragraph ="""As the late afternoon sun dipped lower in the sky, casting a golden hue over the landscape,
              the small village nestled at the foot of the mountains came alive with a quiet, comforting
              energy. Children played in the open fields, their laughter carrying on the wind, while
              farmers returned from the fields, their faces weathered but content after a long day’s
              work. The scent of freshly baked bread wafted from the homes, mingling with the earthy smell
              of the soil, damp from the morning’s light rain. Trees, towering and ancient, stood like silent
              sentinels watching over the village, their leaves shimmering with the fading light. The sound of
              the nearby river, flowing steadily over smooth rocks, added a gentle rhythm to the evening’s symphony.
              In the distance, the mountains loomed, their peaks kissed by the last rays of the sun, casting long
              shadows over the valley below. The air was cool and crisp, promising a quiet night, with only the
              occasional flicker of candlelight visible through the small windows of the cottages. It was a moment
              suspended in time, where nature and humanity seemed to exist in perfect harmony, as though the world
              had briefly forgotten its rush and chaos, offering instead a fleeting glimpse of serenity."""

In [None]:
import nltk
sentence = nltk.sent_tokenize(paragraph)
from nltk.corpus import stopwords

In [None]:
sentence

['As the late afternoon sun dipped lower in the sky, casting a golden hue over the landscape,\n              the small village nestled at the foot of the mountains came alive with a quiet, comforting\n              energy.',
 'Children played in the open fields, their laughter carrying on the wind, while\n              farmers returned from the fields, their faces weathered but content after a long day’s\n              work.',
 'The scent of freshly baked bread wafted from the homes, mingling with the earthy smell\n              of the soil, damp from the morning’s light rain.',
 'Trees, towering and ancient, stood like silent\n              sentinels watching over the village, their leaves shimmering with the fading light.',
 'The sound of\n              the nearby river, flowing steadily over smooth rocks, added a gentle rhythm to the evening’s symphony.',
 'In the distance, the mountains loomed, their peaks kissed by the last rays of the sun, casting long\n              shadows over

In [None]:
## Find the POS tag of each word

In [None]:
nltk.download('punkt')
nltk.download('stopwords')
nltk.download('averaged_perceptron_tagger')

[nltk_data] Downloading package punkt to /root/nltk_data...
[nltk_data]   Package punkt is already up-to-date!
[nltk_data] Downloading package stopwords to /root/nltk_data...
[nltk_data]   Package stopwords is already up-to-date!
[nltk_data] Downloading package averaged_perceptron_tagger to
[nltk_data]     /root/nltk_data...
[nltk_data]   Package averaged_perceptron_tagger is already up-to-
[nltk_data]       date!


True

In [None]:
stop_words = set(stopwords.words('english'))  # build the set once, not once per word
for i in range(len(sentence)):
  words = nltk.word_tokenize(sentence[i])
  words = [word for word in words if word not in stop_words]
  sentence[i] = ' '.join(words)  # join with spaces so the words stay separated
  pos_tag = nltk.pos_tag(words)
  print(pos_tag)



[('As', 'IN'), ('late', 'JJ'), ('afternoon', 'NN'), ('sun', 'NN'), ('dipped', 'VBD'), ('lower', 'JJR'), ('sky', 'NN'), (',', ','), ('casting', 'VBG'), ('golden', 'JJ'), ('hue', 'NN'), ('landscape', 'NN'), (',', ','), ('small', 'JJ'), ('village', 'NN'), ('nestled', 'JJ'), ('foot', 'NN'), ('mountains', 'NNS'), ('came', 'VBD'), ('alive', 'JJ'), ('quiet', 'JJ'), (',', ','), ('comforting', 'VBG'), ('energy', 'NN'), ('.', '.')]
[('Children', 'NNP'), ('played', 'VBD'), ('open', 'JJ'), ('fields', 'NNS'), (',', ','), ('laughter', 'NN'), ('carrying', 'VBG'), ('wind', 'NN'), (',', ','), ('farmers', 'NNS'), ('returned', 'VBD'), ('fields', 'NNS'), (',', ','), ('faces', 'VBZ'), ('weathered', 'JJ'), ('content', 'NN'), ('long', 'RB'), ('day', 'NN'), ('’', 'NNP'), ('work', 'NN'), ('.', '.')]
[('The', 'DT'), ('scent', 'NN'), ('freshly', 'RB'), ('baked', 'VBN'), ('bread', 'NN'), ('wafted', 'VBN'), ('homes', 'NNS'), (',', ','), ('mingling', 'VBG'), ('earthy', 'JJ'), ('smell', 'NN'), ('soil', 'NN'), (',', 

In [None]:
sentence

['As late afternoon sun dipped lower sky , casting golden hue landscape , small village nestled foot mountains came alive quiet , comforting energy .',
 'Children played open fields , laughter carrying wind , farmers returned fields , faces weathered content long day ’ work .',
 'The scent freshly baked bread wafted homes , mingling earthy smell soil , damp morning ’ light rain .',
 'Trees , towering ancient , stood like silent sentinels watching village , leaves shimmering fading light .',
 'The sound nearby river , flowing steadily smooth rocks , added gentle rhythm evening ’ symphony .',
 'In distance , mountains loomed , peaks kissed last rays sun , casting long shadows valley .',
 'The air cool crisp , promising quiet night , occasional flicker candlelight visible small windows cottages .',
 'It moment suspended time , nature humanity seemed exist perfect harmony , though world briefly forgotten rush chaos , offering instead fleeting glimpse serenity .']

In [None]:
for i in "Taj Mahal is beautiful monumnet".split():
  print(nltk.pos_tag([i]))

[('Taj', 'NN')]
[('Mahal', 'NN')]
[('is', 'VBZ')]
[('beautiful', 'NN')]
[('monumnet', 'NN')]


In [None]:
print(nltk.pos_tag("Taj Mahal is a beautiful Monument".split()))

[('Taj', 'NNP'), ('Mahal', 'NNP'), ('is', 'VBZ'), ('a', 'DT'), ('beautiful', 'JJ'), ('Monument', 'NN')]


# Named Entity Recognition

Named Entity Recognition (NER) is a technique in natural language processing (NLP) that identifies named entities in text, such as people, organizations, and places, and classifies each one into a category. [source](https://www.geeksforgeeks.org/named-entity-recognition/)
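
One very simple way to picture the task is a gazetteer: a list of known names with their categories, matched against the token sequence. The sketch below is such a toy. The table `GAZETTEER`, its labels, and the function `tag_entities` are hypothetical and chosen for illustration only. Statistical recognizers like the NLTK chunker used in this section rely on learned features instead of a fixed list.

```python
# Hypothetical gazetteer: tuple of tokens -> entity label.
GAZETTEER = {
    ("Eiffel", "Tower"): "FACILITY",
    ("Gustave", "Eiffel"): "PERSON",
    ("French",): "GPE",
}

def tag_entities(tokens):
    entities = []
    max_len = max(len(key) for key in GAZETTEER)
    i = 0
    while i < len(tokens):
        # Try the longest candidate span first so multi-word names win.
        for n in range(max_len, 0, -1):
            if i + n > len(tokens):
                continue
            span = tuple(tokens[i:i + n])
            if span in GAZETTEER:
                entities.append((" ".join(span), GAZETTEER[span]))
                i += n
                break
        else:
            # No entity starts at this token; move on.
            i += 1
    return entities

tokens = "The Eiffel Tower was built by French engineer Gustave Eiffel".split()
entities = tag_entities(tokens)
print(entities)
```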

In [None]:
sentence = "The Eiffel Tower was built from 1887 to 1889 by French EngineerGustave Eiffel, whose company speciallized in building metalframework and structures."

In [None]:
import nltk
words = nltk.word_tokenize(sentence)  # split the sentence into tokens

In [None]:
tag_element = nltk.pos_tag(words)  # attach a POS tag to each token

In [None]:
nltk.download('maxent_ne_chunker')

[nltk_data] Downloading package maxent_ne_chunker to
[nltk_data]     /root/nltk_data...
[nltk_data]   Package maxent_ne_chunker is already up-to-date!


True

In [None]:
nltk.download('words')

[nltk_data] Downloading package words to /root/nltk_data...
[nltk_data]   Package words is already up-to-date!


True

In [None]:
print(nltk.ne_chunk(tag_element))


(S
  The/DT
  (ORGANIZATION Eiffel/NNP Tower/NNP)
  was/VBD
  built/VBN
  from/IN
  1887/CD
  to/TO
  1889/CD
  by/IN
  (GPE French/JJ)
  (ORGANIZATION EngineerGustave/NNP Eiffel/NNP)
  ,/,
  whose/WP$
  company/NN
  speciallized/VBN
  in/IN
  building/NN
  metalframework/NN
  and/CC
  structures/NNS
  ./.)


# Word2Vec Practical Implementation
Word2vec is a technique in natural language processing (NLP) for obtaining vector representations of words. These vectors capture information about the meaning of the word based on the surrounding words. [source](https://en.wikipedia.org/wiki/Word2vec#:~:text=Word2vec%20is%20a%20technique%20in,text%20in%20a%20large%20corpus.)

In [None]:
!pip install gensim



In [None]:
import gensim

In [17]:
from gensim.models import Word2Vec, KeyedVectors

In [None]:
import gensim.downloader as api
wv = api.load('word2vec-google-news-300')
vec_king = wv['king']



In [10]:
vec_king

array([ 1.25976562e-01,  2.97851562e-02,  8.60595703e-03,  1.39648438e-01,
       -2.56347656e-02, -3.61328125e-02,  1.11816406e-01, -1.98242188e-01,
        5.12695312e-02,  3.63281250e-01, -2.42187500e-01, -3.02734375e-01,
       -1.77734375e-01, -2.49023438e-02, -1.67968750e-01, -1.69921875e-01,
        3.46679688e-02,  5.21850586e-03,  4.63867188e-02,  1.28906250e-01,
        1.36718750e-01,  1.12792969e-01,  5.95703125e-02,  1.36718750e-01,
        1.01074219e-01, -1.76757812e-01, -2.51953125e-01,  5.98144531e-02,
        3.41796875e-01, -3.11279297e-02,  1.04492188e-01,  6.17675781e-02,
        1.24511719e-01,  4.00390625e-01, -3.22265625e-01,  8.39843750e-02,
        3.90625000e-02,  5.85937500e-03,  7.03125000e-02,  1.72851562e-01,
        1.38671875e-01, -2.31445312e-01,  2.83203125e-01,  1.42578125e-01,
        3.41796875e-01, -2.39257812e-02, -1.09863281e-01,  3.32031250e-02,
       -5.46875000e-02,  1.53198242e-02, -1.62109375e-01,  1.58203125e-01,
       -2.59765625e-01,  

In [14]:
vec_king.shape

(300,)

In [18]:
wv['cricket']

array([-3.67187500e-01, -1.21582031e-01,  2.85156250e-01,  8.15429688e-02,
        3.19824219e-02, -3.19824219e-02,  1.34765625e-01, -2.73437500e-01,
        9.46044922e-03, -1.07421875e-01,  2.48046875e-01, -6.05468750e-01,
        5.02929688e-02,  2.98828125e-01,  9.57031250e-02,  1.39648438e-01,
       -5.41992188e-02,  2.91015625e-01,  2.85156250e-01,  1.51367188e-01,
       -2.89062500e-01, -3.46679688e-02,  1.81884766e-02, -3.92578125e-01,
        2.46093750e-01,  2.51953125e-01, -9.86328125e-02,  3.22265625e-01,
        4.49218750e-01, -1.36718750e-01, -2.34375000e-01,  4.12597656e-02,
       -2.15820312e-01,  1.69921875e-01,  2.56347656e-02,  1.50146484e-02,
       -3.75976562e-02,  6.95800781e-03,  4.00390625e-01,  2.09960938e-01,
        1.17675781e-01, -4.19921875e-02,  2.34375000e-01,  2.03125000e-01,
       -1.86523438e-01, -2.46093750e-01,  3.12500000e-01, -2.59765625e-01,
       -1.06933594e-01,  1.04003906e-01, -1.79687500e-01,  5.71289062e-02,
       -7.41577148e-03, -

In [20]:
wv.most_similar('cricket')

[('cricketing', 0.8372225761413574),
 ('cricketers', 0.8165745735168457),
 ('Test_cricket', 0.8094819188117981),
 ('Twenty##_cricket', 0.8068488240242004),
 ('Twenty##', 0.7624265551567078),
 ('Cricket', 0.75413978099823),
 ('cricketer', 0.7372578382492065),
 ('twenty##', 0.7316356897354126),
 ('T##_cricket', 0.7304614186286926),
 ('West_Indies_cricket', 0.6987985968589783)]

In [23]:
wv.most_similar('cutie')

[('hottie', 0.7584797739982605),
 ('babe', 0.693034291267395),
 ('cutie_pie', 0.6432326436042786),
 ('gal', 0.627162516117096),
 ('bestie', 0.6264120936393738),
 ('beau', 0.616774320602417),
 ('Heroes_hottie', 0.6128145456314087),
 ('blondie', 0.605380654335022),
 ('JoBro', 0.6051076054573059),
 ('heartthrob', 0.598899245262146)]

In [26]:
wv.similarity("football", "sports")

0.5927178
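
The score returned by `wv.similarity` is the cosine similarity of the two word vectors: their dot product divided by the product of their lengths. A minimal pure-Python sketch of that formula is shown below. The three-dimensional vectors are made-up placeholders, not real embeddings, so the printed value has no relation to the 0.59 above.

```python
import math

def cosine_similarity(u, v):
    # dot(u, v) / (|u| * |v|); ranges from -1 (opposite) to 1 (same direction).
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Hypothetical toy vectors standing in for wv['football'] and wv['sports'].
football = [0.9, 0.1, 0.3]
sports = [0.8, 0.2, 0.4]
print(cosine_similarity(football, sports))
```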

In [27]:
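# Note: the classic analogy is king - man + woman; this cell adds 'queen' instead,
# so 'queen' and 'king' themselves are expected to rank highest below
vec=wv['king']-wv['man']+wv['queen']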
vec

array([-0.19494629, -0.2446289 , -0.09539795,  0.3461914 ,  0.01635742,
       -0.08374023,  0.23876953, -0.42193604, -0.28955078,  0.41699982,
       -0.3466797 , -0.43359375,  0.01025391,  0.1256485 ,  0.06689453,
       -0.09790039,  0.20507812, -0.10098267,  0.08251953,  0.32373047,
        0.4580078 ,  0.24821472, -0.20288086,  0.28955078,  0.10253906,
       -0.03857422, -0.48242188, -0.1081543 ,  0.15527344, -0.15270996,
       -0.21386719, -0.20581055,  0.10595703,  0.63391113, -0.5810547 ,
       -0.3017578 , -0.11279297,  0.39648438, -0.09619141,  0.03613281,
        0.21411133, -0.6088867 ,  0.3408203 ,  0.12298584,  0.29492188,
       -0.00195312, -0.03781128,  0.16821289,  0.15332031,  0.24481201,
       -0.04626465,  0.13183594, -0.2626953 ,  0.17150879, -0.40771484,
        0.02772522, -0.41064453, -0.1862793 ,  0.07617188, -0.10406494,
        0.14550781,  0.41992188,  0.00299072,  0.2758789 ,  0.18359375,
       -0.39794922, -0.07757568,  0.12207031, -0.43115234,  0.42

In [29]:
wv.most_similar([vec])

[('queen', 0.8422122001647949),
 ('king', 0.8070623278617859),
 ('queens', 0.6293326020240784),
 ('monarch', 0.6233264207839966),
 ('kings', 0.599048376083374),
 ('princess', 0.5685814619064331),
 ('royal', 0.55137038230896),
 ('princes', 0.5398682951927185),
 ('NYC_anglophiles_aflutter', 0.5370354652404785),
 ('Queen_Consort', 0.5246095061302185)]