# In corpus linguistics, part-of-speech tagging (POS tagging, PoS tagging, or POST), also called grammatical tagging or word-category disambiguation, is the process of marking each word in a text (corpus) with its part of speech, based on both its definition and its context, that is, its relationship with adjacent and related words in a phrase, sentence, or paragraph. A simplified form is commonly taught to school-age children as identifying words as nouns, verbs, adjectives, adverbs, and so on.
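To make "definition and context" concrete, here is a deliberately tiny sketch of a rule-based tagger. It is only an illustration under stated assumptions: the lexicon, the two context rules, and the name `toy_tag` are all hypothetical, and this is not how `nltk.pos_tag` works internally (NLTK uses a trained model). The point is that an ambiguous word such as "book" gets a different tag depending on the tag of the word before it.

```python
# Hypothetical toy lexicon: each word maps to the tags it may take.
LEXICON = {
    "the": ["DT"],
    "a": ["DT"],
    "i": ["PRP"],
    "to": ["TO"],
    "want": ["VBP"],
    "read": ["VB"],
    "book": ["NN", "VB"],  # ambiguous: noun or verb
    "flight": ["NN"],
}


def toy_tag(tokens):
    """Tag tokens using the lexicon (definition) plus the previous tag (context)."""
    tagged = []
    for token in tokens:
        candidates = LEXICON.get(token.lower(), ["NN"])  # unknown words default to noun
        prev_tag = tagged[-1][1] if tagged else None
        if len(candidates) == 1:
            tag = candidates[0]
        elif prev_tag == "DT" and "NN" in candidates:
            tag = "NN"  # after a determiner, prefer the noun reading
        elif prev_tag == "TO" and "VB" in candidates:
            tag = "VB"  # after "to", prefer the verb reading
        else:
            tag = candidates[0]
        tagged.append((token, tag))
    return tagged


print(toy_tag(["I", "read", "the", "book"]))
print(toy_tag(["I", "want", "to", "book", "a", "flight"]))
```

In the first sentence "book" follows a determiner and is tagged `NN`; in the second it follows "to" and is tagged `VB`.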

In [1]:
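import nltk
import sys
import sklearn

# One-time setup: nltk.download("punkt"); nltk.download("averaged_perceptron_tagger")
# (newer NLTK releases may ask for "punkt_tab" and "averaged_perceptron_tagger_eng").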

In [2]:
paragraph = """
What are the qualities of your voice?
The Throat Center. {Chakra of the Month}
Mine is usually calm, soothing and clear. Yoga students often tell me my voice is relaxing, which is a sweet compliment. However, when I’m caught up in a moment of anger, fear or stress, my voice can get sharp and sound anything but calm and relaxing.

Let’s bring more mindfulness to our voices and how we use them this month. Our December chakra focus is our communication center, the throat center.
Just joining in? Read about the first, second, third and fourth chakras.

Chakra author Anothea Judith writes:
Vaguely Relephant Reads:

You Don’t have to be an Alcoholic to be Wasted. Read 

How to Get Over the One you Can’t Get Over. Read 

Chakra five is the center related to communication through sound, vibration, self-expression and creativity.
It is the realm of consciousness that controls, creates, transmits and receives communication, both within ourselves and between each other.
It is the center of dynamic creativity, of synthesizing old ideas into something new.
The fifth chakra color is bright, cerulean or turquoise blue.
Some symptoms of an out-of-balance fifth chakra include suffering from throat-centered illnesses, like strep throat or losing our voice.
Throaty ailments may mean we need to slow down, rest and communicate more clearly, both with ourselves and others, or they may be a result of excess communication or over-sharing—and a sign that we should give our voice a break and intentionally soak in silence.
If our lives are busy, we need to relax and remember how to set boundaries, express our needs and allow time and space to nurture ourselves.
When our throat chakra is healthy and balanced, we express ourselves and communicate clearly, honestly and openly—but not excessively.
Here are some other ways to help balance and align our throat chakra:
Chant
Any time we speak, sing, scream, whisper or make any other sort of sound with our voice, we invoke throat energy. The element of vishuddha chakra is sound, so belt out some long OMs, or whatever other chants you like.
Asana
Warm up by practicing gentle neck stretches in all directions. Many heart openers/back bends are also throat openers. The ultimate is probably fish pose (matsyasana).
Experiment with Music
Try practicing yoga or meditation to healing sounds like Tibetan singing bowls, African drums, ocean waves, rock music—and sometimes in sweet silence.
Mantra Meditation
Perfect for sitting, standing or walking meditation, repeating a silent mantra is a form of meditation that helps us quickly focus and calm the mind. It can be a syllable sound like “Om” or a word or phrase in Sanskrit, English or whatever language you prefer. A nice one is “sat nam,” which means “I am the truth,” in Sanskrit, but there are millions to choose from.
Mindfully Drink Tea
Brew up a mugful of your favorite herbal tea and have your own little ceremony as you meditate and enjoy a delicious, soothing hot drink.
Express Yourself
Speak up! Share something about yourself with someone. Write a poem. Sing. Express your unique voice in whatever way you wish.
On that note, if you’d like to participate in a two-week writing challenge this month, check out the reverb13 community writing project happening here at elephant."""

In [3]:
words = nltk.word_tokenize(paragraph)

In [4]:
words

['What',
 'are',
 'the',
 'qualities',
 'of',
 'your',
 'voice',
 '?',
 'The',
 'Throat',
 'Center',
 '.',
 '{',
 'Chakra',
 'of',
 'the',
 'Month',
 '}',
 'Mine',
 'is',
 'usually',
 'calm',
 ',',
 'soothing',
 'and',
 'clear',
 '.',
 'Yoga',
 'students',
 'often',
 'tell',
 'me',
 'my',
 'voice',
 'is',
 'relaxing',
 ',',
 'which',
 'is',
 'a',
 'sweet',
 'compliment',
 '.',
 'However',
 ',',
 'when',
 'I',
 '’',
 'm',
 'caught',
 'up',
 'in',
 'a',
 'moment',
 'of',
 'anger',
 ',',
 'fear',
 'or',
 'stress',
 ',',
 'my',
 'voice',
 'can',
 'get',
 'sharp',
 'and',
 'sound',
 'anything',
 'but',
 'calm',
 'and',
 'relaxing',
 '.',
 'Let',
 '’',
 's',
 'bring',
 'more',
 'mindfulness',
 'to',
 'our',
 'voices',
 'and',
 'how',
 'we',
 'use',
 'them',
 'this',
 'month',
 '.',
 'Our',
 'December',
 'chakra',
 'focus',
 'is',
 'our',
 'communication',
 'center',
 ',',
 'the',
 'throat',
 'center',
 '.',
 'Just',
 'joining',
 'in',
 '?',
 'Read',
 'about',
 'the',
 'first',
 ',',
 'second
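The token list above shows the typographic apostrophe (’) being split off as its own token, so "I’m" becomes `'I'`, `'’'`, `'m'`. One possible remedy, sketched here as an assumption rather than something the original notebook does, is to replace curly single quotes with ASCII apostrophes before tokenizing. The helper name `normalize_apostrophes` is hypothetical.

```python
def normalize_apostrophes(text):
    """Replace typographic single quotes (U+2018, U+2019) with an ASCII apostrophe."""
    return text.replace("\u2019", "'").replace("\u2018", "'")


sample = "However, when I\u2019m caught up, let\u2019s bring more mindfulness."
print(normalize_apostrophes(sample))
```

Calling `nltk.word_tokenize(normalize_apostrophes(paragraph))` should then give the tokenizer a better chance of treating contractions such as "I'm" and "Let's" as word plus clitic, although the exact tokens depend on the NLTK version.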

In [5]:
# Tag each token with its part of speech and store the (word, tag) pairs in tagged_words

In [10]:
tagged_words = nltk.pos_tag(words)
tagged_words

[('What', 'WP'),
 ('are', 'VBP'),
 ('the', 'DT'),
 ('qualities', 'NNS'),
 ('of', 'IN'),
 ('your', 'PRP$'),
 ('voice', 'NN'),
 ('?', '.'),
 ('The', 'DT'),
 ('Throat', 'NNP'),
 ('Center', 'NNP'),
 ('.', '.'),
 ('{', '('),
 ('Chakra', 'NNP'),
 ('of', 'IN'),
 ('the', 'DT'),
 ('Month', 'NNP'),
 ('}', ')'),
 ('Mine', 'NNP'),
 ('is', 'VBZ'),
 ('usually', 'RB'),
 ('calm', 'JJ'),
 (',', ','),
 ('soothing', 'JJ'),
 ('and', 'CC'),
 ('clear', 'JJ'),
 ('.', '.'),
 ('Yoga', 'JJ'),
 ('students', 'NNS'),
 ('often', 'RB'),
 ('tell', 'VBP'),
 ('me', 'PRP'),
 ('my', 'PRP$'),
 ('voice', 'NN'),
 ('is', 'VBZ'),
 ('relaxing', 'VBG'),
 (',', ','),
 ('which', 'WDT'),
 ('is', 'VBZ'),
 ('a', 'DT'),
 ('sweet', 'JJ'),
 ('compliment', 'NN'),
 ('.', '.'),
 ('However', 'RB'),
 (',', ','),
 ('when', 'WRB'),
 ('I', 'PRP'),
 ('’', 'VBP'),
 ('m', 'RB'),
 ('caught', 'VBN'),
 ('up', 'RP'),
 ('in', 'IN'),
 ('a', 'DT'),
 ('moment', 'NN'),
 ('of', 'IN'),
 ('anger', 'NN'),
 (',', ','),
 ('fear', 'NN'),
 ('or', 'CC'),
 ('stress
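The tags in this output are Penn Treebank tags. As a reading aid, the sketch below maps the tags that appear in the output shown here to short descriptions. The dictionary `PENN_TAG_GLOSS` and the helper `describe` are hypothetical names, and the dictionary covers only a subset of the tagset. NLTK also offers `nltk.help.upenn_tagset()` for the full list, which requires the `tagsets` data package to be downloaded.

```python
# Partial gloss of Penn Treebank tags seen in the output above (not the full tagset).
PENN_TAG_GLOSS = {
    "CC": "coordinating conjunction",
    "DT": "determiner",
    "IN": "preposition or subordinating conjunction",
    "JJ": "adjective",
    "JJR": "adjective, comparative",
    "MD": "modal",
    "NN": "noun, singular or mass",
    "NNS": "noun, plural",
    "NNP": "proper noun, singular",
    "PRP": "personal pronoun",
    "PRP$": "possessive pronoun",
    "RB": "adverb",
    "RP": "particle",
    "TO": "to",
    "VB": "verb, base form",
    "VBG": "verb, gerund or present participle",
    "VBN": "verb, past participle",
    "VBP": "verb, non-3rd person singular present",
    "VBZ": "verb, 3rd person singular present",
    "WDT": "wh-determiner",
    "WP": "wh-pronoun",
    "WRB": "wh-adverb",
}


def describe(tagged):
    """Attach a human-readable gloss to each (word, tag) pair."""
    return [(word, tag, PENN_TAG_GLOSS.get(tag, "not in this partial gloss"))
            for word, tag in tagged]


sample = [("What", "WP"), ("are", "VBP"), ("the", "DT"), ("qualities", "NNS")]
for word, tag, gloss in describe(sample):
    print(f"{word:<10} {tag:<5} {gloss}")
```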

In [7]:
type(tagged_words)

list

In [9]:
print("Number of tokens:", len(words))
print("Number of tagged tokens:", len(tagged_words))

Number of tokens: 644
Number of tagged tokens: 644
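The two lengths match because the tagger assigns exactly one tag per token. A natural next check is how often each tag occurs. The sketch below counts tags with `collections.Counter`. To stay self-contained it uses only the first 18 pairs from the output shown earlier as `sample_tagged` (a hypothetical variable); in the notebook the same expression would be applied to `tagged_words`.

```python
from collections import Counter

# First 18 (word, tag) pairs copied from the tagged output shown earlier.
sample_tagged = [
    ("What", "WP"), ("are", "VBP"), ("the", "DT"), ("qualities", "NNS"),
    ("of", "IN"), ("your", "PRP$"), ("voice", "NN"), ("?", "."),
    ("The", "DT"), ("Throat", "NNP"), ("Center", "NNP"), (".", "."),
    ("{", "("), ("Chakra", "NNP"), ("of", "IN"), ("the", "DT"),
    ("Month", "NNP"), ("}", ")"),
]

# Count how many tokens received each tag.
tag_counts = Counter(tag for _, tag in sample_tagged)

for tag, count in tag_counts.most_common():
    print(tag, count)
```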


In [13]:
# Join each word to its tag with an underscore, e.g. ('voice', 'NN') -> 'voice_NN'
words_tags = [word + "_" + tag for word, tag in tagged_words]

In [14]:
words_tags

['What_WP',
 'are_VBP',
 'the_DT',
 'qualities_NNS',
 'of_IN',
 'your_PRP$',
 'voice_NN',
 '?_.',
 'The_DT',
 'Throat_NNP',
 'Center_NNP',
 '._.',
 '{_(',
 'Chakra_NNP',
 'of_IN',
 'the_DT',
 'Month_NNP',
 '}_)',
 'Mine_NNP',
 'is_VBZ',
 'usually_RB',
 'calm_JJ',
 ',_,',
 'soothing_JJ',
 'and_CC',
 'clear_JJ',
 '._.',
 'Yoga_JJ',
 'students_NNS',
 'often_RB',
 'tell_VBP',
 'me_PRP',
 'my_PRP$',
 'voice_NN',
 'is_VBZ',
 'relaxing_VBG',
 ',_,',
 'which_WDT',
 'is_VBZ',
 'a_DT',
 'sweet_JJ',
 'compliment_NN',
 '._.',
 'However_RB',
 ',_,',
 'when_WRB',
 'I_PRP',
 '’_VBP',
 'm_RB',
 'caught_VBN',
 'up_RP',
 'in_IN',
 'a_DT',
 'moment_NN',
 'of_IN',
 'anger_NN',
 ',_,',
 'fear_NN',
 'or_CC',
 'stress_NN',
 ',_,',
 'my_PRP$',
 'voice_NN',
 'can_MD',
 'get_VB',
 'sharp_JJ',
 'and_CC',
 'sound_JJ',
 'anything_NN',
 'but_CC',
 'calm_NN',
 'and_CC',
 'relaxing_NN',
 '._.',
 'Let_VB',
 '’_NNP',
 's_VB',
 'bring_VBG',
 'more_JJR',
 'mindfulness_NN',
 'to_TO',
 'our_PRP$',
 'voices_NNS',
 'and_CC',

In [16]:
tagged_paragraph = ' '.join(words_tags)
tagged_paragraph

'What_WP are_VBP the_DT qualities_NNS of_IN your_PRP$ voice_NN ?_. The_DT Throat_NNP Center_NNP ._. {_( Chakra_NNP of_IN the_DT Month_NNP }_) Mine_NNP is_VBZ usually_RB calm_JJ ,_, soothing_JJ and_CC clear_JJ ._. Yoga_JJ students_NNS often_RB tell_VBP me_PRP my_PRP$ voice_NN is_VBZ relaxing_VBG ,_, which_WDT is_VBZ a_DT sweet_JJ compliment_NN ._. However_RB ,_, when_WRB I_PRP ’_VBP m_RB caught_VBN up_RP in_IN a_DT moment_NN of_IN anger_NN ,_, fear_NN or_CC stress_NN ,_, my_PRP$ voice_NN can_MD get_VB sharp_JJ and_CC sound_JJ anything_NN but_CC calm_NN and_CC relaxing_NN ._. Let_VB ’_NNP s_VB bring_VBG more_JJR mindfulness_NN to_TO our_PRP$ voices_NNS and_CC how_WRB we_PRP use_VBP them_PRP this_DT month_NN ._. Our_PRP$ December_NNP chakra_NN focus_NN is_VBZ our_PRP$ communication_NN center_NN ,_, the_DT throat_NN center_NN ._. Just_NNP joining_VBG in_IN ?_. Read_VB about_IN the_DT first_JJ ,_, second_JJ ,_, third_JJ and_CC fourth_JJ chakras_NNS ._. Chakra_NNP author_NN Anothea_NNP Judit
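If the `word_TAG` string later needs to be turned back into pairs, splitting on the last underscore of each item recovers them. This is a minimal sketch with a hypothetical helper name, `split_tagged`; it assumes that tags never contain an underscore, which holds for the tags shown above. Using `rpartition` rather than `split` means a word that itself contains an underscore is still handled correctly.

```python
def split_tagged(tagged_text):
    """Turn 'word_TAG word_TAG ...' back into a list of (word, tag) pairs."""
    pairs = []
    for item in tagged_text.split():
        # rpartition splits on the LAST underscore, so the tag is always the tail.
        word, _, tag = item.rpartition("_")
        pairs.append((word, tag))
    return pairs


sample = "What_WP are_VBP the_DT qualities_NNS of_IN your_PRP$ voice_NN ?_."
print(split_tagged(sample))
```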