# 5.1 Working with Text

First, we import `spacy`. If this is the first time you are using it, you need to install it first with the following command:

In [None]:
!pip install spacy

In [1]:
import spacy

You will also need to download a model. For these examples, we are using `en_core_web_lg`.

In [None]:
!python -m spacy download en_core_web_lg

And this is how we load it into our `nlp` object.

In [2]:
nlp = spacy.load("en_core_web_lg")

In [3]:
sentence1 = "My cat is extremely quiet."
sentence2 = "My cat is quiet, and I often mistake it for a pillow."

As in the spaCy tutorial, we can process a string of text with the `nlp` object to create a `doc` object.

In [4]:
doc = nlp(sentence1)

We can use `len()` to see how many tokens there are in our text.

In [5]:
len(doc)

6

We can print out each of the tokens to better understand how tokenization works.

In [6]:
for token in doc:
    print(token)

My
cat
is
extremely
quiet
.
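To see why the final period counts as its own token, it can help to look at a few lexical flags on each token. The sketch below is an illustration rather than part of the original notebook: it assumes that a tokenizer-only pipeline from `spacy.blank("en")` is enough for this sentence, so no model download is needed, and the names `blank_nlp` and `blank_doc` are our own.

```python
import spacy

# Assumption: a blank English pipeline (rule-based tokenizer only) is
# sufficient to illustrate tokenization; no trained model is loaded.
blank_nlp = spacy.blank("en")
blank_doc = blank_nlp("My cat is extremely quiet.")

for token in blank_doc:
    # token.i is the token's position in the doc;
    # is_alpha and is_punct are lexical flags on the token
    print(token.i, token.text, token.is_alpha, token.is_punct)

# Count only the tokens that are not punctuation
word_count = sum(1 for token in blank_doc if not token.is_punct)
print(word_count)
```

The difference between `len(blank_doc)` and `word_count` comes from the trailing period, which the tokenizer splits off from `quiet`.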


We can then access linguistic attributes such as part-of-speech tags with the `.pos_` attribute.

In [7]:
for token in nlp(sentence1):
    print(token.text, token.pos_)

My PRON
cat NOUN
is AUX
extremely ADV
quiet ADJ
. PUNCT
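The values of `.pos_` are abbreviations from the Universal POS tag set. If a tag is unfamiliar, `spacy.explain()` returns a short description of it. The snippet below is a small illustration that hard-codes the tags printed above in a list we call `tags`, so it runs without loading a model.

```python
import spacy

# Tags copied from the output above (not computed here)
tags = ["PRON", "NOUN", "AUX", "ADV", "ADJ", "PUNCT"]

for tag in tags:
    # spacy.explain looks the label up in spaCy's built-in glossary
    print(tag, "->", spacy.explain(tag))
```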


In [8]:
for token in nlp(sentence2):
    print(token.text, token.pos_)

My PRON
cat NOUN
is AUX
quiet ADJ
, PUNCT
and CCONJ
I PRON
often ADV
mistake VERB
it PRON
for ADP
a DET
pillow NOUN
. PUNCT
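Once the tags are available, they are easy to summarize. As a sketch, the snippet below tallies the tags for the second sentence with `collections.Counter`. To stay runnable without the model, it hard-codes the (token, tag) pairs printed above in a list we call `tagged`; in the notebook you could build the same list with `[(t.text, t.pos_) for t in nlp(sentence2)]`.

```python
from collections import Counter

# (token, tag) pairs copied from the output above
tagged = [
    ("My", "PRON"), ("cat", "NOUN"), ("is", "AUX"), ("quiet", "ADJ"),
    (",", "PUNCT"), ("and", "CCONJ"), ("I", "PRON"), ("often", "ADV"),
    ("mistake", "VERB"), ("it", "PRON"), ("for", "ADP"), ("a", "DET"),
    ("pillow", "NOUN"), (".", "PUNCT"),
]

# Count how often each part-of-speech tag occurs
pos_counts = Counter(tag for _, tag in tagged)

for tag, count in pos_counts.most_common():
    print(tag, count)
```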
