# First text analysis with Python


In this notebook, I will show you how to run a simple text analysis with the spaCy package in Python.

In [2]:
import spacy

nlp = spacy.load("en_core_web_sm")

In [3]:
example_text = "Hi. This is my first awesome sentence to analyze."

In [4]:
doc = nlp(example_text)

## Print tokens

Iterating over the parsed `doc` object yields its tokens one at a time, so you can print each token in a loop.

In [9]:
for token in doc:
    print(token)

Hi
.
This
is
my
first
awesome
sentence
to
analyze
.


## Print lemmatized form

You can print the lemmatized form of a token with `token.lemma_` (do not forget the `_` at the end; without it, `token.lemma` returns an integer hash ID instead of the string).

In [8]:
# Lemmatize
for token in doc:
    print(token.text, token.lemma_, sep="\t")

Hi	hi
.	.
This	this
is	be
my	my
first	first
awesome	awesome
sentence	sentence
to	to
analyze	analyze
.	.


## Print Part of Speech information

You can print more information, such as the coarse-grained part-of-speech tag (`token.pos_`) and the fine-grained tag (`token.tag_`).

In [7]:
for token in doc:
    print(token.text, token.pos_, token.tag_, sep="\t")

Hi	INTJ	UH
.	PUNCT	.
This	PRON	DT
is	AUX	VBZ
my	PRON	PRP$
first	ADJ	JJ
awesome	ADJ	JJ
sentence	NOUN	NN
to	PART	TO
analyze	VERB	VB
.	PUNCT	.
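
If a label in this output is unfamiliar, `spacy.explain` can look up a short description for it. The snippet below is a minimal sketch; the list of labels is simply taken from the output above, and `glosses` is a name chosen here for illustration.

```python
import spacy

# A few labels copied from the POS output above
labels = ["INTJ", "AUX", "PRP$", "VBZ"]

# spacy.explain returns a short description string,
# or None if the label is not in spaCy's glossary
glosses = {label: spacy.explain(label) for label in labels}

for label, gloss in glosses.items():
    print(label, gloss, sep="\t")
```

The lookup uses spaCy's built-in glossary, so it does not require a loaded model.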


## spaCy token information

Some useful token attributes are the following:


| code | what it does | example |
|----|----|----|
| token.lemma_ | lemmatized form | be, child |
| token.pos_ | coarse-grained POS (Universal Dependencies) | NOUN, VERB |
| token.tag_ | fine-grained POS (Penn Treebank tag set) | NN, JJ, VB, VBZ |
| token.dep_ | dependency type | amod, advmod |
| token.head | the head token of this token's dependency relation | |
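
The last two attributes in the table, `token.dep_` and `token.head`, are not printed anywhere earlier in this notebook, so here is a minimal sketch of how they could be displayed. The helper name `dependency_rows` is hypothetical, and the `try`/`except` around `spacy.load` is only there so the cell gives a helpful message if the model has not been downloaded.

```python
import spacy


def dependency_rows(doc):
    """Return (token text, dependency label, head text) for each token."""
    return [(token.text, token.dep_, token.head.text) for token in doc]


try:
    nlp = spacy.load("en_core_web_sm")
except OSError:
    # spacy.load raises OSError when the model package is not installed
    nlp = None
    print("Model not found. Install it with: python -m spacy download en_core_web_sm")

if nlp is not None:
    doc = nlp("Hi. This is my first awesome sentence to analyze.")
    for text, dep, head in dependency_rows(doc):
        print(text, dep, head, sep="\t")
```

Each row shows a token, its relation to its head, and the head itself. The root of a sentence is its own head, so it appears in both the first and the last column.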