## spaCy: Industrial-Strength Natural Language Processing

spaCy is a free, open-source library for Natural Language Processing (NLP) in Python. It features named entity recognition (NER), part-of-speech (POS) tagging, dependency parsing, word vectors and more.

spaCy can be installed with pip or pip3: \
`pip3 install spacy -U` \
More information can be found here: https://spacy.io/usage 

We also need to install a language model, which is distributed as a Python package: \
`python3 -m spacy download en_core_web_sm` \
More information can be found here: https://spacy.io/usage/models
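
Because the model is installed as a regular Python package, its presence can be checked before calling `spacy.load`. The following is a minimal sketch that uses only the standard library; the helper name `is_installed` is hypothetical and not part of spaCy.

```python
import importlib.util


def is_installed(package_name: str) -> bool:
    """Return True if a top-level package with this name can be imported."""
    return importlib.util.find_spec(package_name) is not None


# Report which of the two packages from the steps above are available.
for name in ("spacy", "en_core_web_sm"):
    status = "installed" if is_installed(name) else "missing"
    print(f"{name}: {status}")
```

If either package is reported as missing, the corresponding install command above needs to be run first.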

### Example 1: This is spaCy

In this example we use spaCy to tokenize a text, list its noun phrases and verbs, and find named entities.

In [1]:
import spacy

# Load the small English pipeline: tokenizer, tagger, parser and NER
nlp = spacy.load("en_core_web_sm")

# Process whole documents
text = ("Scorpions are a German rock band formed in 1965 in Hannover by Rudolf Schenker. Since the band's inception, its musical style has ranged from hard rock to heavy metal.")
doc = nlp(text)

# Analyze syntax
print("Noun phrases:", [chunk.text for chunk in doc.noun_chunks])
print("Verbs:", [token.lemma_ for token in doc if token.pos_ == "VERB"])

# Find named entities, phrases and concepts
for entity in doc.ents:
    print(entity.text, entity.label_)

Noun phrases: ['Scorpions', 'a German rock band', 'Hannover', 'Rudolf Schenker', "the band's inception", 'its musical style', 'hard rock', 'heavy metal']
Verbs: ['form', 'range']
German NORP
1965 DATE
Hannover GPE
Rudolf Schenker PERSON
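
Example 1 mentions tokenization but prints only noun chunks, verbs and entities. As a minimal sketch of the tokens themselves, the snippet below uses `spacy.blank("en")`, which creates a tokenizer-only English pipeline and therefore does not need a downloaded model. The sentence is the first one from the example; the variable names and the choice of attributes are only for illustration.

```python
import spacy

# A blank English pipeline contains only the rule-based tokenizer,
# so no trained model is required for this sketch.
nlp_blank = spacy.blank("en")

doc_tokens = nlp_blank(
    "Scorpions are a German rock band formed in 1965 in Hannover by Rudolf Schenker."
)

# Each token exposes its index, its text and simple lexical flags.
for token in doc_tokens:
    print(token.i, token.text, token.is_alpha, token.like_num, token.is_punct)

tokens = [token.text for token in doc_tokens]
```

Attributes such as `pos_` are not filled in by a blank pipeline, because they come from the trained components of a model like `en_core_web_sm`.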


### Example 2: Visualizing the entity recognizer

The next example shows how spaCy visualizes named entities. \
The entity visualizer, `ent`, highlights named entities and their labels in a text. \
The entity types supported by spaCy are listed here: \
https://spacy.io/api/annotation#named-entities

In [2]:
import spacy
from spacy import displacy

# Note the trailing space after "heavy metal. ": adjacent string literals are
# concatenated as-is, so without it the two sentences would run together.
doc_ent = nlp("Scorpions are a German rock band formed in 1965 in Hannover by Rudolf Schenker. Since the band's inception, its musical style has ranged from hard rock to heavy metal. "
              "The lineup from 1979 to 1992 was the most successful incarnation of the group, and included Klaus Meine (vocals), Rudolf Schenker (rhythm guitar), Matthias Jabs (lead guitar), Francis Buchholz (bass), and Herman Rarebell (drums)")

displacy.render(doc_ent, style="ent", jupyter=True)
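
`jupyter=True` only makes sense inside a notebook. As a sketch of using the same visualizer elsewhere, `displacy.render` returns the markup as a string when called with `jupyter=False`. The example below assumes displaCy's manual mode (`manual=True`) with a hand-written character span, so the entity shown is labeled by hand rather than predicted by a model.

```python
from spacy import displacy

text = "Scorpions are a German rock band formed in 1965 in Hannover by Rudolf Schenker."

# Hand-labeled span (not model output): character offsets of one entity.
name = "Rudolf Schenker"
start = text.index(name)
manual_doc = {
    "text": text,
    "ents": [{"start": start, "end": start + len(name), "label": "PERSON"}],
    "title": None,
}

# With jupyter=False the rendered HTML is returned instead of displayed;
# page=True wraps it in a complete HTML page.
html = displacy.render(manual_doc, style="ent", manual=True, page=True, jupyter=False)

# Show the beginning of the generated markup.
print(html[:200])
```

The returned string can be written to an `.html` file and opened in a browser. Passing a processed `Doc` such as `doc_ent` without `manual=True` renders the model's own predictions in the same way.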