# Comparison of NER systems

In [15]:
text = """The Hawaiian Islands became the fiftieth US state in 1959.
          Since the passage of the Social Security Indexing Act of 1972, the level of Social Security benefits increases each year along with the Consumer Price Index. 
          The leading case, perhaps the most studied case, in all the common law is Hadley v. Baxendale, decided in England in 1854. 
          Lyndon Baines Johnson (August 27, 1908 – January 22, 1973), often referred to as LBJ, was an American politician who served as the 36th president of the United States from 1963 to 1969.
    """ 
        

# Stanford NLP

An online demo of Stanford NER is available [at this link](http://nlp.stanford.edu:8080/ner/).

In [16]:
import os

from stanfordnlp.server import CoreNLPClient

# point the client at the local CoreNLP installation
os.environ['CORENLP_HOME'] = r'/Users/mazidi/stanford-corenlp-full-2018-10-05'

In [17]:
# set up the client
with CoreNLPClient(annotators=['tokenize','ssplit','pos','lemma','ner'], timeout=60000, memory='16G') as client:
    # submit the request to the server
    ann = client.annotate(text)

    print('\nTokens \t POS \t NER')
    for sentence_count, sentence in enumerate(ann.sentence, start=1):
        print('\nSentence', sentence_count)
        for token in sentence.token:
            # print only tokens that are part of a named entity
            if token.ner != 'O':
                print(token.word, '\t', token.pos, '\t', token.ner)

Starting server with command: java -Xmx16G -cp /Users/mazidi/stanford-corenlp-full-2018-10-05/* edu.stanford.nlp.pipeline.StanfordCoreNLPServer -port 9000 -timeout 60000 -threads 5 -maxCharLength 100000 -quiet True -serverProperties corenlp_server-e2f00d45e81a4cd4.props -preload tokenize,ssplit,pos,lemma,ner

Tokens 	 POS 	 NER

Sentence 1
Hawaiian 	 JJ 	 LOCATION
Islands 	 NNPS 	 LOCATION
fiftieth 	 NN 	 ORDINAL
US 	 NNP 	 COUNTRY
1959 	 CD 	 DATE

Sentence 2
Social 	 NNP 	 MISC
Security 	 NNP 	 MISC
Indexing 	 NN 	 MISC
Act 	 NN 	 MISC
1972 	 CD 	 DATE
each 	 DT 	 SET
year 	 NN 	 SET
Consumer 	 NNP 	 MISC
Price 	 NNP 	 MISC
Index 	 NNP 	 MISC

Sentence 3
Hadley 	 NNP 	 PERSON
Baxendale 	 NNP 	 PERSON
England 	 NNP 	 COUNTRY
1854 	 CD 	 DATE

Sentence 4
Lyndon 	 NNP 	 PERSON
Baines 	 NNP 	 PERSON
Johnson 	 NNP 	 PERSON
August 	 NNP 	 DATE
27 	 CD 	 DATE
, 	 , 	 DATE
1908 	 CD 	 DATE
January 	 NNP 	 DATE
22 	 CD 	 DATE
, 	 , 	 DATE
1973 	 CD 	 DATE
LBJ 	 NNP 	 ORGANIZATION
American 	 J
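
The output above is tagged token by token, so a multi-word name such as "Hawaiian Islands" shows up as two separate rows. To read it as entity spans, one option is to merge runs of adjacent tokens that share a tag. The sketch below is a minimal, library-free illustration of that idea: `group_entities` is a hypothetical helper, not part of CoreNLP, and the sample list is typed in by hand from Sentence 1, with the unprinted tokens assumed to carry the tag `O`. A known limitation is that two distinct entities with the same tag that sit directly next to each other would be merged into one span.

```python
from itertools import groupby


def group_entities(tagged_tokens, outside="O"):
    """Merge runs of adjacent tokens sharing an NER tag into (text, tag) spans.

    tagged_tokens: list of (word, tag) pairs for ONE sentence, including
    tokens tagged `outside`, so that separated entities are not joined.
    """
    spans = []
    for tag, run in groupby(tagged_tokens, key=lambda pair: pair[1]):
        if tag == outside:
            continue  # skip tokens that are not part of any entity
        spans.append((" ".join(word for word, _ in run), tag))
    return spans


# Sentence 1, copied by hand; the 'O' tags are an assumption based on
# which tokens the loop above did not print.
sentence_1 = [
    ("The", "O"), ("Hawaiian", "LOCATION"), ("Islands", "LOCATION"),
    ("became", "O"), ("the", "O"), ("fiftieth", "ORDINAL"),
    ("US", "COUNTRY"), ("state", "O"), ("in", "O"), ("1959", "DATE"),
    (".", "O"),
]

print(group_entities(sentence_1))
# [('Hawaiian Islands', 'LOCATION'), ('fiftieth', 'ORDINAL'), ('US', 'COUNTRY'), ('1959', 'DATE')]
```

Passing the full token stream, rather than only the printed rows, matters here: in Sentence 3 "Hadley" and "Baxendale" are both tagged PERSON, but the untagged token between them keeps them as two spans.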

# spaCy

In [18]:
import spacy

nlp = spacy.load('en_core_web_md')

In [19]:
doc = nlp(text)

for ent in doc.ents:
    print(ent.text, ent.label_)

Hawaiian Islands GPE
fiftieth ORDINAL
US GPE
1959 DATE
the Social Security Indexing Act LAW
1972 DATE
Social Security ORG
each year DATE
Hadley v. Baxendale PERSON
England GPE
1854 DATE
Lyndon Baines Johnson PERSON
August 27, 1908 – January 22, 1973 DATE
LBJ PERSON
American NORP
36th ORDINAL
the United States GPE
1963 to 1969 DATE
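
Since the two systems use different label sets (for example LOCATION/COUNTRY versus GPE), a side-by-side comparison is easier when entities are aligned by their text first. The following is a rough sketch of one way to do that in plain Python. `compare_entities` is a hypothetical helper, and the two sample lists are copied by hand from the outputs above; the Stanford entry "Hawaiian Islands" assumes its two LOCATION tokens are read as one span. Matching is by exact text only, and repeated entity texts collapse to a single entry, so this is an illustration rather than a proper evaluation.

```python
def compare_entities(system_a, system_b):
    """Align two lists of (text, label) entities by exact entity text."""
    labels_a = dict(system_a)
    labels_b = dict(system_b)
    shared = [text for text in labels_a if text in labels_b]
    return {
        # found by both systems with the same label
        "same_label": [(t, labels_a[t]) for t in shared
                       if labels_a[t] == labels_b[t]],
        # found by both systems, labeled differently: (text, label_a, label_b)
        "different_label": [(t, labels_a[t], labels_b[t]) for t in shared
                            if labels_a[t] != labels_b[t]],
        # found by only one of the systems
        "only_a": [t for t in labels_a if t not in labels_b],
        "only_b": [t for t in labels_b if t not in labels_a],
    }


# Hand-copied subsets of the two outputs (illustrative only).
stanford_entities = [
    ("Hawaiian Islands", "LOCATION"), ("fiftieth", "ORDINAL"),
    ("US", "COUNTRY"), ("1959", "DATE"), ("LBJ", "ORGANIZATION"),
]
spacy_entities = [
    ("Hawaiian Islands", "GPE"), ("fiftieth", "ORDINAL"),
    ("US", "GPE"), ("1959", "DATE"), ("LBJ", "PERSON"),
]

result = compare_entities(stanford_entities, spacy_entities)
for bucket, items in result.items():
    print(bucket, items)
```

On this subset the two systems find the same spans but disagree on three labels, including "LBJ", which Stanford tags ORGANIZATION and spaCy tags PERSON.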
