turCy

An Open Information Extraction system designed mainly for German.

Installation

pip install turcy

python -m spacy download de_core_news_lg-3.0.0 --direct

turCy can be applied to other languages as well, but some extra work is necessary: no patterns for English are shipped, so you would have to build your own patterns first. For building patterns, a `pattern_builder` module is available.

How it works

1. Building a Pattern

2. Extraction

Load the German language model from spaCy.
Add turCy to the nlp pipeline.
Pass the document to the pipeline.
Iterate over the sentences in the document and access the triples in each sentence.

import spacy
import turcy


def example():
    # Load the large German model without its NER component.
    nlp = spacy.load("de_core_news_lg", exclude=["ner"])
    nlp.max_length = 2096700  # raise spaCy's character limit for long texts
    turcy.add_to_pipe(nlp)  # apply/use current patterns in list
    # Per-component settings: select the "small" pattern list.
    pipeline_params = {"attach_triple2sentence": {"pattern_list": "small"}}
    doc = nlp("Nürnberg ist eine Stadt in Deutschland.", component_cfg=pipeline_params)
    for sent in doc.sents:
        print(sent)
        for triple in sent._.triples:
            subj, pred, obj = triple["triple"]
            print(f"subject: '{subj}', predicate: '{pred}', object: '{obj}'")


if __name__ == "__main__":
    example()
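To work with the extracted triples outside the print loop, it can help to flatten them into plain string tuples. The following is a minimal sketch, assuming each entry in `sent._.triples` is a dict with a `"triple"` key holding a (subject, predicate, object) tuple, as in the example above. The helper name `triples_to_rows` and the sample data are hypothetical and not part of turCy.

```python
from typing import Iterable, List, Mapping, Tuple


def triples_to_rows(triples: Iterable[Mapping[str, tuple]]) -> List[Tuple[str, str, str]]:
    """Convert triple dicts into plain (subject, predicate, object) string tuples."""
    rows = []
    for triple in triples:
        # Assumed shape: {"triple": (subj, pred, obj)}, as used in the example above.
        subj, pred, obj = triple["triple"]
        # str() also handles spaCy tokens or spans, which render as their text.
        rows.append((str(subj), str(pred), str(obj)))
    return rows


# Hand-written sample in the assumed shape; this is not real turCy output.
sample = [{"triple": ("Nürnberg", "ist", "Stadt")}]
rows = triples_to_rows(sample)
print(rows)
```

With a real pipeline, the same helper could be called as `triples_to_rows(sent._.triples)` for each sentence in `doc.sents`, provided the triples have the shape assumed here.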
