PromptNER based Chain-of-Thought prompting for span tasks #180

kabirkhan · 2023-06-20T16:45:45Z

Description

Implements Chain-of-Thought prompting for span tasks, based on PromptNER (https://arxiv.org/pdf/2305.15444.pdf).

Example Usage:

import spacy

nlp = spacy.blank("en")
nlp.add_pipe(
    "llm",
    name="llm_ner",
    config={
        "task": {
            "@llm_tasks": "spacy.NER.v3",
            "description": "Entities are the names of people and basketball teams. Names of sports are not entities. Names of soccer, baseball, rugby, etc. teams are not basketball team entities. Adjectives are not entities. Pronouns are not persons. Man City is the name of a soccer team so not an entity in this case.",
            "examples": [
                {
                    "text": "The Golden State Warriors beat the LA Lakers by almost 20 points on the back of an amazing performance from Steph Curry.",
                    "entities": [
                        # In the final code it should also be possible to provide these in the prompt string format,
                        # e.g. | Golden State Warriors | BASKETBALL_TEAM | True | is a basketball team in the NBA |
                        {
                            "text": "Golden State Warriors",
                            "label": "BASKETBALL_TEAM",
                            "is_entity": True,
                            "reason": "is a basketball team in the NBA"
                        },
                        {
                            "text": "LA Lakers",
                            "label": "BASKETBALL_TEAM",
                            "is_entity": True,
                            "reason": "is a basketball team in the NBA"
                        },
                        {
                            "text": "20",
                            "label": "==NONE==",
                            "is_entity": False,
                            "reason": "is a number"
                        },
                        {
                            "text": "amazing performance",
                            "label": "==NONE==",
                            "is_entity": False,
                            "reason": "is a description of an event"
                        },
                        {
                            "text": "Steph Curry",
                            "label": "PERSON",
                            "is_entity": True,
                            "reason": "is the name of a person"
                        },
                    ]
                }
            ],
            "labels": "PERSON,BASKETBALL_TEAM",
        },
        "model": {
            "@llm_models": "spacy.GPT-3-5.v1",
            "config": {"max_tokens": 200},
        },
        "save_io": True
    }
)

doc = nlp("The Denver Nuggets beat the Miami Heat 4 games to 1")
print([(ent.text, ent.label_) for ent in doc.ents])

# The team names here are deliberately swapped across sports to inspect the model's reasoning for labeling them or not.
doc2 = nlp("In other sports news Atletico Madrid was able to tie the Gonzaga bulldogs away from home with a stunning strike from Kevin De Bruyne.")
print([(ent.text, ent.label_) for ent in doc2.ents])
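The comment in the example mentions a pipe-delimited prompt string format for reasoned entities. A minimal sketch of how such lines could be rendered and parsed back is shown below. `ReasonedSpan`, `to_prompt_line` and `from_prompt_line` are hypothetical names used only for illustration; they are not the spacy-llm implementation, and the sketch assumes that no cell contains a `|` character.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ReasonedSpan:
    """Hypothetical container for one annotated span (not the spacy-llm class)."""

    text: str
    label: str
    is_entity: bool
    reason: str


def to_prompt_line(span: ReasonedSpan) -> str:
    # Render one span in the pipe-delimited form from the example comment.
    return f"| {span.text} | {span.label} | {span.is_entity} | {span.reason} |"


def from_prompt_line(line: str) -> ReasonedSpan:
    # Strip the outer delimiters, then split the remaining cells on "|".
    # Assumption: no cell contains a literal "|".
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    if len(cells) != 4:
        raise ValueError(f"expected 4 cells, got {len(cells)}: {line!r}")
    text, label, is_entity, reason = cells
    return ReasonedSpan(text, label, is_entity.lower() == "true", reason)


spans: List[ReasonedSpan] = [
    ReasonedSpan(
        "Golden State Warriors",
        "BASKETBALL_TEAM",
        True,
        "is a basketball team in the NBA",
    ),
    ReasonedSpan("20", "==NONE==", False, "is a number"),
]
lines = [to_prompt_line(span) for span in spans]
print("\n".join(lines))
```

Round-tripping a line through `from_prompt_line` gives back an equal `ReasonedSpan`, which makes the format easy to check in a unit test.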

Types of change

Feature

Checklist

I confirm that I have the right to submit this contribution under the project's MIT license.
I ran all tests in tests and usage_examples/tests, and all new and existing tests passed. This includes
- all external tests (i.e. pytest ran with --external)
- all tests requiring a GPU
My changes don't require a change to the documentation, or if they do, I've added all required information.

rmitsch

We're getting there 🙂 The templates for the other span-based tasks still need to be updated.

* use a default example for zero-shot NER/spancat COT tasks if none are provided
* add test for no examples spancat COT
* Fix tests.
* Update filterwarnings.
* Update RELExample factory.
* Comment validator.
* Attempt to fix pydantic macOS error.
* Attempt to fix pydantic macOS error.
* Update filterwarnings.

Co-authored-by: Raphael Mitsch <r.mitsch@outlook.com>

rmitsch · 2023-08-24T10:09:29Z

I think this is ready for a final once-over and running the external tests in the CI.

svlandeg

There are still two unresolved conversations; can you have a look at those?

I'd also like us to run a few more tests on how v3 can be used as a substitute for v2, and which changes (if any) would be needed in e.g. the Dolly NER example.

svlandeg

This is looking really good, nice team effort all! 🙏

Had a few nits + one more important question about a phrasing in the NER template.

Kabir Khan added 3 commits June 15, 2023 15:11

initial POC for Chain of Thought NER task

a175d36

ruff fix

820337b

Merge branch 'main' of ssh://github.com/explosion/spacy-llm into kab/…

ea261aa

…cot-ner

svlandeg added the feat/new New feature label Jun 21, 2023

vinbo8 reviewed Jun 23, 2023

spacy_llm/tasks/util/span.py (outdated, resolved)

Kabir Khan and others added 3 commits July 4, 2023 13:26

Merge branch 'develop' of ssh://github.com/explosion/spacy-llm into k…

6fa0b6a

…ab/cot-ner

update template

4c562d9

consolidate approach to work with main SpanTask

54a9eae

kabirkhan changed the base branch from main to develop July 5, 2023 23:07

Kabir Khan and others added 7 commits July 5, 2023 16:38

Merge branch 'develop' of ssh://github.com/explosion/spacy-llm into k…

8737f4a

…ab/cot-ner-integrate

fix tests around label consistency checks

7a1fbdb

Merge branch 'main' into kab/cot-ner

2921ca9

merge kab/cot-ner-integrate

2a0b1c8

fix edge cases

4772cb2

merge develop

d95eaac

update label consistency checks

57a71b3

vinbo8 marked this pull request as ready for review July 6, 2023 11:09

vinbo8 added 2 commits July 6, 2023 14:26

move label consistency checks

6335b3b

handle labels in span.py

c565256

rmitsch reviewed Jul 10, 2023

vinbo8 added 3 commits July 17, 2023 11:49

cleanup older NER

54f28d2

fixes

aa25bba

cleanup

f52feb5

kabirkhan commented Jul 17, 2023

spacy_llm/tasks/util/examples.py (outdated, resolved)

kabirkhan commented Jul 17, 2023

spacy_llm/ty.py (outdated, resolved)

kabirkhan commented Jul 17, 2023

spacy_llm/tasks/templates/ner.v3.jinja (resolved)

rmitsch reviewed Jul 18, 2023

spacy_llm/tasks/ner.py (outdated, resolved)

spacy_llm/tasks/span.py (outdated, resolved)

spacy_llm/tasks/span.py (outdated, resolved)

spacy_llm/tasks/templates/ner.v3.jinja (resolved)

svlandeg reviewed Jul 18, 2023

spacy_llm/tests/tasks/test_ner.py (outdated, resolved)

kabirkhan changed the title ~~[wip] idea for Chain of Thought prompting for span tasks~~ PromptNER based Chain-of-Thought prompting for span tasks Jul 21, 2023

update NER template with label_definitions + initial description, fix…

19d55cc

… parsing of SpanReason

rmitsch and others added 6 commits August 21, 2023 14:32

Update filterwarnings.

80fc6c8

Renamed examples to prompt_examples.

74a8486

Test Pydantic Mac OS Py 3.8 issue.

fa4e879

Incorporate feedback. Readd Pydantic REL example workaround.

5903a03

Readd NER Dolly usage example, removed TextCat Dolly one.

69f5111

rmitsch mentioned this pull request Aug 24, 2023

Add usage example for TextCat with Dolly 2 #271

Merged

3 tasks

Update NER Dolly usage example to use NER.v3.

4706aa7

svlandeg reviewed Aug 24, 2023

rmitsch and others added 6 commits August 24, 2023 16:31

Readd NER Dolly test. Revert to NER.v2. Refactor span extraction for …

134da76

…sSpan tasks.

Fix span reason extraction.

7d27d0e

Add working Paris-Paris-Paris example.

2979ffc

Uncomment example for NER prediction test.

398652a

Fix NER prediction test.

b4ad4d8

remove errors class entirely

66c4fae

svlandeg reviewed Aug 24, 2023

rmitsch and others added 11 commits August 25, 2023 08:33

Merge branch 'kab/cot-ner' of github.com:explosion/spacy-llm into kab…

88a2482

…/cot-ner

Update .github/workflows/test.yml

2664a47

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>

Update spacy_llm/tests/tasks/test_ner.py

ee7fcdd

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>

Remove overlap part in NER template.

b64bf90

Merge branch 'kab/cot-ner' of github.com:explosion/spacy-llm into kab…

f895343

…/cot-ner

Remove overlap path in NER and SpanCat templates.

c763689

Update spacy_llm/tasks/spancat/registry.py

d2f2a07

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>

Update spacy_llm/tasks/ner/registry.py

48bbd3c

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>

Merge branch 'develop' into kab/cot-ner

68c0a55

# Conflicts: # usage_examples/tests/test_usage_examples.py

Changed SpanCat prompt intro.

1c269e0

Add docstring info for description.

3a31530

rmitsch merged commit 8e2fd0f into develop Aug 25, 2023
11 checks passed

svlandeg deleted the kab/cot-ner branch August 25, 2023 09:07
