GitHub

Weak Annotators (NER)[WIP]

Experiments with weak annotators for NER using different models and methods including LLMs. Requires some GPU memory :) (for 7B models ~ 8-12GB).

Installation

pip install weak-annotators

or from source:

pip install git+https://github.com/imvladikon/weak_annotators.git

Usage

Using UniversalNER extractor:

from weak_annotators import UniversalNerExtractor

text = """
The patient was prescribed 100 mg of aspirin daily for 3 days.
""".strip()
labels = ["DRUG", "DISEASE", "SYMPTOM", "DURATION"]
extractor = UniversalNerExtractor(labels=labels)
print(extractor(text))
# [Span(start=37, end=44, text='aspirin', label='DRUG'), Span(start=55, end=61, text='3 days', label='DURATION')]

It returns a list of Spans but if pass return_dict=True it will return a list of dictionaries:

print(extractor(text, return_dict=True))
# [{'start': 37, 'end': 44, 'text': 'aspirin', 'label': 'DRUG'}, {'start': 55, 'end': 61, 'text': '3 days', 'label': 'DURATION'}]

Using medalpaca LLM:

It requires labels descriptions:

from weak_annotators import MedAlpacaExtractor

labels = ["DRUG", "DISEASE", "SYMPTOM", "DURATION"]
labels_descriptions = {
    "DRUG": "Drug or medication",
    "DISEASE": "Any disease, syndrome, or medical condition",
    "SYMPTOM": "Any symptom or sign of a disease or medical condition",
    "DURATION": "Any period of time",
}
extractor = MedAlpacaExtractor(labels=labels, labels_description=labels_descriptions)

text = """
The patient was prescribed 100 mg of aspirin daily for 3 days.
""".strip()

annotations = extractor(text)
print(annotations)

Optionally, it's possible to pass prompt_template to MedAlpacaExtractor.

prompt_template = "Extract entities of type {} from the following text:"
extractor = MedAlpacaExtractor(labels=labels, labels_description=labels_descriptions, prompt_template=prompt_template)

Using flair (TARS extractor):

from weak_annotators import FlairExtractor

labels = ["DRUG", "DISEASE", "SYMPTOM", "DURATION"]
extractor = FlairExtractor(labels=labels)
print(extractor(text))

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
weak_annotators		weak_annotators
.gitignore		.gitignore
README.md		README.md
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

weak_annotators

weak_annotators

.gitignore

.gitignore

README.md

README.md

requirements-dev.txt

requirements-dev.txt

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

Weak Annotators (NER)[WIP]

Installation

Usage

About

Releases

Packages

Languages

imvladikon/weak_annotators

Folders and files

Latest commit

History

Repository files navigation

Weak Annotators (NER)[WIP]

Installation

Usage

About

Resources

Stars

Watchers

Forks

Languages