Skip to content

0.12.0

Compare
Choose a tag to compare
@ghisvail ghisvail released this 28 Nov 19:02
· 162 commits to main since this release

Changes

  • Document attributes are now supported (both for text and audio) and are added/accessed the same way as annotations attributes:

    doc.attrs.add(Attribute(label="type", value="report"))
    doc.attrs.get(label="type")

  • Brat Input and Output converters can now load and save UMLS CUIs stored in notes

  • the Trainer now saves both the last checkpoint and the best checkpoint, instead of only the last checkpoint

  • medkit is now compatible with the latest (0.9) EDS-NLP

  • most operations loading models from HuggingFace can now receive an authentication token (useful to access private repositories)

  • new from_dir()/from_file() helper methods added to TextDocument/AudioDocument

  • new text classification, audio diarization and audio transcription metrics

  • support for remapping entity labels in Seq2SeqEvaluator (useful when predicted and reference label do not match exactly)

  • custom attributes (DateAttribute, UMLSNormAttribute) don't have None as a value anymore

  • easier initialization of PASpeakerDetector

  • many bugfixes