Skip to content
This repository has been archived by the owner on Nov 21, 2023. It is now read-only.

Releases: TeamHeka/medkit

Release 0.9.2

25 Sep 10:09
Compare
Choose a tag to compare

Changes since 0.9.1

  • fix doc generation

Release 0.9.1

25 Sep 09:51
Compare
Choose a tag to compare

Release 0.9.1 is available !

Changelog from 0.8.1

[Added/Changed]

  • [text] attrs_to_copy for segmentation operations
  • [text] store section names in Attribute rather than metadata
  • Trainer logging changes
  • [text] change default params for RegexpMatcherRule
  • [text] add default rules for rule-based operations
  • [text][preprocessing] Change Normalizer name
  • Add e3c corpus support

[Fixed]

  • [text] fix syntagma tokenizer rules
  • [text] Harmonize kb_id field between EntityNormAttribute and other places
  • [text] fix duplicate finder
  • [umls-coder-normalizer] Don't crash on empty list of entities

[Config]

  • Drop python 3.7 for bumping to torch 2.0
  • [audio] remove pyannote-audio dependency from medkit package

[Docs]

  • Add doc for load_text_documents
  • [readme] add contact and link to CONTRIBUTING.md
  • update medkit logo

You will find the documentation at: https://medkit.readthedocs.io/en/0.9.1

Finally, to install medkit, the package is available on PyPI under the name of medkit-lib

We strongly recommend to create virtual environments (using venv or conda) before installing medkit.

# Install medkit with only required dependencies
pip install medkit-lib

To install with extra dependencies (e.g., hf-translator, hf-entity-matcher, hf-transcriber, quick-umls, resampler, rush-sentence-tokenizer, umls-coder-normalizer, webrtc-voice-detector, ...):

# Install medkit with all extra optional dependencies
pip install 'medkit-lib[optional]'

Release 0.8.1

09 Aug 12:27
Compare
Choose a tag to compare

Changes since 0.8.0

  • fix doc generation

Release 0.8.0

09 Aug 08:41
Compare
Choose a tag to compare

Release 0.8.0 is available !

Changelog from 0.7.1

[Added]

  • add simstring matcher operation
  • add mtsamples loader
  • add doccano converter
  • add edsnlp date matcher operation
  • add post-processing operation to filter overlapping entities

[Fixed]

  • change rule-based operation strategy for special characters
  • [regexpmatcher] Change label to category (like semgroup)
  • [medkit-json] don't force ascii chars only when exporting
  • fix "Too much padding in HFEntityMatcherTrainable"

[Docs]

  • add documentation for custom operation
  • fix links in documentation
  • fix inheritance in generated docs

You will find the documentation at: https://medkit.readthedocs.io/en/0.8.1

Finally, to install medkit, the package is available on PyPI under the name of medkit-lib

We strongly recommend to create virtual environments (using venv or conda) before installing medkit.

# Install medkit with only required dependencies
pip install medkit-lib

To install with extra dependencies (e.g., hf-translator, hf-entity-matcher, hf-transcriber, quick-umls, resampler, rush-sentence-tokenizer, umls-coder-normalizer, webrtc-voice-detector, ...):

# Install medkit with all extra optional dependencies
pip install 'medkit-lib[optional]'

Release 0.7.1

04 Jul 13:41
Compare
Choose a tag to compare

Release 0.7.1 is available !

Changelog from 0.7.1

[Fixed]

  • Fix umls coder normalizer gpu support
  • Fix message for provenance error
  • Fix missing save rules in rule-based text operations

You will find the documentation at: https://medkit.readthedocs.io/en/0.7.1

Finally, to install medkit, the package is available on PyPI under the name of medkit-lib

We strongly recommend to create virtual environments (using venv or conda) before installing medkit.

# Install medkit with only required dependencies
pip install medkit-lib

To install with extra dependencies (e.g., hf-translator, hf-entity-matcher, hf-transcriber, quick-umls, resampler, rush-sentence-tokenizer, umls-coder-normalizer, webrtc-voice-detector, ...):

# Install medkit with all extra optional dependencies
pip install 'medkit-lib[optional]'

Release 0.7.0

26 Jun 16:06
Compare
Choose a tag to compare

Release 0.7.0 is available !

Changelog from 0.6.0

[Added]

  • Add duplicate finder
  • Add char replacer
  • Add alignment utils

[Fixed]

  • Fix encoding for medkit-json-io

[Docs]

  • Add doc for training
  • Update docs

You will find the documentation at: https://medkit.readthedocs.io/en/0.7.0

Finally, to install medkit, the package is available on PyPI under the name of medkit-lib

We strongly recommend to create virtual environments (using venv or conda) before installing medkit.

# Install medkit with only required dependencies
pip install medkit-lib

To install with extra dependencies (e.g., hf-translator, hf-entity-matcher, hf-transcriber, quick-umls, resampler, rush-sentence-tokenizer, umls-coder-normalizer, webrtc-voice-detector, ...):

# Install medkit with all extra optional dependencies
pip install 'medkit-lib[optional]'

Release 0.6.0

24 May 11:42
Compare
Choose a tag to compare

Release 0.6.0 is available !

Changelog from 0.5.0

[Added]

  • Add NER metrics : seqeval

[Fixed]

  • Fix label mapping quickumls
  • Fix extra characters in segmentation operation : add strip functions
  • Fix iamsystem : attrs_to_copy #12

[Docs]

  • Add examples for segmentation operations, in particular rules files
  • Add example for finetuning of HF model
  • Move docs to readthedocs

[Other stuff]

  • Remove develop branch

You will find the documentation at: https://medkit.readthedocs.io/en/0.6.0

Finally, to install medkit, the package is available on PyPI under the name of medkit-lib

We strongly recommend to create virtual environments (using venv or conda) before installing medkit.

# Install medkit with only required dependencies
pip install medkit-lib

To install with extra dependencies (e.g., hf-translator, hf-entity-matcher, hf-transcriber, quick-umls, resampler, rush-sentence-tokenizer, umls-coder-normalizer, webrtc-voice-detector, ...):

# Install medkit with all extra optional dependencies
pip install 'medkit-lib[optional]'

Release 0.5.0

03 May 06:51
Compare
Choose a tag to compare

Release 0.5.0 is available !

Changelog from 0.4.1

[Added]

  • text: add iam system #111
  • training: add finetuning of HF model #110
  • text: add edsnlp

[Fixed]

  • Skip empty sentences in SentenceTokenizer #119
  • Fix duckling matcher #11
  • Fix minor issues #75 !129
  • Fix encoding #117
  • Fix docs : tuto !138
  • Fix spans example #127

You will find the documentation at: https://heka.gitlabpages.inria.fr/medkit.

Finally, to install medkit 0.5.0, the package is available on PyPI under the name of medkit-lib

We strongly recommend to create virtual environments (using venv or conda) before installing medkit.

# Install medkit with only required dependencies
python -m pip install medkit-lib

To install with extra dependencies (e.g., hf-translator, hf-entity-matcher, hf-transcriber, quick-umls, resampler, rush-sentence-tokenizer, umls-coder-normalizer, webrtc-voice-detector):

# Install medkit with all extra optional dependencies
python -m pip install 'medkit-lib[optional]'

Release 0.4.1

28 Mar 15:02
Compare
Choose a tag to compare

Release 0.4.1 is available !

Changelog from 0.3.1

[Added]

  • audio: add io converter for audio annotation formats .rttm and .str #85
  • audio: add speechbrain transcriber !102
  • text: add attribute propagation using interval tree !90
  • audio: add audio diarization/speaker detection with pyannote !105
  • docs: add io and text examples !77
  • text: add entity normalization attributes !107
  • text: add custom text operation #108
  • training: introduce training architecture in medkit #109 !22
  • io: add new medkit json format !123

[Fixed]

  • Fix SyntacticRelationExtractor !92
  • Fix HF translator issues !66
  • Fix BratOutputConverter !89
  • Fix miscs !28
  • Fix HFEntity matcher #114
  • Fix RegexpMatcher !128 (Github issue #10)

[Changed]

  • Refactor anotation and operation ids !108
  • Make medkit_doc_to_displacy support Segment #106
  • Add names to Operations and their descriptions #71
  • Change label for QuickUMLSMatcher #65
  • Make text and audio mandatory !117
  • Refactor Collection class !116
  • Rework medkit data model !119

[Development]

  • Commit poetry.lock #91
  • Add github mirroring
  • Simplify deps testing !112
  • Add API generation for docs !123

You will find the documentation at: https://heka.gitlabpages.inria.fr/medkit.

Finally, to install medkit 0.4.1, the package is available on PyPI under the name of medkit-lib

We strongly recommend to create virtual environments (using venv or conda) before installing medkit.

# Install medkit with only required dependencies
python -m pip install medkit-lib

To install with extra dependencies (e.g., hf-translator, hf-entity-matcher, hf-transcriber, quick-umls, resampler, rush-sentence-tokenizer, umls-coder-normalizer, webrtc-voice-detector):

# Install medkit with all extra optional dependencies
python -m pip install medkit-lib[optional]

Release 0.3.1

06 Oct 15:46
Compare
Choose a tag to compare
Release 0.3.1 Pre-release
Pre-release

This release 0.3.1 fixes a pip install issue found for release 0.3.0.

You will find the documentation at: https://heka.gitlabpages.inria.fr/medkit.
Finally, to install medkit 0.3.1, download the package in release assets and install the package.
For example:

# Install medkit with only required dependencies
python -m pip install 'medkit-0.3.1.tar.gz'

To install with extra dependencies (e.g., hf-translator, hf-entity-matcher, hf-transcriber, quick-umls, resampler, rush-sentence-tokenizer, umls-coder-normalizer, webrtc-voice-detector):

# Install medkit with all extra optional dependencies
python -m pip install 'medkit-0.3.1.tar.gz[optional]'

If you have some issues during installation, it is possible that you may need some additional packages:

sudo apt-get install -y gcc g++ libsndfile1 graphviz python3 python3-dev