Skip to content

Version 0.7.3

Compare
Choose a tag to compare
@kermitt2 kermitt2 released this 13 May 22:07
· 241 commits to master since this release

Added

  • Support for JDK beyond 1.11, tested up to Java 17, thanks to removal of dynamic native library loading after the start of the JVM
  • Incremental training (all models and ML engines), add this option in training command line and training web service (#971)
  • Systematic benchmarking on two new sets: PLOS (1000 artilces) and eLife (984 articles)
  • All end-to-end evaluation datasets are now available from the same place: https://zenodo.org/record/7708580
  • Option to output coordinates in notes and figure/table captions
  • Support for Mac ARM architecture (#975)
  • Play With Docker documentation (#962)

Changed

  • Update to DeLFT version 0.3.3
  • Demo now hosted as HuggingFace space
  • Additional training data, in particular for citation, reference-segmenter, segmentation, header, etc.
  • Update Deep Learning models (and some of the CRF)
  • The standard analyzer for sub-lexical tokenization is available in grobid-core, and used for the citation model (in particular for improving CJK references) (#990)
  • Update evaluations

Fixed

  • Correct wrong content type in doc for processCitation web service
  • Sentence segmentation applied to notes (#995)
  • Other minor fixes