Skip to content

Releases: LanguageMachines/PICCL

v0.9.5

11 Dec 22:05
Compare
Choose a tag to compare

Added a string linking stage to ticcl, this adds extra markup information (t-str/t-correction) using the foliatextcontent tool, this is in turn needed by FLAT for proper visualisation.

v0.9.4

01 Oct 16:24
Compare
Choose a tag to compare

Previous release was premature and bugged; this fixes it.

v0.9.3

01 Oct 14:48
Compare
Choose a tag to compare

Minor update: Added an --outputclass parameter for ticcl.nf to choose the output text class and provide extra flexibility. Set either that or --inputclass.

v0.9.2

01 Oct 08:55
Compare
Choose a tag to compare
  • added a clearer error message with explanation in case the indexNT file is empty (related to LanguageMachines/lexiconenrichment#1)
  • removed explicit flat url (let LaMachine handle it)
  • minor README update

v0.9.1

19 Aug 14:18
Compare
Choose a tag to compare
  • publish more intermediate output #58
  • added a --nofoliacorrect output option to skip the final foliacorrect step

v0.9.0

15 Apr 15:35
Compare
Choose a tag to compare

This PICCL release builds upon the long awaited TICCLtools v0.7:

Ticcl:

  • Fixed chaining
  • Implemented chainclean and made it optional
  • Changed default separator to underscore
  • TICCL-rank invocation changed
  • changed skipcols
  • added --low --high and --ngrams parameter
  • added alphabet file to TICCL-unk

General:

  • Migrated to nextflow process selectors, solved deprecation warnings (#57)
  • verify output files have non-zero size
  • Added schematic figures to document the architecture of the pipelines

Webservice:

  • Added inputtemplate for custom lexicon #56

v0.8.2

25 Aug 21:46
Compare
Choose a tag to compare

(Previous release was premature and didn't pass tests, fixed in this one)

v0.8.1

25 Aug 19:36
Compare
Choose a tag to compare

Intermediate bugfix release (#53 is still open):

  • attempted fix for FoLiA-correct invocation (#50 #52)

v0.8.0

14 Jun 21:01
Compare
Choose a tag to compare
  • Several workflows that used to be part of PICCL have been split off into separate projects now, this concerns:
    • The nederlab pipeline for enrichment of historical dutch, they are now in https://github.com/proycon/nederlab-pipeline
    • The frog, ucto and folia validation pipelines https://github.com/proycon/aNtiLoPe, PICCL depends on this new aNtiLoPe project now
    • This is an organisational change in favour of more modularity, clarity and better maintainability, it does not affect the functionality or installation of PICCL!
  • Allow unsetting flaturl in external yaml configuration to disable flat viewers (proycon/clam#75)
  • Propagate existing input textclass option to PICCL and assume a default of 'current' (rather than 'OCR') if OCR is skipped (#48) and change TICCL inputclass default to 'current' instead of 'OCR' when dealing with FoLiA input
  • Delete zero byte input files prior to FoLiA-correct (artefact of earlier patchy error ignore strategy) #49

v0.7.6

05 Mar 11:04
Compare
Choose a tag to compare
  • Another fix for plain text input and no ocr AND no ticcl scenario (addressed in #43)
  • Clean up in the wrapper script (it's becoming too convoluted)