-
Notifications
You must be signed in to change notification settings - Fork 35
chore: bump version to 0.15.0 #355
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
6f6310d to
dbbfd69
Compare
Coverage Report
Files without new missing coverage
273 files skipped due to complete coverage. Coverage success: total of 98.06% is above 98.06% 🎉 |
|



Changelog
Added
edsnlp.data.read_parquetnow accept awork_unit="fragment"option to split tasks between workers by parquet fragment instead of row. When this is enabled, workers do not read every fragment while skipping 1 in n rows, but read all rows of 1/n fragments, which should be faster.edsnlp.trainscriptoutput_model_dir), and whether to save the model or not (save_model)logger=False)edsnlp.data.read_conlland with a specificeds.conll_dict2docconvertereds.biaffine_dep_parser) component and metricseds.extractive_qacomponent to perform extractive question answering using questions as prompts to tag entities instead of a list of predefined labels as ineds.ner_crf.Fixed
join_threadmissing attribute inSimpleQueuewhen cleaning a multiprocessing executorcls_token_idandsep_token_id(we now also look for these tokens in thespecial_tokens_mapandvocabmappings)random.RandomState()) when shuffling in data readers : this is important forsub_batch_sizeargument ofTrainingData.