Closed
Conversation
* Create dataset loader for PsyTAR * Fix config instance when only data_dir passed * Style and remove unused dependency * fix: updates psytarclass to inherit from bb Co-authored-by: Natasha Seelam <nseelam1@gmail.com>
* Add support for n2c2 2010 * Add support for n2c2 2010 * Format code * Import BigBioConfig from utils; cosmetic refactor Co-authored-by: Ayush Singh <singh.ay@northeastern.edu>
| { | ||
| "text_1": datasets.Value("string"), | ||
| "text_2": datasets.Value("string"), | ||
| "label": datasets.Value("string"), |
Collaborator
There was a problem hiding this comment.
Change to float in the source schema?
* update README - add images for task assigment
* add pubmed_qa
* update pubmed_qa.py
* update question type to yesnomaybe, use [maybe, LONG_ANSWER] as the answer for PQA-Unlabeled
* fix: update to new qa schema
* remove unused files
* add 10-fold data for pqal (subset_id pqal_fold{k}_[source|bigbio]), remove LONG_ANSWER, update question type to yesno
* add 10-fold data for pqal (subset_id pqal_fold{k}_[source|bigbio]), remove LONG_ANSWER, update question type to yesno
* update pubmed_qa.py - add description for each dataset subset, change naming for the subset_id following bigbio convention, update None to BigBioValues.NULL on the bigbio schema
* format, remove print, add TODO
format, remove print, add TODO
Co-authored-by: Natasha Seelam <nseelam1@gmail.com>
Co-authored-by: Gabriel Altay <gabriel.altay@gmail.com>
* add back images that were removed in bigscience-workshop#357 * oops! rename images
* run tests by config name, cleaned up a bit * referenced -> existing referenced -> existing
rename n2c2_2006 -> n2c2_2006_smokers (bigscience-workshop#478)
* n2c2 2006 de-identification task * remove name == main block remove name == main block Co-authored-by: Gabriel Altay <gabriel.altay@gmail.com>
* Initial NLM-WSD commit * Further development * Further development * Reformat * add custom local config add custom local config * import dataclass import dataclass * fix custom config typos fix custom config typos Co-authored-by: Gabriel Altay <gabriel.altay@gmail.com>
Contributor
Author
|
Hi @debajyotidatta I've addressed your comments in a new PR: #479 Could you please review the changes there and I'll go ahead and close this one? |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Closes #161
Checkbox
biodatasets/my_dataset/my_dataset.py(please use only lowercase and underscore for dataset naming)._CITATION,_DATASETNAME,_DESCRIPTION,_HOMEPAGE,_LICENSE,_URLs,_SUPPORTED_TASKS,_SOURCE_VERSION, and_BIGBIO_VERSIONvariables.NOTE: _LICENSE is "Unknown" in this case
_info(),_split_generators()and_generate_examples()in dataloader script.BUILDER_CONFIGSclass attribute is a list with at least oneBigBioConfigfor the source schema and one for a bigbio schema.datasets.load_datasetfunction.python -m tests.test_bigbio biodatasets/my_dataset/my_dataset.py.