This repository provides instructions and data samples for submissions to the SimpleText Task4 @ SOTA? task hosted on Codalab. All test set submissions for the shared task must be submitted through Codalab here.
-
Evaluation Phase 1 (Few-shot Testing)
- Start: April 23, 2024
- End: April 28, 2024
-
Evaluation Phase 2 (Zero-shot Testing)
- Start: April 29, 2024
- End: May 3, 2024
More information on important dates is available on the SimpleText SOTA? website.
In this repository, we have included an example from the validation dataset available here. You can submit results from your systems on the validation dataset before April 23rd.
Note that once the evaluation phase begins, the test datasets will automatically switch to the relevant dataset for that phase.
During the practice phase we have uploaded the validation dataset annotations on Codalab. Participants can prepare for the testing phase by making a few practice submissions with the validation dataset ahead of time to ensure that your outputs are processed and results are obtained. The following two folders will be discussed:
-
blind-validation-dataset
- This folder contains sub-directories of articles. Each sub-directory has an identifier and contains the LaTeX content in
.tex
format. - During the testing phases, participants will receive unique articles collection as test dataset releases for each phase, available as input to the systems. Similar to the dataset in this folder there will be no
annotations.json
files released.
- This folder contains sub-directories of articles. Each sub-directory has an identifier and contains the LaTeX content in
-
sample-submission
- This folder contains the expected output format from participant systems given the input data. The output must preserve the original subdirectory names from the input dataset to uniquely identify each article during evaluation. Each subdirectory should include a file named
annotations.json
, which either contains SOTA annotations or the string "unanswerable". - Create a
*.zip
file of all output subdirectories as shown insample-submission.zip
. The naming of the zipped file is not relevant. On Codalab, assuming you are registered to the SOTA? task, navigate to theParticipate
tab and selectSubmit/View Results
to submit your zip file.
- This folder contains the expected output format from participant systems given the input data. The output must preserve the original subdirectory names from the input dataset to uniquely identify each article during evaluation. Each subdirectory should include a file named
Note: Submitting the sample-submission.zip
provided in this repository, during the practice phase until the end of April 22nd, as it is will result in an evaluation score of 100%, as this file is an exact replica of the validation set, merely copying over the annotations.json
files.
The official SOTA? evaluation script is publicly released here.
To host your own competitions, you can follow the step-by-step guide provided by the developers at Codalab. The competition configuration file (.yaml
) for SOTA? is publicly available in the online-competition-site
subfolder within this repository.