PARMEX 2022

PARAPHRASE IDENTIFICATION IN MEXICAN SPANISH

IMPORTANT

Due to a problem we found in the script we used to separate the paraphrased sentences into training, validation, and testing all datasets were updated, so you will need to train again the models and then perform the predictions on the new development and test sets.

Check our codalab page HERE.

Task Description

In this task participants will be given pairs of sentences and the aim is to indicate whether each pair captures a paraphrase relationship, i.e. to classify them as paraphrases (P) or not paraphrases (NP).

Full task details here.

Corpus

The distribution of the corpus is shown in the following table:

Corpus	Total sentence-pairs	Paraphrase sentence-pairs	Non-paraphrase sentence-pairs
Training	7,382	1,282	6,100
Validation	97	20	77
Test	2,819	542	2,277
Total	10,298	1844	8454

The labels for each sentence-pair are the following:

1: Paraphrase (P)
0: Non-paraphrase (NP)

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
README.md		README.md
new_parmex_test.csv		new_parmex_test.csv
new_parmex_train.csv		new_parmex_train.csv
new_parmex_val.csv		new_parmex_val.csv
output_format_sample.txt		output_format_sample.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PARMEX 2022

PARAPHRASE IDENTIFICATION IN MEXICAN SPANISH

IMPORTANT

Task Description

Corpus

About

Releases

Packages

GIL-UNAM/PARMEX_2022

Folders and files

Latest commit

History

Repository files navigation

PARMEX 2022

PARAPHRASE IDENTIFICATION IN MEXICAN SPANISH

IMPORTANT

Task Description

Corpus

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages