scrosseye/ELLIPSE-Corpus

ELLIPSE-Corpus

The English Language Learner Insight, Proficiency and Skills Evaluation (ELLIPSE) Corpus

The English Language Learner Insight, Proficiency and Skills Evaluation (ELLIPSE) Corpus is a freely available corpus of ~6,500 writing samples from English language learners (ELLs). Each sample is scored for overall (holistic) language proficiency and for analytic proficiency in six areas: cohesion, syntax, vocabulary, phraseology, grammar, and conventions. The corpus also provides individual and demographic information for the writers, including economic status, gender, grade level (8-12), and race/ethnicity. Language proficiency scores are provided for individual writers, and the corpus was developed to advance research in corpus-based and NLP approaches to assessing both overall proficiency and its more fine-grained features.

This repository contains the corpus and the scoring rubric.

The data is provided under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license (https://creativecommons.org/licenses/by-nc-sa/4.0/deed.en).
