Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.
-
Updated
Aug 18, 2022 - HTML
Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.
中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.
近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言
Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)
Un corpus de chansons de geste
The University of Pittsburgh English Language Institute Corpus (PELIC) dataset
A Corpus of the Kurdish Folkloric Lyrics
Materiales para el curso de verano, «Del corpus a la interpretación: Estilometría con R», Burgos, 2021
A Text / Speech Summarizer
A text analysis project on collection of script dialogue between characters for the episode 4,5,6 of star wars
Arabic Stories Corpus
HUMOR dataset for humor research
data, metadata, tools, and LDA experiments on a corpus of Sanskrit philosophy texts
Article title, authors, date and body extraction dataset.
Predictive texting is a data processed tool that makes it quicker and easier to write text by suggesting words as you type. The tool will read the text inside the text input area and predict the three most suitable options. After the prediction is made, the options are displayed as buttons. The user can press the button to insert text, the tool …
Add a description, image, and links to the corpus topic page so that developers can more easily learn about it.
To associate your repository with the corpus topic, visit your repo's landing page and select "manage topics."