Skip to content
Permalink
master
Switch branches/tags
Go to file
 
 
Cannot retrieve contributors at this time

Summary

A Universal Dependencies corpus for spoken French.

Introduction

The corpus was converted automatically from the Rhapsodie treebank with manual corrections.

Xpos and features (which are not available in v2.2 of UD_French-Spoken) will be added to future versions of this treebank as they are encoded in the Rhapsodie treebank.

Structure

  • fr_rhapsodie-ud-train.conllu 1167 sentences 15172 tokens
  • fr_rhapsodie-ud-dev.conllu 909 sentences 10062 tokens
  • fr_rhapsodie-ud-test.conllu 730 sentences 9991 tokens
  • total 2806 sentences 35225 tokens

Changelog

  • 2021-11-15 v2.9
    • Repository renamed from UD_French-Spoken to UD_French-Rhapsodie.
  • 2020-11-15 v2.7
    • Morphology added
  • 2018-04-15 v2.2
    • Initial release

=== Machine-readable metadata (DO NOT REMOVE!) ================================ Data available since: UD v2.2 License: CC BY-SA 4.0 Includes text: yes Genre: spoken Lemmas: converted from manual UPOS: converted with corrections XPOS: not available Features: not available Relations: converted with corrections Contributors: Gerdes, Kim; Kahane, Sylvain; Nakhlé, Mariam; Yan, Chunxiao; Etienne, Aline; Courtin, Marine Contributing: here Contact: kim@gerdes.fr