Parsing_Speech

Dependency Annotation and Parsing for Spontaneous Speech

Convert original CHILDES xml file to .conllu file
1. python3 code/ori_xml2conll.py --input Path_To_Corpus --output Path_To_Output --section Section (e.g. English-NA or English-UK)
2. eve.py tailored specifically for the Eve corpus from the Brown corpus
Semi-automatic conversion from CHILDES annotation to UD annotation
1. python3 code/converter.py --input Input_Path --output Output_Path
2. in data/Eve/eve_annotated
Manual annotation
1. in data/Eve/eve_annotated
Significance testing of parsing results
1. python3 code/bootrap.py --gold Gold_Annotation_File --pred Predicted_File --n Number_Of_Iterations (e.g. 10000) --c Sample_Size (e.g. number of utterances in the file)
Descriptive statistics of child information from CHILDES
1. python3 code/descriptive_statistics.py --input Input_Path --output .csv_file
2. English: in results/en_descriptive.csv
3. Chinese: in results/zh_descriptive.csv
Models

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
code		code
data		data
results		results
.DS_Store		.DS_Store
README.md		README.md

Provide feedback