This repository contains code and data for the paper "English Negative Constructions and Communicative Functions in Child Language", to appear in CogSci 2021, as well as an ongoing journal article extension.
In particular, the study investigates negative syntactic constructions and the communicative functions they play in early child language of English (age ranges from 12 to 72 months).
python3 COGSCI2021/diaparse/chides2csv.py --path OUTPUT_PATH
python3 COGSCI2021/diaparse/diaparser.py --input CSV_DIRECTORY --output CONLL_DIRECTORY
python3 COGSCI2021/diaparse/construction.py --input CONLL_DIRECTORY --output FILENAME --domain DOMAIN/FUNCTION (--desp)
-desp
is for whether you'd like to generate descriptive file (e.g. number of uterances at each age of the childre) for both child and parent speech- For
--domain
, set it to:emotion
for emotionmotor
for prohibition and inabilitylearning
for labelingepistemic
for epistemic negationpercetion
for non-existence
- Descriptive files for child and parent speech are in
COGSCI2021/data/
- Data for each domain/function of negation is in
COGSCI2021/data/
Updated codes for journal article extension, including both the syntactic and the discourse level as well as individual variation
python3 COGSCI2021/diaparse/chides2csv.py --path OUTPUT_PATH
python3 code/parse.py --output CONLL_DIRECTORY
python3 code/discourse.py --input CONLL_DIRECTORY --output FILENAME --domain DOMAIN/FUNCTION --desp
data/child_descriptive.txt
data/parent_descriptive.txt
The FILENAME
and DOMAIN/FUNCTION
can be anything, and one only needs to turn on the --desp
flag once
python3 code/discourse.py --input CONLL_DIRECTORY --output FILENAME --domain DOMAIN/FUNCTION --level d/s
- For
--domain
, set it to:emotion
for emotionmotor
for prohibition and inabilitylearning
for labelingepistemic
for epistemic negationpercetion
for non-existence - For
--level
,d
is for discourse level, ands
is for sentence level
data/DOMAIN.txt
, e.g.data/emotion.txt
for the sentence leveldata/DOMAIN_discourse.txt
, e.g.data/emotion_discourse.txt
for the discourse level
For simplicity, one could modify the individuals
variable in the code/discourse.py
file to decide which child's / children's production they would like to analyze. Then run:
python3 code/discourse --input CONLL_DIRECTORY --output FILENAME --domain DOMAIN/FUNCTION --individual --level s
This would generate descriptive statistics for each individual child of interest; the data for each child within a particular domain would already be generated by the previous step.
data/individual_child_descriptive.txt
data/individual_parent_descriptive.txt