analysis

R scripts

voxclamantis_vowels_epitran_wikipron.R: analysis script for vowel midpoint F1 and F2

This script takes as input the Epitran and WikiPron midpoint F1 and F2 files, the per-utt-mcd scores files, reading_info.csv (inventory), and returns counts of tokens, families, and languages presented in the paper, correlation tables, and the correlation scatterplot. It optionally saves text files of the midpoint F1 and F2 following outlier exclusion that serves as input to the Python dispersion analysis.

voxclamantis_sibilants_epitran_wikipron.R: analysis script for sibilant mid-frequency peak

This script takes as input the Epitran and WikiPron sibilant info.csv and sibilant.csv files, the per-utt-mcd scores files, reading_info.csv (inventory), and returns counts of tokens, families, and languages presented in the paper, the correlation of mean mid-frequency peak /s/ and /z/, and the correlation scatterplot.

Python scripts

The python scripts in this folder have a number of dependencies detailed in environment.yml file. To install dependencies using conda run:

$ conda env create -f environment.yml

And to activate the environment run:

$ conda activate wild

extract_info.py: preprocessing script for vowel dispersion analysis

This script takes as input the folder where epiwiki dispersion files are (with file names ending in formants_mid_fin.csv) and it outputs a preprocessed tsv file with per language--vowel dispersion entropies. Run it with command:

$ python extract_info.py --src-path <src-path> --tgt-file <tgt-file>

In this command, <src-path> is the data source path, while <tgt-file> is the extracted info output filename. For example:

$ python extract_info.py --src-path data/vowels/midpoint_f1_f2/dispersion_input_epiwiki/ --tgt-file data/vowels/midpoint_f1_f2/preprocessed.tsv

analyse_dispersion.py: analysis script for dispersion entropy vs number of vowel categories correlations

This script takes as input the preprocessed file generated by extract_info.py and prints in the terminal Pearson and Spearman correlations (with their respective p-values). Run it with command:

$ python analyse_dispersion.py --info-file <info-file> --formants-type <formants-type>

where <formants-type> is either erb or hz. And <info-file> is the extract_info.py output. For example:

$ python analyse_dispersion.py --info-file data/vowels/midpoint_f1_f2/preprocessed.tsv --formants-type erb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

analysis

R scripts

voxclamantis_vowels_epitran_wikipron.R: analysis script for vowel midpoint F1 and F2

voxclamantis_sibilants_epitran_wikipron.R: analysis script for sibilant mid-frequency peak

Python scripts

extract_info.py: preprocessing script for vowel dispersion analysis

analyse_dispersion.py: analysis script for dispersion entropy vs number of vowel categories correlations

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
README.md		README.md
analyse_dispersion.py		analyse_dispersion.py
environment.yml		environment.yml
extract_info.py		extract_info.py
voxclamantis_sibilants_epitran_wikipron.R		voxclamantis_sibilants_epitran_wikipron.R
voxclamantis_vowels_epitran_wikipron.R		voxclamantis_vowels_epitran_wikipron.R

VoxClamantisProject/analysis

Folders and files

Latest commit

History

Repository files navigation

analysis

R scripts

voxclamantis_vowels_epitran_wikipron.R: analysis script for vowel midpoint F1 and F2

voxclamantis_sibilants_epitran_wikipron.R: analysis script for sibilant mid-frequency peak

Python scripts

extract_info.py: preprocessing script for vowel dispersion analysis

analyse_dispersion.py: analysis script for dispersion entropy vs number of vowel categories correlations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages