Skip to content
generate strings and compute statsistics fro subcomponents
Jupyter Notebook Python
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
README.md
add-book-stats.py
compute-pseudowords.py
compute-pw6.py
compute-sublexstats-all6strings.py
compute-sublexstats-english_french.py
compute-sublexstats-for-Vinckiers-experiment.ipynb
dico.py
english-blp-reduced.tsv
english-freqbooks.csv
estimate-sublexstats-english-french.py
explore_sublex_stats_101218.ipynb
french-freqbooks.csv
french-lexique-reduced.tsv
make.py
merge_stats_with_lexique_and_blp.py
old_vinckier_stimuli.pkl
reduce-lexicons.py
sublexstats.py
super_big_stimuli_selector.py
vinckier_stats_books.csv
vinckier_stats_films.csv

README.md

sublexical statistics for French and English

% Time-stamp: <2019-03-15 17:03:47 christophe@pallier.org>

  • reduce-lexicons.py: extracts lexical frequencies informations from French-Lexique382.csv and British-Lexicon-Project.tsv into the four files english-freqbooks.csv, english-freqfilms.csv, french-freqbooks.csv, french-freqfilms.csv

  • sublexstats.py: module providing a class (sublexstats) gathering sublexical statistics (letter, bigram, trigram and trigram frequencies).

  • estimate-sublexstats-english-french.py: estimates the frequencies of subcomponents from French and English films and books databases, and creates the four files english-freqbooks.sublexstats french-freqbooks.sublexstats english-freqfilms.sublexstats french-freqfilms.sublexstats

  • compute-sublexstats-english_french.py: computes the frequencies of letters, bigrams, trigrams, quadrigrams in French and in English. Produces french-stats.csv english-stats.csv

  • compute-sublexstats-all6strings.py: generates all possible 6-letter strings and computes their sublexical statistics. Outputs: [A-Z]-strings.csv files.

  • compute-pseudowords.py: randomly generates 6-letter strings and computes their sublexical statistics.

  • add-book-stats.py: add fields with sublexical stats to selected*.csv files

  • super_big_stimuli_selector.py

You can’t perform that action at this time.