Skip to content

jixing-li/lpp_data

Repository files navigation

Scripts for the data paper "Le Petit Prince: A multilingual fMRI corpus using ecological stimuli".

  1. compute_fd.py:
  • Compute the tsnr for the raw and preprocessed fMRI data.
  1. compute_tsnr.py:
  • Compute the temporal signal-to-noise ratio (tSNR) for the raw and preprocessed fMRI data.
  1. compute_isc.py:
  • Compute inter-subject correlation (ISC) for each voxel's timeseries across subjects in each language group.
  1. get_f0_intensity.m:
  • Get the f0 and root-mean-square (RMS) intensity for every 10 ms of the audios using the Matlab toolbox Voicebox.
  1. get_word_frequency.py:
  • Get word frequency using the Google Books (Version 20120701) unigram frequency counts.
  1. get_word_embeddings.py:
  • Extract GloVe and BERT embeddings using SpaCy.
  1. parse_syntax.sh:
  • Get part-of-speech tagging, constituent tree structure and dependency relations for every sentence using the Stanford parser.
  1. count_parser_actions.py:
  • Syntactic node counts for each word in the audiobooks based on bottom-up, top-down and left-corner parsing strategies as applied to the Stanford-derived constituency trees described above.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages