Behavioral phenotyping project

Behavioral data embeddings for the stratification of individuals with neurodevelopmental conditions.

Designed for observational measurements of cognition and behavior of individuals with Autism Spectrum Conditions (ASCs).

TODO: Abstract

Technical Requirements

Python 3.6+

R 3.4+

The full list of required Python Packages is available in requrirements.txt file. It is possible to install all the dependency by:

$ pip install -r requirements.txt

Behavioural Phenotyping Pipeline (TLDR ;))

A complete example of the Behavioural Phenotype Stratification is available as Jupyter notebook:

jupyter notebook behavioral_phenotyping_pipeline.ipynb

Documentation (at a glance)

The code is structured into multiple modules (.py files), including algorithms and methods for the multiple steps of the pipeline:

dataset.py: Connects to the database and dump data
features.py: Returns vocabulary and dictionary of behavioral EHRs for each of the 4 possible depth levels. It also returns a dataset with quantitative scores for level 4 features
pt_embedding.py: Performs TFIDF for patient embeddings; Glove embeddings on words and average them out for subject embeddings; Word2vec embeddings on words, that are then averaged to output individual representations
clustering.py: Performs Hierarchical Clustering/k-means on embeddings, and quantitative 4th level features
visualization.py: Visualizes results (e.g. scatterplot & dendrogram)for sub-cluster visualization; Heatmap for inspection of quantitative scores between sub-clusters
basic_statistics.py: Returns basic demographic statistics for dataset description
test-demog-cl.R: Runs multiple pairwise comparisons between subgroups to check for confounders and support clinical validation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

README.md

README.md

basic_statistics.py

basic_statistics.py

behavioral_phenotyping_pipeline.ipynb

behavioral_phenotyping_pipeline.ipynb

clustering.py

clustering.py

datamap.py

datamap.py

dataset.py

dataset.py

features.py

features.py

pt_embedding.py

pt_embedding.py

requirements.txt

requirements.txt

test-demog-cl.R

test-demog-cl.R

visualization.py

visualization.py

Repository files navigation

Behavioral phenotyping project

TODO: Abstract

Technical Requirements

Behavioural Phenotyping Pipeline (TLDR ;))

Documentation (at a glance)

TODO: Paper, Poster, Conference Reference

TODO: Credits and Acknowledgements

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.gitignore		.gitignore
README.md		README.md
basic_statistics.py		basic_statistics.py
behavioral_phenotyping_pipeline.ipynb		behavioral_phenotyping_pipeline.ipynb
clustering.py		clustering.py
datamap.py		datamap.py
dataset.py		dataset.py
features.py		features.py
pt_embedding.py		pt_embedding.py
requirements.txt		requirements.txt
test-demog-cl.R		test-demog-cl.R
visualization.py		visualization.py

leriomaggio/behavioral_phenotyping

Folders and files

Latest commit

History

Repository files navigation

Behavioral phenotyping project

TODO: Abstract

Technical Requirements

Behavioural Phenotyping Pipeline (TLDR ;))

Documentation (at a glance)

TODO: Paper, Poster, Conference Reference

TODO: Credits and Acknowledgements

About

Resources

Stars

Watchers

Forks

Languages