Geras

Accuracy comparison of a Bayesian Network, a Dense Neural Network and an LSTM in detecting Alzheimer's symptoms on the Pitt Corpus: https://dementia.talkbank.org/access/English/Pitt.html

Brief guide

The dataset has been pre-processed to extract relevant features and keep only lemmatized content words (verbs, nouns, adverbs and adjectives). The Bayesian Network and the Dense Network classify using only features. The LSTM classifies using both features and embedded text.

Please note

To run the LSTM, access to the dataset should be requested as described here: https://dementia.talkbank.org/. Text should then be processed to keep only lemmatized content words and stored onto two .csv files, one for the training set (train_set_lstm.csv) and one for the test set (test_set_lstm.csv), as a dataframe with a text column and a dementia column with Y or N values, where Y indicates presence of dementia.

To embed the text, download the pre-trained Wiki News 300d vectors from https://fasttext.cc/docs/en/english-vectors.html, unzip the compressed file and add it to the project folder. Run lstm.py --embed. The embedded matrix will be saved to file; all further tests can be run without embedding (with lstm.py).

Extracted features:

Age
Number of utterances
Mean length of utterances
Number of sentences
Number of unique words
Number of predicates
Number of coordinated sentences
Number of subordinated sentences
Repetitions ratio
Revisions ratio
Unintelligible words ratio
Filler words ratio
Trailing offs ratio
Incomplete words ratio
Prolonged syllables ratio
Pauses between syllables ratio
Pauses between words ratio
Overlaps ratio
Adjectives and adverbs ratio
Type-token ratio
Idea density
Word2Vec distance

Performance

The algorithms perform with the following accuracy on the test set:

Model	Accuracy	Specificity	Sensitivity
BN	72.1%	69.4%	74.2%
DNN	81.1%	75.5%	85.5%
LSTM	86.5%	87.8%	85.5%

The dataset was split into 64% - 16% - 20% training/validation/test set.

Name		Name	Last commit message	Last commit date
Latest commit History 163 Commits
Background & Motivation		Background & Motivation
Literature Review		Literature Review
ML		ML
Software/geras		Software/geras
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Geras

Brief guide

Please note

Extracted features:

Performance

About

Releases

Packages

Contributors 2

Languages

License

eera-l/Geras

Folders and files

Latest commit

History

Repository files navigation

Geras

Brief guide

Please note

Extracted features:

Performance

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages