The iPython notebook here:
should be updated to reflect the current state of the database. Specifically, this part:
"Here we're asking for a list of IDs of all studies that use words starting with 'emo' (e.g.,'emotion', 'emotional', 'emotionally', etc.) at a frequency of 1 in 1,000 words or greater (in other words, if an article has 5,000 words of text, it will only be included in our set if it uses words starting with 'emo' at least 5 times). Let's find out how many studies are in our list:"
should be changed because the weights are no longer "this many words in 1000" frequencies, they are tf-idf (normalized) frequencies.
demo updated; closes #48