Skip to content
Determine word frequency of LDS General Conference talks.
Python
Branch: master
Clone or download

Latest commit

Fetching latest commit…
Cannot retrieve the latest commit at this time.

Files

Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
LICENSE.txt
README.md
genconf_word_count.py

README.md

General Conference Word Frequency

My initial script here was very rough and ugly, done some time ago. I decided to update it to utilize the Natural Language Toolkit as well as a Count Vectorizer from scikit-learn. It's now much quicker and more accurate.

The script provides a word count for LDS General Conference talks.

Dependencies

  • Python 2.7
  • Pandas
  • Numpy
  • Scipy
  • BeautifulSoup
  • NLTK
  • Scikit-Learn
  • re
  • requests

Usage

Download the python script and change the "url" variable to the link of the General Conference talk you want to analyze. Run the script, which will output a CSV consisting of a word and the number of times it was used within the talk.

I've filtered out "stop words" (common words to be ignored) using both the Natural Language Toolkit's and scikit-learn's "stop word" dictionaries.

You can’t perform that action at this time.