Word Counter Application

The original goal of this project was to find and count the 100 most frequently occurring words in Moby Dick. I broadened the scope by letting the user supply any text file and by adding a simple GUI that displays the top 100 words in a histogram.

Demo

Notes

My word counts for Moby Dick are lower than many others I found on the web. This is because I chose to treat the hyphen ('-') and apostrophe ('’') as word characters. As a result, "whale" and "whale's" are counted separately, as are "carpet" and "carpet-bag", so occurrences of "whale's" do not add to the total for "whale".
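To illustrate the effect of that choice, here is a minimal sketch of a tokenizer that treats hyphens and apostrophes as word characters. It is not the code from this repository: the class name `TokenizerSketch`, the method `tokenize`, the regular expression, and the decision to lowercase the text are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch; names and regex are illustrative, not taken from the project.
class TokenizerSketch {
    // A word is a run of letters, straight apostrophes ('), typographic
    // apostrophes (U+2019), or hyphens. The trailing '-' in the class is literal.
    private static final Pattern WORD = Pattern.compile("[\\p{L}'\\u2019-]+");

    // Splits text into lowercase words (lowercasing is an assumption).
    static List<String> tokenize(String text) {
        List<String> words = new ArrayList<>();
        Matcher m = WORD.matcher(text.toLowerCase(Locale.ROOT));
        while (m.find()) {
            words.add(m.group());
        }
        return words;
    }

    public static void main(String[] args) {
        // Prints: [the, whale's, carpet-bag, the, whale]
        System.out.println(tokenize("The whale's carpet-bag; the whale."));
    }
}
```

In this sketch "whale's" and "whale" come out as two distinct tokens, which is why the count for "whale" alone ends up lower. One limitation of such a simple pattern is that a double hyphen used as a dash (as in "whale--the") would be kept inside a single token, so a real implementation would likely need extra handling for that case.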

Executables

The finished code produces two separate executables. The main method in the WordCounter class runs a command-line program that prompts the user for the name of a text file and then displays a histogram in a web browser. The other main method launches a GUI that is a bit prettier and more user-friendly. Both executables rely on the same underlying logic in the WordCounter and Sorter classes.
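For readers who want a feel for what that shared logic has to do, the following is a rough sketch of counting words and picking the most frequent ones. It is not the actual WordCounter or Sorter code: the class `TopWordsSketch`, the methods `count` and `top`, and the alphabetical tie-breaking rule are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch; not the repository's WordCounter or Sorter classes.
class TopWordsSketch {
    // Tallies how many times each word occurs.
    static Map<String, Integer> count(List<String> words) {
        Map<String, Integer> counts = new HashMap<>();
        for (String word : words) {
            counts.merge(word, 1, Integer::sum);
        }
        return counts;
    }

    // Returns up to n entries, highest count first; ties are broken
    // alphabetically (an assumption made to keep the output deterministic).
    static List<Map.Entry<String, Integer>> top(Map<String, Integer> counts, int n) {
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort(Map.Entry.<String, Integer>comparingByValue().reversed()
                .thenComparing(Map.Entry.<String, Integer>comparingByKey()));
        return new ArrayList<>(entries.subList(0, Math.min(n, entries.size())));
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList("whale", "the", "whale", "sea", "the", "whale");
        // Prints: [whale=3, the=2]
        System.out.println(top(count(words), 2));
    }
}
```

For the real application, `n` would be 100 and the word list would come from the chosen text file.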

Extra files

A few test files and a couple of extra text files of classics from Project Gutenberg are included in this repository for ease of testing.

Acknowledgments

I was inspired by Matt Pearce's awesome visualization here (GitHub repo here). My visualization is modest compared to his.
I also spent a lot of time on Tablesaw's GitHub pages here.
Matthew Gillard's post on "Getting Started with JavaFX" helped me get the GUI up and running. I modeled the basic structure of my stage on his.
