Text analysis toolkit for Data Structures course project
The goal of this project is to provide a GUI for a program that analyzes a text file. The program allows the user to open an existing text file and break it down into components. It will break the text file into individual tokens (words and punctions), distinct words, individual sentences, and individual paragraphs. The program calculates statistics such as the number of times a word appears, the average word count for a sentence, the average sentence count for a paragraph, and more.
The opening menu allows the user to enter information. E-mail address and phone number are checked using regular expressions.
After selecting a text file, the program displays the contents and summary information at the bottom of the window.
For this project, a token refers to a word or end of word punctuation symbol.
Distinct words and the number of times they appear in the text are displayed.
Each sentence is broken down with statistics for each one.
Paragraphs are also analyzed for relevant statistics.