CSCI 3230 Project Text file searching
Given a set of text documents:
- Clean file to a string without caps, spacing, suffix, symbols, etc.
- Build frequency list (vectors)
- Build Inverted Index
- Build GUI
- Get user input (word/phrase to search for)
- pull documents from Inverted Index
- Pull frequency data
- Arrange and display documents by frequency data
(Optional) - Display similar documents
TextFile.txt
|
*************
* Cleaner *
*************
|
Clean string
|
***************
* Processer *
***************
|
Frequency vector
|
********************
* Frequency List *
********************
|
********************
* Inverted Index *
********************