This program implements the Vector Space Information Retrieval model for searching relevant documents,
demonstrated on the Cranfield corpus.
The program will read through the the cran.all.1400 corpus and ask the user for a query ID from the cran.qry file.
The program will then calculate the Cosine Similarity Scores for each document in relation to the query to then
create a list of top k results for relevant documents, where k is the number of results that the user wishes to
preview.
How to Compile:
javac -O -cp ".\opennlp-tools-1.9.1.jar" .\VectorSpaceModelIR.java
How to Run and their Parameters:
java -cp ".\opennlp-tools-1.9.1.jar" VectorSpaceModelIR .\cranfield-corpus\cran.all.1400 .\cranfield-corpus\cran.qry
SiriuslySirius/vector-space-information-retrieval-model-implementation
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published