Parallel DPH Scorer

It is a parallel implementation of an Information Retrieval System that uses the DPH weighting model to score documents based on a given set of user queries.

Technologies

Prerequisites

Clone the repository in your system
Download Eclipse IDE for Java
After opening Eclipse, when asked for working directory, select the parent directory of the repository folder

How to run the project?

Open the Windows Powershell terminal in administrator mode.
Go to the chocolatey website and copy the following command.
Paste and run the command in the terminal.
Install Java version 11 and Maven

choco install openjdk11
choco install maven

In Eclipse, go to File -> Import -> General -> Existing Projects into Workspace . Then select root directory as the repository directory. Eclipse will take some time to build the project.
Once build, right-click on the project and then go to Build Path -> Configure Build Path. Select the Libraries tab and click on Edit. Select Add and then Standard VM. Click on Directory and select the directory where chocolatey installed Java version 11. This would be the following directory and you can find it in the terminal after you install openjdk11.

Select the new jdk and apply all changes. The project will recompile.
Go to Project -> Properties. Select Java Compiler and enable project specific settings. Change compiler compliance level to 1.8. Apply all changes and close.
In the project directory shown at the left-hand side, go to src -> uk.ac.gla.dcs.bigdata.apps
Right-click on AssessedExercise.java and Run it as a Java Application.
Once the project is done running, this will create a folder in the results directory, containing three files named after the user queries in the data folder. Inside each file, are the top 10 documents and their DPH scores for that query. The data folder also has the JSON file of the documents.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
resources		resources
results/1677456540688		results/1677456540688
src/uk/ac/gla/dcs/bigdata		src/uk/ac/gla/dcs/bigdata
.gitignore		.gitignore
.project		.project
README.md		README.md
chocolatey.PNG		chocolatey.PNG
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Parallel DPH Scorer

Technologies

Prerequisites

How to run the project?

About

Releases

Packages

Languages

Ayanabha123456/Parallel-DPH-Scorer

Folders and files

Latest commit

History

Repository files navigation

Parallel DPH Scorer

Technologies

Prerequisites

How to run the project?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages