README

Author : Anerudh Balaji

A search engine over Wikipedia, built using Java and the Lucene API.

Download the Wikipedia dump from the website; the program needs it to work.

Features:
1) XML parser - parses the Wikipedia dump. (parse.java)
2) Indexer - indexes the files. (IndexFiles.java)
3) Searcher - search engine built using Lucene to retrieve the results. (SearchFiles.java)
4) AP and NDCG - average precision and NDCG calculations, comparing the results with Google. (ap.java, Ndcgcalculation.java)
5) Web UI - implemented using JSP, with features like an "I'm Feeling Lucky" button and Google-style search snippets. (Web folder)
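
The AP and NDCG feature can be illustrated with a minimal sketch. This is not the code in ap.java or Ndcgcalculation.java: the class name RankingMetrics, the method names, and the sample judgments are hypothetical. It assumes relevance judgments are already available for a ranked result list (for example, derived by comparing against Google's results), that AP is normalised by the number of relevant results present in the list, and that NDCG uses the common log2 discount.

```java
import java.util.Arrays;

// Hypothetical helper class; not part of the repository.
public class RankingMetrics {

    // Average precision: relevant[i] is true when the result at rank i+1
    // is judged relevant. Assumption: normalised by the number of relevant
    // results that appear in the list.
    public static double averagePrecision(boolean[] relevant) {
        int hits = 0;
        double sum = 0.0;
        for (int i = 0; i < relevant.length; i++) {
            if (relevant[i]) {
                hits++;
                sum += (double) hits / (i + 1); // precision at this rank
            }
        }
        return hits == 0 ? 0.0 : sum / hits;
    }

    // Discounted cumulative gain with a log2(rank + 1) discount.
    public static double dcg(int[] gains) {
        double total = 0.0;
        for (int i = 0; i < gains.length; i++) {
            double discount = Math.log(i + 2) / Math.log(2);
            total += gains[i] / discount;
        }
        return total;
    }

    // NDCG: DCG of the ranking divided by the DCG of the same gains
    // sorted in descending order (the ideal ranking).
    public static double ndcg(int[] gains) {
        int[] ideal = gains.clone();
        Arrays.sort(ideal); // ascending
        for (int i = 0, j = ideal.length - 1; i < j; i++, j--) {
            int tmp = ideal[i];
            ideal[i] = ideal[j];
            ideal[j] = tmp;
        }
        double idealDcg = dcg(ideal);
        return idealDcg == 0.0 ? 0.0 : dcg(gains) / idealDcg;
    }

    public static void main(String[] args) {
        // Made-up judgments for five ranked results, for illustration only.
        boolean[] relevant = {true, false, true, true, false};
        int[] gains = {3, 2, 3, 0, 1};
        System.out.println("AP   = " + averagePrecision(relevant));
        System.out.println("NDCG = " + ndcg(gains));
    }
}
```

A ranking that is already in ideal order gets an NDCG of 1, and a list with no relevant results gets 0 for both measures.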


List of JARs used:
Googleapi.jar
google-api-services-customsearch
googlesearch.jar
gson-1.7.jar
gson-2.1-javadoc.jar
jdom-1.0.jar
json-lib-2.4-jdk15.jar