Skip to content

jfuentes/IRWeb

Repository files navigation

IRWeb

This project is a search engine for ics.uci.edu. It is based on a Crawler first step and then access to indexed data.

The main characteristics are:

  • Crawler4j library is used for crawling the ics.uci.edu domain.
  • Google Gson library is used for comparing the same query results with NDCG metrics.
  • Berkeley DB is used for persistence (crawled data and indexes).
  • PageRank algorithm is used to rank the results.
  • A web interface is provided. A simple Servlet operates the search requests.

Authors: Joel Fuentes & Han Ke.

About

This project is a search engine for ics.uci.edu

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published