Skip to content

elgoog is a search & retrieval engine that uses Inverted Index for lookup and Bayesian Inference Network for retrieval

License

Notifications You must be signed in to change notification settings

shibin-george/elgoog

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

55 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

elgoog

elgoog is a (basic) document-search and query-retrieval engine that I developed for the Applied Information Retrieval (CS546, Fall-2019) class at UMass, Amherst.

elgoog uses inverted-index for fast query lookup.

It also supports phrase operators (like ordered and unordered windows or exact phrase matches (which is same as ordered window operator with no separation between the terms)) by using a Bayesian inference network model for query retrieval.

elgoog currently uses 4 scoring mechanisms:

  1. raw-count
  2. BM-25
  3. Query likelihood using Jelenik Mercer smoothing
  4. Query likelihood using Dirichlet smoothing

About

elgoog is a search & retrieval engine that uses Inverted Index for lookup and Bayesian Inference Network for retrieval

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages