Skip to content

Simple search engine implementation indexing PDF and text files to process queries

Notifications You must be signed in to change notification settings

remidomingues/Search-Engine-Demonstrator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Search-Engine-Demonstrator

Implementation of a search engine, indexing PDF and text files to process queries. The index can be stored on disk to speed up the initialization.

Query

  • Unigram (OR)
  • Bigram (OR)
  • Multiword (AND)
  • Phrase

Ranking

  • TF-IDF (Relevance feedback)
  • PageRank
    • Power iterations (default, highest accuracy)
    • MC end-point with random start
    • MC end-point with cyclic start
    • MC complete path
    • MC complete path stopping at dangling nodes
    • MC complete path with random start (recommended)
  • Weighted TF-IDF and PageRank

About

Simple search engine implementation indexing PDF and text files to process queries

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages