A search engine that takes keyword queries as input and returns a ranked list of relevant results. It scrapes a few thousand pages from one of the seed Wiki pages and uses Elasticsearch as its full-text search engine.

Smile040501/Search-Engine

Search Engine

A search engine that takes keyword queries as input and returns a ranked list of relevant results. It scrapes a few thousand pages starting from the seed Wiki page "List of Marvel Cinematic Universe films" and uses Elasticsearch as its full-text search engine. On top of Elasticsearch, a search portal built with React.js and Node.js lets users enter a query and view the retrieved results.
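As a rough illustration of how a keyword query could be handed to Elasticsearch, the sketch below builds plain request bodies as Python dictionaries. It is not taken from this repository: the field name `content`, the similarity name `page_similarity`, and both helper functions are hypothetical. The `BM25` and `LMDirichlet` similarity types and the `operator` option of the `match` query are standard Elasticsearch features, and the parameter values shown are Elasticsearch's documented defaults.

```python
from typing import Any, Dict


def build_index_settings(model: str = "BM25") -> Dict[str, Any]:
    """Return an index-creation body that selects a scoring (similarity) model.

    Assumption: pages are stored in a single text field called "content".
    """
    if model == "BM25":
        similarity = {"type": "BM25", "k1": 1.2, "b": 0.75}
    elif model == "LMDirichlet":
        similarity = {"type": "LMDirichlet", "mu": 2000}
    else:
        raise ValueError(f"unsupported scoring model: {model}")
    return {
        "settings": {"index": {"similarity": {"page_similarity": similarity}}},
        "mappings": {
            "properties": {
                "content": {"type": "text", "similarity": "page_similarity"}
            }
        },
    }


def build_search_body(query: str, conjunctive: bool = False, size: int = 10) -> Dict[str, Any]:
    """Return a search body for a keyword query.

    conjunctive=True requires every keyword to match ("and");
    conjunctive=False accepts any keyword ("or").
    """
    operator = "and" if conjunctive else "or"
    return {
        "size": size,
        "query": {"match": {"content": {"query": query, "operator": operator}}},
    }


if __name__ == "__main__":
    print(build_index_settings("LMDirichlet"))
    print(build_search_body("iron man", conjunctive=True, size=20))
```

The dictionaries can be serialized to JSON and sent to an Elasticsearch index-creation or `_search` endpoint by whichever client the portal uses.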

Features

  • Cleaning and pre-processing of the scraped data
  • Proper visualization of the ranked list of pages that hold the relevant answers
  • Support for the Okapi BM25 and LM Dirichlet scoring models
  • Query keyword suggestions based on Levenshtein edit distance
  • Support for both disjunctive and conjunctive keyword queries
  • A configuration window where users can choose the scoring model and the number of results shown on the results page
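The keyword-suggestion feature above relies on Levenshtein edit distance. The following is a minimal sketch of that idea, not the project's actual implementation: the `suggest` helper, its `max_distance` and `limit` parameters, and the sample vocabulary are all assumptions made for illustration.

```python
from typing import List


def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insertions, deletions, substitutions)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(
                min(
                    previous[j] + 1,         # delete ch_a
                    current[j - 1] + 1,      # insert ch_b
                    previous[j - 1] + cost,  # substitute or match
                )
            )
        previous = current
    return previous[-1]


def suggest(term: str, vocabulary: List[str], max_distance: int = 2, limit: int = 5) -> List[str]:
    """Return vocabulary words within max_distance edits of term, closest first."""
    scored = [(levenshtein(term.lower(), word.lower()), word) for word in vocabulary]
    close = sorted((dist, word) for dist, word in scored if dist <= max_distance)
    return [word for _, word in close[:limit]]


if __name__ == "__main__":
    # Hypothetical vocabulary; in practice it might come from indexed terms.
    print(suggest("avengrs", ["avengers", "thor", "avenger", "hulk"]))
```

Comparing the query term against every vocabulary word is linear in the vocabulary size, which is acceptable for a few thousand pages; a larger corpus might delegate suggestions to the search engine instead.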

License

MIT

Author

Mayank Singla
