Skip to content

Mohamed20Ahmed/Search-Engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Search-Engine

Project Details:

  1. Read 10 files (.txt)
  2. Apply tokenization
  3. Apply Stop words (except [in,to])
  4. Build positional index and displays each term
  5. Allow users to write phrase query on positional index and system returns the matched documents for the query.
  6. Compute term frequency for each term in each document.
  7. Compute IDF for each term.
  8. Displays TF.IDF matrix.
  9. Compute cosine similarity between the query and matched documents.
  10. Rank documents based on cosine similarity.