Skip to content

ahsan7162/Bolean-Retrieval-Model-in-Python

Repository files navigation

Boolean-Retrieval-Model-in-Python

what is Boolean Retrieval Model

The (standard) Boolean model of information retrieval (BIR)[1] is a classical information retrieval (IR) model and, at the same time, the first and most-adopted one. It is used by many IR systems to this day. The BIR is based on Boolean logic and classical set theory in that both the documents to be searched and the user's query are conceived as sets of terms (a bag-of-words model). Retrieval is based on whether or not the documents contain the query terms.

Features of this Project

  • Tokenization
  • Removing Stop-words
  • Stemming (using NLTK library)
  • Creating Inverted Index
  • Creating Postional Index
  • Solving Simple Queries ( AND , OR , NOT)
  • Solving Proximity Queries (/k words Apart)
  • GUI (using PyQt) based Implementation