ushashwat/Semantic-Search-Engine

Semantic Search Engine

Semantic search using NLP on extracted text

Data Extraction

The text corpus can be extracted from any website that allows web scraping. The BeautifulSoup library is used to parse the HTML of a webpage and extract the text from its body.
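The extraction step could look roughly like the sketch below. It is not the repository's actual code: the function name `extract_body_text`, the sample HTML, and the choice to drop `script`, `style`, and `noscript` elements are assumptions made for illustration. It parses an HTML string with BeautifulSoup and returns the body text as a single whitespace-normalized string.

```python
from bs4 import BeautifulSoup


def extract_body_text(html: str) -> str:
    """Return the text of the <body> as one whitespace-normalized string.

    Hypothetical helper for illustration; not taken from the repository.
    """
    # "html.parser" is Python's built-in parser, so no extra parser package is needed.
    soup = BeautifulSoup(html, "html.parser")

    # Remove elements whose contents are not rendered as page text (assumed choice).
    for tag in soup(["script", "style", "noscript"]):
        tag.decompose()

    # Fall back to the whole document if the page has no <body> element.
    root = soup.body if soup.body is not None else soup

    # A space separator keeps text from adjacent tags from running together.
    text = root.get_text(separator=" ")
    return " ".join(text.split())


# Inline sample page (made up) so the example runs without network access.
sample_html = """
<html><head><title>Demo</title><style>p {color: red}</style></head>
<body><h1>Semantic search</h1><p>Find passages by meaning.</p>
<script>console.log("ignored")</script></body></html>
"""

corpus = extract_body_text(sample_html)
print(corpus)
```

In practice the HTML string would come from an HTTP response for a page that permits scraping. The sketch deliberately takes a string so that fetching and parsing stay separate.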

Acknowledgement

AllenNLP

Limitations

  • The semantic meaning of open-ended questions is not fully captured
  • Web scraping is not always reliable; webpages with popups or redirects can cause extraction to fail
