Semantic search using NLP on extracted text
The text corpus can be extracted from any website that allows web scraping. The BeautifulSoup library is used to parse the components of an HTML webpage and extract the text from its body.
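The extraction step might look roughly like the following. This is a minimal sketch rather than the project's actual code: the function name `extract_body_text`, the choice of the built-in `html.parser` backend, and the decision to drop `script`, `style`, and `noscript` elements are all assumptions made for illustration.

```python
from bs4 import BeautifulSoup


def extract_body_text(html: str) -> str:
    """Return the readable text of the page body as one whitespace-normalized string.

    Hypothetical helper: the name and the cleanup rules are illustrative assumptions.
    """
    soup = BeautifulSoup(html, "html.parser")

    # Remove elements whose contents are not readable prose.
    for tag in soup(["script", "style", "noscript"]):
        tag.decompose()

    # Fall back to the whole document if the page has no <body> tag.
    root = soup.body if soup.body is not None else soup

    # Join all text nodes with spaces, then collapse runs of whitespace.
    return " ".join(root.get_text(separator=" ").split())


# A small inline page stands in for a downloaded one.
sample_html = """
<html><head><title>Demo</title><style>p {color: red;}</style></head>
<body><h1>Semantic search</h1><p>Text is <b>extracted</b> here.</p>
<script>console.log("ignored");</script></body></html>
"""
text = extract_body_text(sample_html)
print(text)
```

In practice the HTML string would come from an HTTP client rather than an inline literal; how the page is fetched is left out here because it is not described above.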
- The semantic meaning of open-ended questions is not fully captured
- Web scraping is not reliable, and webpages with popups or redirects can cause extraction to fail