This repo shows how to compute the similarity between two documents using Doc2Vec. Documents can be of any length. In one specific case, one document holds a user's query and the other documents are the text files the query was drawn from; Doc2Vec then scores the similarity between the query and each document. The highest score should point to the document the query was originally taken from.
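
To make the "highest score" idea concrete, here is a minimal sketch. It assumes the score is the cosine similarity between a query vector and each document vector, which is the usual measure for Doc2Vec embeddings. The vectors are made-up toy values, and `cosine_similarity` and `best_match` are hypothetical helper names, not functions from this repo.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as having no similarity
    return dot / (norm_a * norm_b)


def best_match(query_vec, doc_vecs):
    """Score the query against every document and return the top index."""
    scores = [cosine_similarity(query_vec, v) for v in doc_vecs]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


# Toy vectors standing in for Doc2Vec embeddings (illustrative values only).
doc_vecs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [1.0, 1.0, 0.0]]
query_vec = [0.9, 0.1, 0.0]

best_index, scores = best_match(query_vec, doc_vecs)
print(best_index, scores)
```

The document at `best_index` is the one the query is predicted to come from.
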
- Extract the file from the squad_train_doc.json folder.
- Load the JSON file.
- Implement the model and run it.
- Pick a question from any of the documents using list_questions_per_doc[index] and use it as the query.
- Compute the dataframe of similarity scores.
- Compare the results: the highest score should belong to the document the question was taken from.