Sementic_Textual_Similarity

Task: Finding Semantic Textual Similarity Proposed Approach:

Encode both the sentence using Universal Sentence Encoder.
Find the Cosine Similarity between both the encoded vectors.

Why ?

The Universal Sentence Encoder encodes text into high dimensional vectors. The pre-trained Universal Sentence Encoder is publicly available in Tensorflow-hub . It comes with two variations i.e. one trained with Transformer encoder and other trained with Deep Averaging Network (DAN). It gives an encoding vector of 512 dimensions. The main reason behind using this was that this model has crossed all the baseline scores in the domain of natural language processing tasks such as text classification, semantic textual similarity, clustering, etc. We have used cosine similarity for the measure of similarity between two embedded vector as it fulfills the two basic criteria

We want a score where 0 represents highly similar and 1 represents highly dissimilar.
In natural language tasks generally, it is good to practise to use cosine similarity.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
Sementic_Textual_Similaity.ipynb		Sementic_Textual_Similaity.ipynb
Text_Similarity_Dataset.csv		Text_Similarity_Dataset.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sementic_Textual_Similarity

Task: Finding Semantic Textual Similarity Proposed Approach:

Why ?

About

Releases

Packages

Languages

niteshsukhwani/Sementic_Textual_Similarity

Folders and files

Latest commit

History

Repository files navigation

Sementic_Textual_Similarity

Task: Finding Semantic Textual Similarity Proposed Approach:

Why ?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages