DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)
-
Updated
Feb 8, 2022 - Jupyter Notebook
DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)
String Distances in Julia
Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Document Similarity using Word2Vec
Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")
Simhash implementation in Javascript
Make plagiarism detection easier. This script will find similar sentences between given files and highlight them in a side by side comparison.
Keras implementation of "SimGNN: A Neural Network Approach to Fast Graph Similarity Computation". Includes synthetic GED data.
Similarity Analysis to Defeat Malware Compiler Variations
A perceptual hash is a fingerprint of a multimedia file derived from various features from its content. Unlike cryptographic hash functions which rely on the avalanche effect of small changes in input leading to drastic changes in the output, perceptual hashes are "close" to one another if the features are similar.
Code for NLPCC2016 Chinese Word Similarity Task
Librarian: An Empirical Study of Security Updates in Android Apps’ Native Code
This repository consists of all the code required for similar 2-D pose detection in dance videos. This can used for any type of pose estimation application to find the similarity.
Symmetric Delete spelling correction algorithm using Java
Parallel all-pairs similarity search algorithms in ocaml #ocaml
A MatchMaker Exchange server
A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
flags duplicate issues & PRs using embeddings
The implementation of sdhash, the algorithm to calculate similarity digests, rewritten in pure go language 🐹
A text similarity metric library, e.g. from edit distance's (Levenshtein, Gotoh, Jaro, etc) to other metrics, (e.g Soundex, Chapman). This library is compiled based on the .NET standard with a lot of useful extension methods.
Add a description, image, and links to the similarity-score topic page so that developers can more easily learn about it.
To associate your repository with the similarity-score topic, visit your repo's landing page and select "manage topics."