Dataset deduplication using the spark ML lib and Scala
-
Updated
Mar 18, 2019 - Scala
Dataset deduplication using the spark ML lib and Scala
Recommendation System on cryptocurrency, using data collected from users' tweets + 10-Fold Cross Validation ( Based on the cryptocoins from each user's tweets, the program runs algorithms on the data, resulting in the recommendation of other cryptocoins for each user) ( readme in greek but soon to be translated in English )
An implementation of Locality sensitive hashing
This is a task using python to find number of similar songs within the provided songs set.
Image Retrieval implementation using Deep Learning and Kernelized Locality-Sensitive Hashing
Image classification and unsupervised learning using latent space vectors produced by convolutional neural nets together with the original vectors space
Implementation tasks for multiple algorithms to process massive data. The algorithms are written in Python.
Vectors - Nearest neighbor search and Clustering using LSH, Hypercube (and Lloyd's only at the clustering) algorithms with L2 metric.
📈|Time Series - Nearest neighbor search and Clustering using LSH, Hypercube (and Lloyd's only at the clustering) algorithms with metrics: L2, Discrete and Continuous Fréchet.
TTAK.KO-12.0276 LSH Recursive Hasher
📈 kNN using LSH and Hypercube projection & Clustering using kMeans++ for n-dim polygonal curves and time series
Search your object with hash
distill large scale web page text
A Robust Library in C# for Similarity Estimation
Locality Sensitive Hashing, fuzzy-hash, min-hash, simhash, aHash, pHash, dHash。基于 Hash值的图片相似度、文本相似度
In this repository you can find an implementation of LSH (Local | Sensitive Hashing) and Finesse algorithms, designed to find similar data based on their hashes
Add a description, image, and links to the lsh-implementation topic page so that developers can more easily learn about it.
To associate your repository with the lsh-implementation topic, visit your repo's landing page and select "manage topics."