Corpus and Vocabulary Preprocessing Utilities for Natural Language Pipelines
Updated Jun 6, 2021 - C++
Skipgram with Hierarchical Softmax
fastText v0.9.3 (C++ port)
Fast word-like N-gram embeddings
Bilingual word embedding mapping using fastText
Lossless Compression Techniques for Embedding Tables in Substantial Deep Learning-Based Recommendation System
Golang "native" implementation of the word2vec algorithm (word2vec++ port)
FAISS library compiled for iOS, macOS, tvOS, watchOS
Distributed Representations of Sentences and Documents
Distributed Representations of Words using word2vec
R package to Embed All the Things! using StarSpace
R wrapper for fastText
Reference implementation of the paper VERSE: Versatile Graph Embeddings from Similarity Measures
Epsilla is a high-performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information