ngram
Here are 146 public repositories matching this topic...
Touch typing trainer using N-grams as data source, with options to customize the auto-generated lessons and specify the minimum typing performance needed. There are sound/color effects as well.
Updated Aug 12, 2024 - JavaScript
Python implementation of an N-gram language model with Laplace smoothing and sentence generation.
Updated Feb 9, 2018 - Python
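As an illustration of what Laplace (add-one) smoothing means for an N-gram language model, here is a minimal bigram sketch. It is not taken from the repository above; the function names `train_bigram_counts` and `laplace_prob`, the whitespace tokenization, and the `<s>`/`</s>` boundary markers are all assumptions made for this example.

```python
from collections import Counter


def train_bigram_counts(sentences):
    """Count contexts and bigrams over whitespace-tokenized sentences.

    Hypothetical helper: "<s>" and "</s>" mark sentence boundaries.
    """
    contexts = Counter()
    bigrams = Counter()
    vocab = {"</s>"}  # words the model can predict ("<s>" is never predicted)
    for sentence in sentences:
        words = sentence.split()
        vocab.update(words)
        tokens = ["<s>"] + words + ["</s>"]
        contexts.update(tokens[:-1])             # every token that has a successor
        bigrams.update(zip(tokens, tokens[1:]))  # adjacent token pairs
    return contexts, bigrams, vocab


def laplace_prob(prev, word, contexts, bigrams, vocab):
    """Add-one estimate: (count(prev, word) + 1) / (count(prev) + |V|)."""
    return (bigrams[(prev, word)] + 1) / (contexts[prev] + len(vocab))


contexts, bigrams, vocab = train_bigram_counts(["the cat sat", "the dog sat"])
seen = laplace_prob("the", "cat", contexts, bigrams, vocab)    # observed bigram
unseen = laplace_prob("cat", "dog", contexts, bigrams, vocab)  # never observed
```

Because one pseudo-count is added to every possible successor, an unseen bigram such as ("cat", "dog") still receives a nonzero probability, and the probabilities over the vocabulary for a given context still sum to 1.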
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` which allows you to build, view, manipulate a…
Updated Nov 16, 2023 - C++
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
Updated Apr 25, 2022 - Scala
Ngrams with Basic Smoothings
Updated May 15, 2024 - Python
🍰 A library for creating n-grams, skip-grams, bag of words, bag of n-grams, bag of skip-grams.
Updated Mar 8, 2022 - Java
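For readers unfamiliar with the terms, the following Python sketch shows one common way to define n-grams, skip-grams, and a bag of n-grams. It does not reflect the API of the library above; the names `ngrams`, `skipgrams`, and `bag_of_ngrams` are hypothetical, and the skip-gram variant assumed here is the k-skip-n-gram, in which at most `k` tokens in total may be skipped inside each pattern.

```python
from collections import Counter
from itertools import combinations


def ngrams(tokens, n):
    """All contiguous runs of n tokens, in order of appearance."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def skipgrams(tokens, n, k):
    """k-skip-n-grams: n tokens in order, skipping at most k tokens in total."""
    result = []
    for i in range(len(tokens)):
        # The remaining n-1 tokens come from the next n-1+k positions.
        window = range(i + 1, min(i + n + k, len(tokens)))
        for rest in combinations(window, n - 1):
            result.append((tokens[i],) + tuple(tokens[j] for j in rest))
    return result


def bag_of_ngrams(tokens, n):
    """Multiset of n-grams: maps each n-gram to its number of occurrences."""
    return Counter(ngrams(tokens, n))


tokens = "a b c d".split()
bigrams = ngrams(tokens, 2)             # contiguous pairs
skip_bigrams = skipgrams(tokens, 2, 1)  # pairs with at most one skipped token
bag = bag_of_ngrams("a b a b".split(), 2)
```

With `k = 0` the skip-gram function reduces to ordinary n-grams, which is a convenient sanity check for this definition.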
multiprocess unsupervised chinese_detect_words ngram_combination
Updated Jan 2, 2019 - Python
Cluster and merge similar string values: an R implementation of Open Refine clustering algorithms
Updated Mar 14, 2024 - C++
Chinese corpus cleaning and quality evaluation for large-model pre-training
Updated Jul 25, 2024 - Java
Programming for NLP Project - Implement a basic n-gram language model and generate sentences using beam search
Updated Mar 10, 2020 - Python