ngram

Here are 144 public repositories matching this topic...

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e., patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` which allows you to build, view, manipulate a…

  • Updated Nov 16, 2023
  • C++
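
To make the n-gram and skipgram terminology above concrete, here is a minimal Python sketch. It is not Colibri core's API: the function names `ngrams` and `skipgrams` and the gap marker `GAP` are hypothetical, and only fixed-size gaps at interior positions are covered.

```python
from itertools import combinations

GAP = "{*}"  # hypothetical gap marker, chosen only for this illustration


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def skipgrams(tokens, n):
    """Return n-token patterns with one or more interior tokens replaced by a gap.

    The first and last token of each pattern are always kept, so every
    gap is of fixed size and is bounded by real tokens on both sides.
    """
    results = []
    interior = range(1, n - 1)
    for gram in ngrams(tokens, n):
        for k in range(1, len(interior) + 1):
            for gaps in combinations(interior, k):
                results.append(
                    tuple(GAP if i in gaps else tok for i, tok in enumerate(gram))
                )
    return results


if __name__ == "__main__":
    words = "to be or not to be".split()
    print(ngrams(words, 3))
    print(skipgrams(words, 3))
```

A memory-efficient tool would likely store patterns in a compact encoded form rather than as tuples of strings; the tuples here are only for readability.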

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.

  • Updated Apr 25, 2022
  • Scala

This project is a text auto-completion program implemented in Python using N-gram models. It suggests the next word based on the user's input, using trigram and bigram models to generate predictions.

  • Updated Sep 1, 2023
  • Python
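
The next-word suggestion described above can be sketched as follows. This is not the project's code: the function names `train` and `suggest`, the toy corpus, and the choice to try the trigram model first and fall back to the bigram model are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def train(tokens):
    """Count which word follows each one-word and two-word context."""
    bigram = defaultdict(Counter)
    trigram = defaultdict(Counter)
    for i in range(len(tokens) - 1):
        bigram[tokens[i]][tokens[i + 1]] += 1
    for i in range(len(tokens) - 2):
        trigram[(tokens[i], tokens[i + 1])][tokens[i + 2]] += 1
    return bigram, trigram


def suggest(context, bigram, trigram, k=3):
    """Return up to k candidate next words, most frequent first.

    The last two words are looked up in the trigram counts; if that
    context was never seen, the last word is looked up in the bigram
    counts; if that also fails, no suggestion is made.
    """
    words = context.lower().split()
    if len(words) >= 2 and tuple(words[-2:]) in trigram:
        counts = trigram[tuple(words[-2:])]
    elif words and words[-1] in bigram:
        counts = bigram[words[-1]]
    else:
        return []
    return [word for word, _ in counts.most_common(k)]


if __name__ == "__main__":
    # Toy corpus, invented for this example.
    corpus = "the cat sat on the mat the cat ate the fish".split()
    bigram, trigram = train(corpus)
    print(suggest("on the", bigram, trigram))
    print(suggest("the", bigram, trigram, k=1))
```

Raw counts are used for ranking here; a fuller model would typically normalize them into probabilities and apply smoothing for unseen contexts.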
