wordpredict

This is a library that predicts words for ambiguous input.

Installation

pip install wordpredict

How to use

code

import pandas as pd
from wordpredict import WordPredict


corpus = pd.read_csv(
    "./unigram_freq.csv",
    header=0,
    keep_default_na=False,
).values
wp = WordPredict(corpus[:, 0], corpus[:, 1])

print("start user input")

input = ["e", "f", "g", "h"]
print(wp.update(input))
input = ["e", "f", "g", "h"]
print(wp.update(input))
input = ["i", "j", "k", "l"]
print(wp.update(input))

print("reset user input")
wp.reset()

input = ["e", "f", "g", "h"]
print(wp.update(input))
input = ["m", "n", "o", "p"]
print(wp.update(input))

output

start user input
['for', 'e', 'from', 'he', 'has', 'have']
['he', 'get', 'here', 'her', 'help', 'few']
['help', 'held', 'felt', 'hell', 'hello', 'helps']
reset user input
['for', 'e', 'from', 'he', 'has', 'have']
['for', 'home', 'go', 'how', 'good', 'end']

corpus

e.g., https://www.kaggle.com/datasets/rtatman/english-word-frequency

execution time

%%timeit

import pandas as pd
from wordpredict import WordPredict


corpus = pd.read_csv(
    "./unigram_freq.csv",
    header=0,
    keep_default_na=False,
).values
wp = WordPredict(corpus[:, 0], corpus[:, 1])

1.42 s ± 83.7 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

%%timeit

input = ["e", "f", "g", "h"]
wp.update(input)
input = ["e", "f", "g", "h"]
wp.update(input)
input = ["i", "j", "k", "l"]
wp.update(input)

8.34 ms ± 315 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

note

autocomple was implemented with reference to https://doi.org/10.1145/3173574.3173755

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
wordpredict		wordpredict
.gitignore		.gitignore
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

wordpredict

Installation

How to use

code

output

corpus

execution time

note

About

Releases 5

Packages

Languages

2ndr9/wordpredict

Folders and files

Latest commit

History

Repository files navigation

wordpredict

Installation

How to use

code

output

corpus

execution time

note

About

Resources

Stars

Watchers

Forks

Releases 5

Packages 0

Languages

Packages