SpellChecker

A simple spelling corrector that generates candidates on misspelled words and leverages a language model for ranking candidates. To speed up the candidate generation process, a BKTree is used to search in the space of possible words K edit distance computations away from a misspelled word.

Basic usage

Requires a training dataset of sentences.

input_training_data = [
        'they can go quite fast',
        'there were the new Japanese Honda',
    ]

train_docs = DocumentArray([
    Document(content=t) for t in input_training_data
    ])

with Flow().add(uses=SpellChecker) as f:
    f.post(on='/train', inputs=train_docs)

Check the trainer for information on what parameters it supports. These can be passed with

    f.post(on='/train', inputs=train_docs, parameters={'param1': value, ...})

If the parameters are not recognized by the trainer they will be ignored.

Then the spelling of your text Documents can be fixed as follows:

...
input_docs = DocumentArray(
            [Document(content=t) for t in incorrect_text]
            )
results = f.post(on='/index', inputs=input_docs, return_results=True)
print(results[0].docs)  # documents can be found here

Note that calling the /train again will delete the existing model and replace it with the new one it trains.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github/workflows		.github/workflows
executor		executor
tests		tests
.gitignore		.gitignore
README.md		README.md
config.yml		config.yml
manifest.yml		manifest.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SpellChecker

Basic usage

About

Releases

Packages

Contributors 4

Languages

jina-ai/executor-spellchecker

Folders and files

Latest commit

History

Repository files navigation

SpellChecker

Basic usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages