dwks/silvius-train-lm
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
master
Could not load branches
Nothing to show
Could not load tags
Nothing to show
{{ refName }}
default
Name already in use
A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code
-
Clone
Use Git or checkout with SVN using the web URL.
Work fast with our official CLI. Learn more about the CLI.
- Open with GitHub Desktop
- Download ZIP
Sign In Required
Please sign in to use Codespaces.
Launching GitHub Desktop
If nothing happens, download GitHub Desktop and try again.
Launching GitHub Desktop
If nothing happens, download GitHub Desktop and try again.
Launching Xcode
If nothing happens, download Xcode and try again.
Launching Visual Studio Code
Your codespace will open once ready.
There was a problem preparing your codespace, please try again.
Latest commit
Git stats
Files
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
This project trains new dual command/English language models, intended to be used for voice coding with Silvius. It relies on Kaldi. Usage: 1. Clone kaldi (we tested git version 69cd717 but any should work). Run make in tools/, then configure and make in src. 2. Clone this repository as a subdirectory inside kaldi/egs/tedlium. 3. Put your command words into corpus.txt, separated by whitespace, as in the output of `python grammar/lm.py 1 | sort -u` from silvius frontend. You may wish to add the following words as commands: <unk> [BREATH] [COUGH] [NOISE] [SMACK] [UH] [UM] 4. Execute run6.sh. This script should take 15 minutes, once the pretrained models and data are downloaded (1.5GB). 5. If you have any non-English words, the run6.sh script will point them out and exit. Write their pronunciations in phones in corpus.dict. To generate word pronunciations automatically, try ./pronounce.sh foobar. 6. The output is in exp/tri_6a and may be imported into silvius-backend with silvius-backend/models/import.sh. Models will be about 70MB. Please join the silvius mailing list if you have any questions! https://groups.google.com/forum/#!forum/silvius
About
Scripts for training command+english language models for Silvius
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published