Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Making the vocabulary trie #59

Closed
mineshmathew opened this issue Mar 19, 2018 · 2 comments
Closed

Making the vocabulary trie #59

mineshmathew opened this issue Mar 19, 2018 · 2 comments

Comments

@mineshmathew
Copy link

mineshmathew commented Mar 19, 2018

@ryanleary I was trying to work with your fork at https://github.com/ryanleary/ctcdecode.
The vanilla decoder works like a charm. But I cant figure out how the trie is being made using the function you mentioned in the README.

import pytorch_ctc
 
lexicon = '~/language_modelling/Jaderberg_90k_lexicon.txt'
output_path = '~/tries/4gram_JaderbergLexicon/'
kenlm_path = '~/language_modelling/lm_4gram_on_lob_and_brown.klm'
labels = '_0123456789abcdefghijklmnopqrstuvwxyz '

pytorch_ctc.generate_lm_trie(lexicon, kenlm_path, output_path, labels, 0, 37)

Above is my script to generate the trie . The script runs without any errors. But nothing is being created at the specified output path

Could you please tell me If I am doing it right

@ryanleary
Copy link
Collaborator

The trie is no longer needed. Please be sure to use the latest version of this repo.

@mineshmathew
Copy link
Author

mineshmathew commented Mar 19, 2018

@ryanleary Would you please point me to some help/document on how I can use this ? On your fork there was some write up on the functions to be used and the arguments. But I dont see anything here

Basically I am pretty confused with the branch/version of this repo I should use

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants