About BK-tree you mentioned in the paper #142

JackSnowWolf · 2018-12-20T09:48:30Z

Hi!

You mentioned that you used BK-tree data structure to improve efficiency. Could you some how explain that how you use that data structure? I felt confused after I didn't find any details of BK-tree in your training code or inference code.

Thanks!

githubharald · 2019-02-04T19:49:57Z

this decoding strategy is pretty simple:

find approximation for recognized word using best path decoding
then, find the words (given as a dictionary) most similar to the approximation and put it in list of candidates (this can be done using a BK tree)
compute the probability (loss) of all possible candidates
return best scoring word

Python implementation see: https://github.com/githubharald/CTCDecoder/blob/master/src/LexiconSearch.py

JackSnowWolf · 2019-02-11T06:19:48Z

@githubharald Thank you very much!

JackSnowWolf closed this as completed Apr 5, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About BK-tree you mentioned in the paper #142

About BK-tree you mentioned in the paper #142

JackSnowWolf commented Dec 20, 2018

githubharald commented Feb 4, 2019

JackSnowWolf commented Feb 11, 2019

About BK-tree you mentioned in the paper #142

About BK-tree you mentioned in the paper #142

Comments

JackSnowWolf commented Dec 20, 2018

githubharald commented Feb 4, 2019

JackSnowWolf commented Feb 11, 2019