Add KenLM wrapper #71

pkuyym · 2017-06-03T08:53:52Z

resolves #2229

xinghai-sun

This LM class should evaluate a sentence string directly rather than a list of word token indexes. String tokenization, dictionary loading, dictionary lookup, etc. should be handled inside it to make it easier to use.

xinghai-sun · 2017-06-05T08:02:01Z

deep_speech_2/KenLMModel.py

+
+    .. code-block:: python
+
+        pip install https://github.com/kpu/kenlm/archive/master.zip


Is it possible to move it to "requirements.txt"?

xinghai-sun · 2017-06-05T08:03:20Z

deep_speech_2/KenLMModel.py

+
+        pip install https://github.com/kpu/kenlm/archive/master.zip
+
+    Please refer to **Scalable Modified Kneser-Ney Language Model Estimation** for


Add a reference link.

Add a dot at the end of the sentence.

xinghai-sun · 2017-06-05T08:04:02Z

deep_speech_2/KenLMModel.py

+class KenLMModel(object):
+    """
+    Wrapper for KenLM language model.
+    You should install KenLM python interface first.


You should install --> Please install.

xinghai-sun · 2017-06-05T08:04:41Z

deep_speech_2/KenLMModel.py

+
+class KenLMModel(object):
+    """
+    Wrapper for KenLM language model.


Is there a full name for KenLM? Add a reference link.

xinghai-sun · 2017-06-05T08:06:05Z

deep_speech_2/KenLMModel.py

+        """
+        Initialize variables and load model.
+
+        :param model_path: Path of language model


Add a dot. Same below.

xinghai-sun · 2017-06-05T08:25:57Z

deep_speech_2/KenLMModel.py

+        :param id_list: Id list of word.
+        :param id_str_dict: list
+        """
+        assert len(id_list) > 0, 'invalid id list'


Replace it with raise ValueError.
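
A minimal sketch of the requested change, assuming the check stays at the top of the scoring method. The standalone helper `check_id_list` is hypothetical and only isolates the validation for illustration:

```python
def check_id_list(id_list):
    """Validate a word id list (hypothetical helper, not part of the PR)."""
    # Unlike an assert, this check is not stripped when Python runs with -O.
    if len(id_list) == 0:
        raise ValueError("invalid id list: expected at least one word id")
    return id_list
```

Callers can then catch `ValueError` explicitly instead of relying on an `AssertionError`.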

xinghai-sun · 2017-06-05T08:29:49Z

deep_speech_2/KenLMModel.py

+        import kenlm
+        self._model = kenlm.LanguageModel(self._model_path)
+
+    def score_sentence_ids(self, id_list):


A string input is better than an id list.
This class should also be responsible for tokenization, dictionary loading and dictionary lookup. Don't leave these to users.
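
One possible shape for a string-based interface is sketched below. The class name `SentenceScorer`, the method names, and the whitespace tokenization are all assumptions made for illustration, not the PR's code. It relies only on the KenLM Python binding's `score(sentence, bos, eos)` call, which returns a log10 probability, and keeps the `math.pow(10, score)` conversion from the diff:

```python
import math


class SentenceScorer(object):
    """Hypothetical string-based wrapper; names here are not from the PR."""

    def __init__(self, model):
        # `model` is any object exposing score(sentence, bos, eos) that
        # returns a log10 probability, as a loaded KenLM model does.
        self._model = model

    @classmethod
    def from_file(cls, model_path):
        # Imported lazily so the class can be exercised without KenLM.
        import kenlm
        return cls(kenlm.LanguageModel(model_path))

    def score_sentence(self, sentence, eos=True):
        # Assumption: words are separated by whitespace.
        words = sentence.strip().split()
        if not words:
            raise ValueError("sentence must contain at least one word")
        log10_prob = self._model.score(" ".join(words), bos=True, eos=eos)
        return math.pow(10, log10_prob)
```

Passing the model into the constructor keeps file loading separate from scoring, so the scoring logic can be tested with a stand-in object.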

xinghai-sun · 2017-06-05T08:32:57Z

deep_speech_2/KenLMModel.py

+        end token explicitly if the input sentence has been completed. 
+
+        :param id_list: Id list of word.
+        :param id_str_dict: list


Add return and rtype in the comments.
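
For illustration, a docstring carrying the requested fields could look like the sketch below. The function `log10_to_prob` is a hypothetical stand-in that mirrors only the `math.pow(10, score)` conversion shown in the diff:

```python
import math


def log10_to_prob(score):
    """
    Convert a base-10 log score to a probability.

    :param score: Base-10 log probability of a sentence.
    :type score: float
    :return: Probability corresponding to the score.
    :rtype: float
    """
    return math.pow(10, score)
```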

xinghai-sun · 2017-06-05T08:33:35Z

deep_speech_2/KenLMModel.py

+        return math.pow(10, score)
+
+
+if __name__ == '__main__':


Remove below or add this to a unit test file.
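
If the `__main__` block moves to a test file, it might look roughly like this sketch. `FakeModel` and `sentence_probability` are hypothetical stand-ins so the test runs without a trained model file; a real test would exercise the wrapper class from the PR instead:

```python
import math
import unittest


class FakeModel(object):
    """Stand-in for a loaded KenLM model (hypothetical, for tests only)."""

    def score(self, sentence, bos=True, eos=True):
        # Pretend every word costs exactly one log10 unit.
        return -float(len(sentence.split()))


def sentence_probability(model, sentence):
    """Hypothetical function under test: log10 score to probability."""
    return math.pow(10, model.score(sentence, bos=True, eos=True))


class TestSentenceProbability(unittest.TestCase):
    def test_two_words(self):
        prob = sentence_probability(FakeModel(), "hello world")
        self.assertAlmostEqual(prob, 0.01)

    def test_longer_sentence_is_less_likely(self):
        model = FakeModel()
        self.assertLess(
            sentence_probability(model, "a b c"),
            sentence_probability(model, "a b"))
```

The file can then be run with `python -m unittest` rather than executed as a script.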

xinghai-sun · 2017-06-05T08:34:17Z

deep_speech_2/KenLMModel.py

+                 id_str_dict,
+                 verbose=False):
+        """
+        Initialize variables and load model.


These function comments should be moved up into the class docstring.

Add KenLM wrapper

0a7cc92

pkuyym requested review from xinghai-sun and lcy-seso June 3, 2017 08:54

xinghai-sun mentioned this pull request Jun 5, 2017

add ctc beam search decoder #59

Merged

xinghai-sun requested changes Jun 5, 2017


pkuyym closed this Jun 26, 2017
