
New: add(words) API and some code improvements #34

Merged
4 commits merged into development on May 25, 2017

Conversation

BrunoBerisso
Contributor

This branch has some general code improvements (fixed access levels, preferring guard over if, etc.) and three important changes:

  1. Add a new API to add words to the recognition dictionary at runtime. Be aware that new words can't be added while recognition is in progress; you should add new words before starting a recognition process.
    The API expects an array of String tuples of the form (word: "HELLO", phones: "HH EH L OW"). The first component is the word in plain English. The second is the pronunciation phones as they appear in the cmudict (more here: http://www.speech.cs.cmu.edu/tools/lextool.html). In the future the second component should be calculated automatically.

  2. The decode functions now throw errors where applicable.

  3. There is a new approach to the live decode logic using AVAudioConverter. The idea is to read the data in a format more natural for iOS (Float32, 16000 Hz) and convert it to the Sphinx format (Int16, 16000 Hz). AVAudioConverter is only available from iOS 9.0, so the deployment target needs to change. This should address "Device does not support required sample rate recording" #24 and "ps_add_word" #33.
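A minimal sketch of the Float32 → Int16 sample conversion described in point 3, written in plain Swift to make the idea concrete. In the PR itself, AVAudioConverter performs this conversion (plus any resampling) on real AVAudioPCMBuffers; this helper only illustrates the scaling step and is not part of the library.

```swift
// Sketch only: convert normalized Float32 samples (the format iOS records
// in most naturally) to the Int16 samples PocketSphinx expects.
func floatSamplesToInt16(_ samples: [Float]) -> [Int16] {
    samples.map { sample in
        // Clamp to the valid [-1.0, 1.0] range, then scale to Int16.
        let clamped = max(-1.0, min(1.0, sample))
        return Int16(clamped * Float(Int16.max))
    }
}
```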

Please let everybody know your thoughts about these changes.
Thanks!
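A usage sketch of the add(words) API from point 1. The `decoder` variable and the exact method signature are assumptions based on this description, not the library's confirmed API.

```swift
// Hypothetical usage sketch: names below are assumed from the PR
// description, not the confirmed library API.
let newWords: [(word: String, phones: String)] = [
    (word: "HELLO", phones: "HH EH L OW"),   // phones as in the cmudict
    (word: "WORLD", phones: "W ER L D")
]

// Words must be added before a recognition process starts, never while
// one is in progress.
decoder.add(words: newWords)
// ...now it is safe to start decoding.
```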

Bruno Berisso added 4 commits January 24, 2017 12:13
…nstead of open. The same goes for the functions
- Change some 'if' statements to 'guard', mostly in the tests
- Use STrue | SFalse instead of 1 | 0 to denote true | false where applicable
…gin in live decoding. The idea is to read the data in a format more natural for iOS (Float32, 16000 Hz) and convert it (with AVAudioConverter) to the Sphinx format (Int16, 16000 Hz). AVAudioConverter is only available from iOS 9.0, so the deployment target needs to change.
…Be aware that new words can't be added while recognition is in progress. You should add new words before starting a recognition process.

The API expects an array of String tuples of the form (word: 'HELLO', phones: 'HH EH L OW'). The first component is the word in plain English. The second is the pronunciation phones as they appear in the cmudict (more here: http://www.speech.cs.cmu.edu/tools/lextool.html). In the future the second component should be calculated automatically.
@BrunoBerisso BrunoBerisso merged commit c05fdef into development May 25, 2017