Spoken Language Support
============================================================================
This section documents Dragonfly's support for spoken languages.
Languages with speech recognition engine support
----------------------------------------------------------------------------
Speech recognition engines supported by Dragonfly have a set spoken
language. This language can be checked via the :attr:`engine.language`
property, which returns an `ISO 639-1 code`_ (e.g. "en")::

    from dragonfly import get_engine
    engine = get_engine()

    # Print the engine language.
    print("Engine language: {}".format(engine.language))

Each speech recognition engine supported by Dragonfly supports one or more
languages. These are listed below, with citations where available.
It is worth noting that Dragonfly's use of ISO 639-1 language codes means
that no distinction is made between variants of languages. For example,
U.S. English and U.K. English will both yield :code:`"en"` and be treated as
the same language, even though there are some differences.
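
The effect of dropping the variant can be illustrated with a minimal sketch. It assumes locale-style tags such as ``"en-US"`` or ``"en_GB"`` as input; the helper names ``primary_language_code`` and ``same_language`` are hypothetical and are not part of Dragonfly, whose engines determine their language code internally.

```python
def primary_language_code(tag):
    """Return the lowercase primary language part of a locale-style tag.

    Hypothetical helper, not a Dragonfly function. Accepts tags such as
    "en-US" or "en_GB" and returns the part before the first separator.
    """
    # Treat "_" and "-" as equivalent separators, then keep the first part.
    primary = tag.replace("_", "-").split("-", 1)[0]
    return primary.strip().lower()


def same_language(tag_a, tag_b):
    """Return True if two tags share a primary language, ignoring variants."""
    return primary_language_code(tag_a) == primary_language_code(tag_b)


print(primary_language_code("en-US"))   # en
print(primary_language_code("en_GB"))   # en
print(same_language("en-US", "en-GB"))  # True
print(same_language("en-US", "de-DE"))  # False
```

Under this scheme, anything keyed on the two-letter code alone cannot tell U.S. and U.K. English apart, which matches the behaviour described above.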
Languages supported by Dragon
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The following languages are supported by *Dragon Professional Individual*
version 15 [#f1]_:
* English (multiple variants)
* Dutch
* French
* German
* Italian
* Spanish
Please check the linked Nuance knowledge base page for the languages
supported by other versions and editions of Dragon.
Languages supported by Windows Speech Recognition
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The following languages are supported by Windows Speech Recognition (WSR) as
of 2016 [#f2]_:
* English (U.S.) (*)
* English (U.K.)
* Chinese (Simplified) (*)
* Chinese (Traditional)
* French (France)
* German (Germany)
* Japanese
* Spanish (Spain)
\* Successfully tested.
Microsoft no longer appears to document the languages available for WSR,
which is why the citation for this section is an archive.org link.
Currently, the best way to find out if your language is supported is to
look for available speech models in the Windows language settings:
**Settings** > **Time & Language** > **Language**.
Languages supported by CMU Pocket Sphinx
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The CMU Pocket Sphinx engine documentation page has a section on
:ref:`spoken language support <RefSphinxSpokenLanguageSupport>`. There are
CMU Pocket Sphinx models and dictionaries available from SourceForge for
the following languages [#f3]_:
* English (U.S.) (*)
* English (Indian)
* Catalan
* Chinese (Mandarin) (*)
* Dutch
* French
* German
* Greek
* Hindi
* Italian
* Kazakh
* Portuguese
* Russian (*)
* Spanish
\* Successfully tested.
English (U.S.) is the default language used by the CMU Pocket Sphinx engine.
Languages supported by Kaldi
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The following languages are supported by the :ref:`RefKaldiEngine`:
* English (U.S.)
Kaldi may support other languages in the future. This would require finding
good-quality models for those languages and making minor modifications to
enable their use by the `Kaldi Active Grammar`_ library.
You can request to have your language supported by opening a `new issue`_ or
by contacting `David Zurow (@daanzu)`_ directly.
Languages with built-in grammar support
----------------------------------------------------------------------------
Dragonfly's :class:`Integer`, :class:`IntegerRef` and :class:`Digits`
classes have support for multiple spoken languages. Each supported language
has a sub-package under :attr:`dragonfly.language`. The current engine
language will be used to load the language-specific content classes in these
sub-packages.
This functionality is **optional**. Languages other than those listed below
can still be used if the speech recognition engine supports them.
The following languages are supported:
* Arabic - "ar"
* Dutch - "nl"
* English - "en"
* German - "de"
* Indonesian - "id"
* Malaysian - "ms"
English has additional time, date and character related classes.
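
As a rough sketch, a script could check whether a given language code is among those listed above before relying on the language-specific classes. The dictionary and both function names below are hypothetical and written for illustration only. The sub-package naming (``dragonfly.language.<code>``) is an assumption inferred from the ``dragonfly.language.en`` module referenced in this section.

```python
# Codes copied from the list above; this dictionary is illustrative and is
# not an object exported by Dragonfly.
BUILTIN_GRAMMAR_LANGUAGES = {
    "ar": "Arabic",
    "nl": "Dutch",
    "en": "English",
    "de": "German",
    "id": "Indonesian",
    "ms": "Malaysian",
}


def has_builtin_grammar_support(language_code):
    """Return True if the code is one of the languages listed above."""
    return language_code.lower() in BUILTIN_GRAMMAR_LANGUAGES


def language_subpackage(language_code):
    """Return the assumed dotted sub-package name for a listed code."""
    code = language_code.lower()
    if code not in BUILTIN_GRAMMAR_LANGUAGES:
        raise ValueError("No built-in grammar support for %r" % language_code)
    # Assumption: sub-packages are named after the ISO 639-1 code.
    return "dragonfly.language." + code


print(has_builtin_grammar_support("en"))  # True
print(has_builtin_grammar_support("fr"))  # False
print(language_subpackage("de"))          # dragonfly.language.de
```

In practice the value passed in would be the engine's ``language`` property. A ``False`` result only means that the built-in number classes are unavailable for that language, not that the language cannot be used at all.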
Language classes reference
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
.. automodule:: dragonfly.language.en.short_number
   :members:
.. rubric:: References
.. [#f1] https://nuance.custhelp.com/app/answers/detail/a_id/6280/kw/Dragon%20NaturallySpeaking%20languages%20supported/related/1
.. [#f2] https://web.archive.org/web/20160501101405/http://www.microsoft.com:80/enable/products/windowsvista/speech.aspx
.. [#f3] https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/
.. Other links.
.. _David Zurow (@daanzu): https://github.com/daanzu
.. _ISO 639-1 code: https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes
.. _Kaldi Active Grammar: https://github.com/daanzu/kaldi-active-grammar
.. _new issue: https://github.com/dictation-toolbox/dragonfly/issues/new