You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
depending on whether you are using a statistical language
30
-
model or a finite-state grammar or look for a keyphase.
59
+
By default only errors are printed to standard error, but if you want more information you can pass ‘\f[CR]-loglevelINFO\fP’. Partial results are not printed, maybe they will be in the future, but don't hold your breath. Force-alignment is likely to be supported soon, however.
31
60
.SH OPTIONS
32
61
.TP
33
-
.B\-adcdev
34
-
of audio device to use for input.
35
-
.TP
36
62
.B\-agc
37
63
Automatic gain control for c0 ('max', 'emax', 'noise', or 'none')
38
64
.TP
39
65
.B\-agcthresh
40
66
Initial threshold for automatic gain control
41
67
.TP
42
68
.B\-allphone
43
-
phoneme decoding with phonetic lm
69
+
phoneme decoding with phonetic lm (given here)
44
70
.TP
45
71
.B\-allphone_ci
46
72
Perform phoneme decoding with phonetic lm and context-independent units only
47
73
.TP
48
74
.B\-alpha
49
75
Preemphasis parameter
50
76
.TP
51
-
.B\-argfile
52
-
file giving extra arguments.
53
-
.TP
54
77
.B\-ascale
55
78
Inverse of acoustic model scale for confidence score calculation
56
79
.TP
57
80
.B\-aw
58
81
Inverse weight applied to acoustic scores.
59
82
.TP
60
83
.B\-backtrace
61
-
Print results and backtraces to log file.
84
+
Print results and backtraces to log.
62
85
.TP
63
86
.B\-beam
64
87
Beam width applied to every frame in Viterbi search (smaller values mean wider beam)
@@ -73,17 +96,14 @@ Language model probability weight for bestpath search
73
96
Number of components in the input feature vector
74
97
.TP
75
98
.B\-cmn
76
-
Cepstral mean normalization scheme ('current', 'prior', or 'none')
99
+
Cepstral mean normalization scheme ('live', 'batch', or 'none')
77
100
.TP
78
101
.B\-cmninit
79
-
Initial values (comma-separated) for cepstral mean when 'prior' is used
102
+
Initial values (comma-separated) for cepstral mean when 'live' is used
80
103
.TP
81
104
.B\-compallsen
82
105
Compute all senone scores in every frame (can be faster when there are many senones)
83
106
.TP
84
-
.B\-debug
85
-
level for debugging messages
86
-
.TP
87
107
.B\-dict
88
108
pronunciation dictionary (lexicon) input file
89
109
.TP
@@ -117,6 +137,12 @@ Frame rate
117
137
.B\-fsg
118
138
format finite state grammar file
119
139
.TP
140
+
.B\-fsgdir
141
+
directory for FSG files
142
+
.TP
143
+
.B\-fsgext
144
+
extension for FSG files (including leading dot)
145
+
.TP
120
146
.B\-fsgusealtpron
121
147
Add alternate pronunciations to FSG
122
148
.TP
@@ -147,12 +173,6 @@ Run forward lexicon-tree search (1st pass)
147
173
.B\-hmm
148
174
containing acoustic model files.
149
175
.TP
150
-
.B\-infile
151
-
file to transcribe.
152
-
.TP
153
-
.B\-inmic
154
-
Transcribe audio from microphone.
155
-
.TP
156
176
.B\-input_endian
157
177
Endianness of input data, big or little, ignored if NIST or MS Wav
158
178
.TP
@@ -169,7 +189,7 @@ file with keyphrases to spot, one per line
169
189
Delay to wait for best detection score
170
190
.TP
171
191
.B\-kws_plp
172
-
Phone loop probability for keyword spotting
192
+
Phone loop probability for keyphrase spotting
173
193
.TP
174
194
.B\-kws_threshold
175
195
Threshold for p(hyp)/p(alternatives) ratio
@@ -201,6 +221,9 @@ Base in which all log-likelihoods calculated
201
221
.B\-logfn
202
222
to write log messages in
203
223
.TP
224
+
.B\-loglevel
225
+
Minimum level of log messages (DEBUG, INFO, WARN, ERROR)
226
+
.TP
204
227
.B\-logspec
205
228
Write out logspectral files instead of cepstra
206
229
.TP
@@ -250,7 +273,7 @@ Use memory-mapped I/O (if possible) for model files
250
273
Number of cep coefficients
251
274
.TP
252
275
.B\-nfft
253
-
Size of FFT
276
+
Size of FFT, or 0 to set automatically (recommended)
254
277
.TP
255
278
.B\-nfilt
256
279
Number of filter banks
@@ -286,7 +309,7 @@ to log raw audio files to
286
309
Remove DC offset from each frame
287
310
.TP
288
311
.B\-remove_noise
289
-
Remove noise with spectral subtraction in mel-energies
312
+
Remove noise using spectral subtraction
290
313
.TP
291
314
.B\-round_filters
292
315
Round mel filter frequencies to DFT points
@@ -315,9 +338,6 @@ Write out cepstral-smoothed logspectral files
315
338
.B\-svspec
316
339
specification (e.g., 24,0-11/25,12-23/26-38 or 0-12/13-25/26-38)
By default only errors are printed to standard error, but if you want more information you can pass ‘\f[CR]-loglevelINFO\fP’. Partial results are not printed, maybe they will be in the future, but don't hold your breath. Force-alignment is likely to be supported soon, however.
60
+
.SH OPTIONS
61
+
.\" ### ARGUMENTS ###
62
+
.SH AUTHOR
63
+
Written by numerous people at CMU from 1994 onwards. This manual page
64
+
by David Huggins-Daines <dhdaines@gmail.com>
65
+
.SH COPYRIGHT
66
+
Copyright \(co 1994-2016 Carnegie Mellon University. See the file
67
+
\fILICENSE\fR included with this package for more information.
0 commit comments