-
1
-
-
34047266376
-
Advances in speech transcription at IBM under the DARPA EARS program
-
Sep.
-
S. Chen, B. Kingsbury, L. Mangu, D. Povey, G. Saon, H. Soltau, and G. Zweig, "Advances in speech transcription at IBM under the DARPA EARS program," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1596-1608, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.14
, Issue.5
, pp. 1596-1608
-
-
Chen, S.1
Kingsbury, B.2
Mangu, L.3
Povey, D.4
Saon, G.5
Soltau, H.6
Zweig, G.7
-
2
-
-
34047266379
-
Progress in the CU-HTK broadcast news transcription system
-
DOI 10.1109/TASL.2006.878264
-
M. Gales, D. Kim, P. Woodland, H. Chan, D. Mrva, R. Sinha, and S. Tranter, "Progress in the CU-HTK broadcast news transcription system," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1513-1525, Sep. 2006. (Pubitemid 46547578)
-
(2006)
IEEE Transactions on Audio, Speech and Language Processing
, vol.14
, Issue.5
, pp. 1513-1525
-
-
Gales, M.J.F.1
Kim, D.Y.2
Woodland, P.C.3
Chan, H.Y.4
Mrva, D.5
Sinha, R.6
Tranter, S.E.7
-
3
-
-
34147119672
-
Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system
-
Sep.
-
S. Matsoukas, J.-L. Gauvain, G. Adda, T. Colhurst, C.-L. Kao, O. Kim-ball, L. Lamel, F. Lefevre, J. Ma, J. Makhoul, L. Nguyen, R. Prasad, R. Schwartz, H. Schwenk, and B. Xiang, "Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1541-1556, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.14
, Issue.5
, pp. 1541-1556
-
-
Matsoukas, S.1
Gauvain, J.-L.2
Adda, G.3
Colhurst, T.4
Kao, C.-L.5
Kim-Ball, O.6
Lamel, L.7
Lefevre, F.8
Ma, J.9
Makhoul, J.10
Nguyen, L.11
Prasad, R.12
Schwartz, R.13
Schwenk, H.14
Xiang, B.15
-
4
-
-
4544293504
-
Moving beyond the 'beads-on-a-string' model of speech
-
M. Ostendorf, "Moving beyond the 'beads-on-a-string' model of speech," in Proc. IEEE ASRU Workshop, 1999, pp. 79-84.
-
(1999)
Proc. IEEE ASRU Workshop
, pp. 79-84
-
-
Ostendorf, M.1
-
5
-
-
85009110188
-
Learning long-term temporal features in LVCSR using neural networks
-
B. Y. Chen, Q. Zhu, and N. Morgan, "Learning long-term temporal features in LVCSR using neural networks," in Proc. ICSLP, 2004.
-
(2004)
Proc. ICSLP
-
-
Chen, B.Y.1
Zhu, Q.2
Morgan, N.3
-
6
-
-
0032658253
-
Temporal patterns (TRAPS) in ASR of noisy speech
-
H. Hermansky and S. Sharma, "Temporal patterns (TRAPS) in ASR of noisy speech," in Proc. ICASSP, 1999, pp. 289-292.
-
(1999)
Proc. ICASSP
, pp. 289-292
-
-
Hermansky, H.1
Sharma, S.2
-
7
-
-
85009227403
-
Data driven example based continuous speech recognition
-
W. D. Wachter, K. Demuynck, D. V. Compernolle, and P. Wambacq, "Data driven example based continuous speech recognition," in Proc. Eurospeech, 2003, pp. 1133-1136.
-
(2003)
Proc. Eurospeech
, pp. 1133-1136
-
-
Wachter, W.D.1
Demuynck, K.2
Compernolle, D.V.3
Wambacq, P.4
-
8
-
-
45549086638
-
Template-based continuous speech recognition
-
May
-
M. De Wachter, M. Matton, K. Demuynck, P. Wambacq, R. Cools, and D. Van Compernolle, "Template-based continuous speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1377-1390, May 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.4
, pp. 1377-1390
-
-
De Wachter, M.1
Matton, M.2
Demuynck, K.3
Wambacq, P.4
Cools, R.5
Van Compernolle, D.6
-
9
-
-
51449113682
-
Live search for mobile: Web services by voice on the cellphone
-
A. Acero, N. Bernstein, R. Chambers, Y. Ju, X. Li, J. Odell, P. Nguyen, O. Scholz, and G. Zweig, "Live search for mobile: Web services by voice on the cellphone," in Proc. ICASSP, 2007, pp. 5256-5259.
-
(2007)
Proc. ICASSP
, pp. 5256-5259
-
-
Acero, A.1
Bernstein, N.2
Chambers, R.3
Ju, Y.4
Li, X.5
Odell, J.6
Nguyen, P.7
Scholz, O.8
Zweig, G.9
-
10
-
-
78649245809
-
-
[Online] Available
-
[Online]. Available: http://www.tellme.com/you
-
-
-
-
11
-
-
78649250686
-
-
[Online] Available
-
[Online]. Available: http://vlingo.com
-
-
-
-
12
-
-
78649297785
-
-
[Online] Available
-
[Online]. Available: http://www.google.com/mobile/apple/app.html
-
-
-
-
13
-
-
78649256443
-
-
[Online] Available
-
[Online]. Available: http://mobile.yahoo.com/onesearch
-
-
-
-
14
-
-
84946710255
-
Maximum entropy direct models for speech recognition
-
H.-K. J. Kuo and Y. Gao, "Maximum entropy direct models for speech recognition," in Proc. ASRU, 2003.
-
(2003)
Proc. ASRU
-
-
Kuo, H.-K.J.1
Gao, Y.2
-
15
-
-
70349208656
-
A flat direct model for speech recognition
-
G. Heigold, G. Zweig, X. Li, and P. Nguyen, "A flat direct model for speech recognition," in Proc. ICASSP, 2009, pp. 3861-3864.
-
(2009)
Proc. ICASSP
, pp. 3861-3864
-
-
Heigold, G.1
Zweig, G.2
Li, X.3
Nguyen, P.4
-
16
-
-
70450201983
-
Maximum mutual information multiphone units in direct modeling
-
G. Zweig and P. Nguyen, "Maximum mutual information multiphone units in direct modeling," in Proc. Interspeech, 2009.
-
(2009)
Proc. Interspeech
-
-
Zweig, G.1
Nguyen, P.2
-
17
-
-
33846253039
-
Hidden conditional Random fields for phone classification
-
A. Gunawardana, M. Mahajan, A. Acero, and J. C. Platt, "Hidden conditional random fields for phone classification," in Proc. Interspeech, 2005.
-
(2005)
Proc. Interspeech
-
-
Gunawardana, A.1
Mahajan, M.2
Acero, A.3
Platt, J.C.4
-
18
-
-
0033887568
-
A survey of smoothing techniques for ME models
-
Jan.
-
S. Chen and R. Rosenfeld, "A survey of smoothing techniques for ME models," IEEE Trans. Speech Audio Process., vol. 8, no. 1, pp. 37-50, Jan. 2000.
-
(2000)
IEEE Trans. Speech Audio Process.
, vol.8
, Issue.1
, pp. 37-50
-
-
Chen, S.1
Rosenfeld, R.2
-
19
-
-
0004109478
-
Rprop\Description and implementation details Univ. of Karlsruhe Jan. 1994
-
M. Reidmiller, Rprop\Description and implementation details Univ. of Karlsruhe, Jan. 1994, Tech. Rep.
-
Tech. Rep
-
-
Reidmiller, M.1
-
20
-
-
85149106909
-
Discriminative language modeling with conditional Random fields and the perceptron algorithm
-
B. Roark, M. Saraclar, M. Collins, and M. Johnson, "Discriminative language modeling with conditional random fields and the perceptron algorithm," in Proc. ACL, 2004.
-
(2004)
Proc. ACL
-
-
Roark, B.1
Saraclar, M.2
Collins, M.3
Johnson, M.4
-
21
-
-
56149117265
-
An investigation into a simulation of episodic memory for automatic speech recognition
-
Sep.
-
V. Maier and R. Moore, "An investigation into a simulation of episodic memory for automatic speech recognition," in Proc. Interspeech, Sep. 2005.
-
(2005)
Proc. Interspeech
-
-
Maier, V.1
Moore, R.2
-
22
-
-
0032165145
-
A multispan language modeling framework for large vocabulary speech recognition
-
J. R. Bellegarda, "A multispan language modeling framework for large vocabulary speech recognition," IEEE Trans. Speech Audio Process., vol. 6, no. 5, pp. 456-467, 1998.
-
(1998)
IEEE Trans. Speech Audio Process.
, vol.6
, Issue.5
, pp. 456-467
-
-
Bellegarda, J.R.1
-
23
-
-
0035340439
-
Syllable-based large vocabulary continuous speech recognition
-
May
-
A. Ganapathiraju, J. Hamaker, J. Picone, M. Ordowski, and G. Dod-dington, "Syllable-based large vocabulary continuous speech recognition," IEEE Trans. Speech and Audio Processing, vol. 9, no. 4, pp. 358-366, May 2001.
-
(2001)
IEEE Trans. Speech and Audio Processing
, vol.9
, Issue.4
, pp. 358-366
-
-
Ganapathiraju, A.1
Hamaker, J.2
Picone, J.3
Ordowski, M.4
Dod-Dington, G.5
-
24
-
-
0029725372
-
Design of a speech recognition system based on acoustically derived segmental units
-
M. Bacchiani, M. Ostendorf, Y. Sagisaka, and K. Paliwal, "Design of a speech recognition system based on acoustically derived segmental units," in Proc. ICASSP, 1996, pp. 443-446.
-
(1996)
Proc. ICASSP
, pp. 443-446
-
-
Bacchiani, M.1
Ostendorf, M.2
Sagisaka, Y.3
Paliwal, K.4
-
25
-
-
0036476255
-
Automatic generation of subword units for speech recognition systems
-
Feb.
-
R. Singh, B. Raj, and R. Stern, "Automatic generation of subword units for speech recognition systems," IEEE Trans. Speech and Audio Processing, vol. 10, no. 2, pp. 89-99, Feb. 2002
-
(2002)
IEEE Trans. Speech and Audio Processing
, vol.10
, Issue.2
, pp. 89-99
-
-
Singh, R.1
Raj, B.2
Stern, R.3
|