-
1
-
-
0029288202
-
Speech recognition in noisy environments: A survey
-
Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol.16, no.3, pp.261-291, 1995.
-
(1995)
Speech Commun.
, vol.16
, Issue.3
, pp. 261-291
-
-
Gong, Y.1
-
2
-
-
0018455310
-
Suppression of acoustic noise in speech using spectral subtraction
-
S.F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust. Speech Signal Process., vol.ASSP-27, pp.113-120, 1979.
-
(1979)
IEEE Trans. Acoust. Speech Signal Process.
, vol.ASSP-27
, pp. 113-120
-
-
Boll, S.F.1
-
4
-
-
0028517164
-
RASTA processing of speech
-
H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol.2, no.4, pp.587-589, 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.4
, pp. 587-589
-
-
Hermansky, H.1
Morgan, N.2
-
5
-
-
0030245128
-
Robust continuous speech recognition using parallel model combination
-
M. Gales and S. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Process., vol.4, no.5, pp.352-359, 1996.
-
(1996)
IEEE Trans. Speech Audio Process.
, vol.4
, Issue.5
, pp. 352-359
-
-
Gales, M.1
Young, S.2
-
6
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol.9, pp.171-185, 1995.
-
(1995)
Comput. Speech Lang.
, vol.9
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
7
-
-
0027465491
-
The Lombard reflex and its role on human listeners and automatic speech recognizer
-
J.C. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizer," J. Acoust. Soc. Am., vol.93, pp.510-524, 1993.
-
(1993)
J. Acoust. Soc. Am.
, vol.93
, pp. 510-524
-
-
Junqua, J.C.1
-
8
-
-
84888812064
-
Towards the creation of acoustic models for stressed Japanese speech
-
K. Okuda, T. Matsui, and S. Nakamura, "Towards the creation of acoustic models for stressed Japanese speech," Eurospeech2001, vol.3, pp. 1653-1656, 2001.
-
(2001)
Eurospeech2001
, vol.3
, pp. 1653-1656
-
-
Okuda, K.1
Matsui, T.2
Nakamura, S.3
-
9
-
-
85009250844
-
Speaking rate compensation based on likelihood criterion in acoustic model training and decoding
-
K. Okuda, T. Kawahara, and S. Nakamura, "Speaking rate compensation based on likelihood criterion in acoustic model training and decoding," ICSLP2002, vol.4, pp.2589-2592, 2002.
-
(2002)
ICSLP2002
, vol.4
, pp. 2589-2592
-
-
Okuda, K.1
Kawahara, T.2
Nakamura, S.3
-
10
-
-
85009070544
-
Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition
-
H. Nanjo, K. Kato, and T. Kawahara, "Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition," Eurospeech 2001, pp.2531-2534, 2001.
-
(2001)
Eurospeech 2001
, pp. 2531-2534
-
-
Nanjo, H.1
Kato, K.2
Kawahara, T.3
-
11
-
-
0038719312
-
Noise and channel distortion robust ASR system for DARPA SPINE2 task
-
March
-
K. Markov, T. Matsui, R. Gruhn, J. Zhang, and S. Nakamura, "Noise and channel distortion robust ASR system for DARPA SPINE2 task," IEICE Trans. Inf. & Syst., vol.E86-D, no.3, March 2003.
-
(2003)
IEICE Trans. Inf. & Syst.
, vol.E86-D
, Issue.3
-
-
Markov, K.1
Matsui, T.2
Gruhn, R.3
Zhang, J.4
Nakamura, S.5
-
12
-
-
0038373389
-
Cepstrum derived from differentiated power spectrum for robust speech recognition
-
J. Chen, K.K. Paliwal, and S. Nakamura, "Cepstrum derived from differentiated power spectrum for robust speech recognition," Speech Commun., vol.41, no.2-3, pp.469-484, 2003.
-
(2003)
Speech Commun.
, vol.41
, Issue.2-3
, pp. 469-484
-
-
Chen, J.1
Paliwal, K.K.2
Nakamura, S.3
-
13
-
-
33645769257
-
HMM composition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Auroral corpus
-
M. Ida and S. Nakamura, "HMM composition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Auroral corpus," ICSLP2002, vol.1, pp.437-440, 2002.
-
(2002)
ICSLP2002
, vol.1
, pp. 437-440
-
-
Ida, M.1
Nakamura, S.2
-
14
-
-
33745218350
-
Generalized word posterior probability (GWPP) for measuring reliability of recognized words
-
F.K. Soong, W.K. Lo, and S. Nakamura, "Generalized word posterior probability (GWPP) for measuring reliability of recognized words," CD-ROM Proc. SWIM2004, 2004.
-
(2004)
CD-ROM Proc. SWIM2004
-
-
Soong, F.K.1
Lo, W.K.2
Nakamura, S.3
-
15
-
-
24144494616
-
AURORA-2J: An evaluation framework for Japanese noisy speech recognition
-
March
-
S. Nakamura, K. Takeda, K. Yamamoto, T. Yamada, S. Kuroiwa, N. Kitaoka, T. Nishiura, A. Sasou, M. Mizumachi, C. Miyajima, M. Fujimoto, and T. Endo, "AURORA-2J: An evaluation framework for Japanese noisy speech recognition," IEICE Trans. Inf. & Syst, vol.E88-D, no.3, pp.535-544, March 2005.
-
(2005)
IEICE Trans. Inf. & Syst
, vol.E88-D
, Issue.3
, pp. 535-544
-
-
Nakamura, S.1
Takeda, K.2
Yamamoto, K.3
Yamada, T.4
Kuroiwa, S.5
Kitaoka, N.6
Nishiura, T.7
Sasou, A.8
Mizumachi, M.9
Miyajima, C.10
Fujimoto, M.11
Endo, T.12
-
16
-
-
0003822743
-
-
S. Young, D. Kershow, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book, 2000.
-
(2000)
The HTK Book
-
-
Young, S.1
Kershow, D.2
Odell, J.3
Ollason, D.4
Valtchev, V.5
Woodland, P.6
-
17
-
-
4344627406
-
Automatic generation of non-uniform HMM topologies based on the MDL criterion
-
Aug.
-
T. Jitsuhiro, T. Matsui, and S. Nakamura, "Automatic generation of non-uniform HMM topologies based on the MDL criterion," IEICE Trans. Inf. & Syst., vol.E87-D, no.8, pp.2121-2129, Aug. 2004.
-
(2004)
IEICE Trans. Inf. & Syst.
, vol.E87-D
, Issue.8
, pp. 2121-2129
-
-
Jitsuhiro, T.1
Matsui, T.2
Nakamura, S.3
-
18
-
-
0038373395
-
Multi-class composite N-gram language model
-
Oct.
-
H. Yamamoto, S. Isogai, and Y. Sagisaka, "Multi-class composite N-gram language model," Speech Commun., vol.41-2003, pp.369-379, Oct. 2003.
-
(2003)
Speech Commun.
, vol.41
, Issue.2003
, pp. 369-379
-
-
Yamamoto, H.1
Isogai, S.2
Sagisaka, Y.3
-
19
-
-
0007601623
-
Speech and language databases for speech translation research in ATR
-
T. Takezawa, T. Morimoto, and Y. Sagisaka, "Speech and language databases for speech translation research in ATR," Proc. Oriental COCOSDA Workshop, pp. 148-155, 1998.
-
(1998)
Proc. Oriental COCOSDA Workshop
, pp. 148-155
-
-
Takezawa, T.1
Morimoto, T.2
Sagisaka, Y.3
-
20
-
-
84863704138
-
Toward a broad-coverage bilingual corpus for speech translation of travel conversations in the real world
-
T. Takezawa, E. Sumita, F. Sugaya, H. Yamamoto, and S. Yamamoto, "Toward a broad-coverage bilingual corpus for speech translation of travel conversations in the real world," Proc. LREC, vol.1, pp. 147-152, 2002.
-
(2002)
Proc. LREC
, vol.1
, pp. 147-152
-
-
Takezawa, T.1
Sumita, E.2
Sugaya, F.3
Yamamoto, H.4
Yamamoto, S.5
|