SCOPUS Information Search Platform

IEICE Transactions on Information and Systems

Volume E89-D, Issue 3, 2006, Pages 989-997

ATR parallel decoding based speech recognition system robust to noise and speaking styles

(4) Matsuda, Shigeki a,b,c; Jitsuhiro, Takatoshi a,b,d; Markov, Konstantin a,c,d,e,f; Nakamura, Satoshi a,g

a ADVANCED TELECOMMUNICATIONS RESEARCH INSTITUTE INTERNATIONAL (Japan)

b Acoustic Society of Japan (Singapore)

c IPSJ (Japan)

d IEEE

e Acoustics and Speech Processing Department

f ATR SLC

g UNIVERSITY OF KARLSRUHE (Germany)

Author keywords

Automatic speech recognition; Fast noise adaptation; Hyper articulated speech; Multiple acoustic models; Parallel decoding; Speaking style

Indexed keywords

DECODING; MATHEMATICAL MODELS; PARALLEL PROCESSING SYSTEMS; PROBABILITY; ROBUSTNESS (CONTROL SYSTEMS); SIGNAL TO NOISE RATIO; ACOUSTIC VARIABLES CONTROL; COMPUTER SIMULATION; SPEECH RECOGNITION;

AUTOMATIC SPEECH RECOGNITION; HYPER-ARTICULATED SPEECH; MULTIPLE ACOUSTIC MODELS; PARALLEL DECODING; FAST NOISE ADAPTATION; SPEAKING STYLE;

SPEECH RECOGNITION; DECODING;

EID: 33645752847 PISSN: 09168532 EISSN: 17451361 Source Type: Journal
DOI: 10.1093/ietisy/e89-d.3.989 Document Type: Conference Paper

Times cited : (12)

References (20)

1
- 0029288202
- Speech recognition in noisy environments: A survey
- Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol.16, no.3, pp.261-291, 1995.
- (1995) Speech Commun. , vol.16 , Issue.3 , pp. 261-291
- Gong, Y.¹

2
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- S.F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust. Speech Signal Process., vol.ASSP-27, pp.113-120, 1979.
- (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.ASSP-27 , pp. 113-120
- Boll, S.F.¹

3
- 0442317754
- ETSI ES 202 050 v1.1.1
- ETSI ES 202 050 v1.1.1 Speech Processing, Transmission and Quality aspects (STQ); Distributed Speech Recognition; Advanced Front-end Feature Extraction Algorithm; Compression Algorithms, ETSI, April 2002.
- (2002) Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-end Feature Extraction Algorithm; Compression Algorithms

4
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol.2, no.4, pp.578-589, 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

5
- 0030245128
- Robust continuous speech recognition using parallel model combination
- M. Gales and S. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Process., vol.4, no.5, pp.352-359, 1996.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 352-359
- Gales, M.¹ Young, S.²

6
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol.9, pp.171-185, 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

7
- 0027465491
- The Lombard reflex and its role on human listeners and automatic speech recognizers
- J.C. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizers," J. Acoust. Soc. Am., vol.93, pp.510-524, 1993.
- (1993) J. Acoust. Soc. Am. , vol.93 , pp. 510-524
- Junqua, J.C.¹

8
- 84888812064
- Towards the creation of acoustic models for stressed Japanese speech
- K. Okuda, T. Matsui, and S. Nakamura, "Towards the creation of acoustic models for stressed Japanese speech," Eurospeech2001, vol.3, pp. 1653-1656, 2001.
- (2001) Eurospeech2001 , vol.3 , pp. 1653-1656
- Okuda, K.¹ Matsui, T.² Nakamura, S.³

9
- 85009250844
- Speaking rate compensation based on likelihood criterion in acoustic model training and decoding
- K. Okuda, T. Kawahara, and S. Nakamura, "Speaking rate compensation based on likelihood criterion in acoustic model training and decoding," ICSLP2002, vol.4, pp.2589-2592, 2002.
- (2002) ICSLP2002 , vol.4 , pp. 2589-2592
- Okuda, K.¹ Kawahara, T.² Nakamura, S.³

10
- 85009070544
- Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition
- H. Nanjo, K. Kato, and T. Kawahara, "Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition," Eurospeech 2001, pp.2531-2534, 2001.
- (2001) Eurospeech 2001 , pp. 2531-2534
- Nanjo, H.¹ Kato, K.² Kawahara, T.³

11
- 0038719312
- Noise and channel distortion robust ASR system for DARPA SPINE2 task
- K. Markov, T. Matsui, R. Gruhn, J. Zhang, and S. Nakamura, "Noise and channel distortion robust ASR system for DARPA SPINE2 task," IEICE Trans. Inf. & Syst., vol.E86-D, no.3, March 2003.
- (2003) IEICE Trans. Inf. & Syst. , vol.E86-D , Issue.3
- Markov, K.¹ Matsui, T.² Gruhn, R.³ Zhang, J.⁴ Nakamura, S.⁵

12
- 0038373389
- Cepstrum derived from differentiated power spectrum for robust speech recognition
- J. Chen, K.K. Paliwal, and S. Nakamura, "Cepstrum derived from differentiated power spectrum for robust speech recognition," Speech Commun., vol.41, no.2-3, pp.469-484, 2003.
- (2003) Speech Commun. , vol.41 , Issue.2-3 , pp. 469-484
- Chen, J.¹ Paliwal, K.K.² Nakamura, S.³

13
- 33645769257
- HMM composition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Aurora2 corpus
- M. Ida and S. Nakamura, "HMM composition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Aurora2 corpus," ICSLP2002, vol.1, pp.437-440, 2002.
- (2002) ICSLP2002 , vol.1 , pp. 437-440
- Ida, M.¹ Nakamura, S.²

14
- 33745218350
- Generalized word posterior probability (GWPP) for measuring reliability of recognized words
- F.K. Soong, W.K. Lo, and S. Nakamura, "Generalized word posterior probability (GWPP) for measuring reliability of recognized words," CD-ROM Proc. SWIM2004, 2004.
- (2004) CD-ROM Proc. SWIM2004
- Soong, F.K.¹ Lo, W.K.² Nakamura, S.³

15
- 24144494616
- AURORA-2J: An evaluation framework for Japanese noisy speech recognition
- S. Nakamura, K. Takeda, K. Yamamoto, T. Yamada, S. Kuroiwa, N. Kitaoka, T. Nishiura, A. Sasou, M. Mizumachi, C. Miyajima, M. Fujimoto, and T. Endo, "AURORA-2J: An evaluation framework for Japanese noisy speech recognition," IEICE Trans. Inf. & Syst, vol.E88-D, no.3, pp.535-544, March 2005.
- (2005) IEICE Trans. Inf. & Syst , vol.E88-D , Issue.3 , pp. 535-544
- Nakamura, S.¹ Takeda, K.² Yamamoto, K.³ Yamada, T.⁴ Kuroiwa, S.⁵ Kitaoka, N.⁶ Nishiura, T.⁷ Sasou, A.⁸ Mizumachi, M.⁹ Miyajima, C.¹⁰ Fujimoto, M.¹¹ Endo, T.¹²

16
- 0003822743
- S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book, 2000.
- (2000) The HTK Book
- Young, S.¹ Kershaw, D.² Odell, J.³ Ollason, D.⁴ Valtchev, V.⁵ Woodland, P.⁶

17
- 4344627406
- Automatic generation of non-uniform HMM topologies based on the MDL criterion
- T. Jitsuhiro, T. Matsui, and S. Nakamura, "Automatic generation of non-uniform HMM topologies based on the MDL criterion," IEICE Trans. Inf. & Syst., vol.E87-D, no.8, pp.2121-2129, Aug. 2004.
- (2004) IEICE Trans. Inf. & Syst. , vol.E87-D , Issue.8 , pp. 2121-2129
- Jitsuhiro, T.¹ Matsui, T.² Nakamura, S.³

18
- 0038373395
- Multi-class composite N-gram language model
- H. Yamamoto, S. Isogai, and Y. Sagisaka, "Multi-class composite N-gram language model," Speech Commun., vol.41, pp.369-379, Oct. 2003.
- (2003) Speech Commun. , vol.41 , pp. 369-379
- Yamamoto, H.¹ Isogai, S.² Sagisaka, Y.³

19
- 0007601623
- Speech and language databases for speech translation research in ATR
- T. Takezawa, T. Morimoto, and Y. Sagisaka, "Speech and language databases for speech translation research in ATR," Proc. Oriental COCOSDA Workshop, pp. 148-155, 1998.
- (1998) Proc. Oriental COCOSDA Workshop , pp. 148-155
- Takezawa, T.¹ Morimoto, T.² Sagisaka, Y.³

20
- 84863704138
- Toward a broad-coverage bilingual corpus for speech translation of travel conversations in the real world
- T. Takezawa, E. Sumita, F. Sugaya, H. Yamamoto, and S. Yamamoto, "Toward a broad-coverage bilingual corpus for speech translation of travel conversations in the real world," Proc. LREC, vol.1, pp. 147-152, 2002.
- (2002) Proc. LREC , vol.1 , pp. 147-152
- Takezawa, T.¹ Sumita, E.² Sugaya, F.³ Yamamoto, H.⁴ Yamamoto, S.⁵

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.