Volume E89-D, Issue 3, 2006, Pages 989-997

ATR parallel decoding based speech recognition system robust to noise and speaking styles

Author keywords

Automatic speech recognition; Fast noise adaptation; Hyper articulated speech; Multiple acoustic models; Parallel decoding; Speaking style

Indexed keywords

DECODING; MATHEMATICAL MODELS; PARALLEL PROCESSING SYSTEMS; PROBABILITY; ROBUSTNESS (CONTROL SYSTEMS); SIGNAL TO NOISE RATIO; ACOUSTIC VARIABLES CONTROL; COMPUTER SIMULATION; SPEECH RECOGNITION

EID: 33645752847     PISSN: 09168532     EISSN: 17451361     Source Type: Journal    
DOI: 10.1093/ietisy/e89-d.3.989     Document Type: Conference Paper
Times cited: 12

References (20)
  • 1
    • 0029288202
    • Speech recognition in noisy environments: A survey
    • Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol.16, no.3, pp.261-291, 1995.
    • (1995) Speech Commun. , vol.16 , Issue.3 , pp. 261-291
    • Gong, Y.1
  • 2
    • 0018455310
    • Suppression of acoustic noise in speech using spectral subtraction
    • S.F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust. Speech Signal Process., vol.ASSP-27, pp.113-120, 1979.
    • (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.ASSP-27 , pp. 113-120
    • Boll, S.F.1
  • 5
    • 0030245128
    • Robust continuous speech recognition using parallel model combination
    • M. Gales and S. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Process., vol.4, no.5, pp.352-359, 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 352-359
    • Gales, M.1    Young, S.2
  • 6
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol.9, pp.171-185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 7
    • 0027465491
    • The Lombard reflex and its role on human listeners and automatic speech recognizer
    • J.C. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizer," J. Acoust. Soc. Am., vol.93, pp.510-524, 1993.
    • (1993) J. Acoust. Soc. Am. , vol.93 , pp. 510-524
    • Junqua, J.C.1
  • 8
    • 84888812064
    • Towards the creation of acoustic models for stressed Japanese speech
    • K. Okuda, T. Matsui, and S. Nakamura, "Towards the creation of acoustic models for stressed Japanese speech," Eurospeech2001, vol.3, pp. 1653-1656, 2001.
    • (2001) Eurospeech2001 , vol.3 , pp. 1653-1656
    • Okuda, K.1    Matsui, T.2    Nakamura, S.3
  • 9
    • 85009250844
    • Speaking rate compensation based on likelihood criterion in acoustic model training and decoding
    • K. Okuda, T. Kawahara, and S. Nakamura, "Speaking rate compensation based on likelihood criterion in acoustic model training and decoding," ICSLP2002, vol.4, pp.2589-2592, 2002.
    • (2002) ICSLP2002 , vol.4 , pp. 2589-2592
    • Okuda, K.1    Kawahara, T.2    Nakamura, S.3
  • 10
    • 85009070544
    • Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition
    • H. Nanjo, K. Kato, and T. Kawahara, "Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition," Eurospeech 2001, pp.2531-2534, 2001.
    • (2001) Eurospeech 2001 , pp. 2531-2534
    • Nanjo, H.1    Kato, K.2    Kawahara, T.3
  • 11
    • 0038719312
    • Noise and channel distortion robust ASR system for DARPA SPINE2 task
    • March
    • K. Markov, T. Matsui, R. Gruhn, J. Zhang, and S. Nakamura, "Noise and channel distortion robust ASR system for DARPA SPINE2 task," IEICE Trans. Inf. & Syst., vol.E86-D, no.3, March 2003.
    • (2003) IEICE Trans. Inf. & Syst. , vol.E86-D , Issue.3
    • Markov, K.1    Matsui, T.2    Gruhn, R.3    Zhang, J.4    Nakamura, S.5
  • 12
    • 0038373389
    • Cepstrum derived from differentiated power spectrum for robust speech recognition
    • J. Chen, K.K. Paliwal, and S. Nakamura, "Cepstrum derived from differentiated power spectrum for robust speech recognition," Speech Commun., vol.41, no.2-3, pp.469-484, 2003.
    • (2003) Speech Commun. , vol.41 , Issue.2-3 , pp. 469-484
    • Chen, J.1    Paliwal, K.K.2    Nakamura, S.3
  • 13
    • 33645769257
    • HMM composition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Aurora2 corpus
    • M. Ida and S. Nakamura, "HMM composition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Aurora2 corpus," ICSLP2002, vol.1, pp.437-440, 2002.
    • (2002) ICSLP2002 , vol.1 , pp. 437-440
    • Ida, M.1    Nakamura, S.2
  • 14
    • 33745218350
    • Generalized word posterior probability (GWPP) for measuring reliability of recognized words
    • F.K. Soong, W.K. Lo, and S. Nakamura, "Generalized word posterior probability (GWPP) for measuring reliability of recognized words," CD-ROM Proc. SWIM2004, 2004.
    • (2004) CD-ROM Proc. SWIM2004
    • Soong, F.K.1    Lo, W.K.2    Nakamura, S.3
  • 17
    • 4344627406
    • Automatic generation of non-uniform HMM topologies based on the MDL criterion
    • Aug.
    • T. Jitsuhiro, T. Matsui, and S. Nakamura, "Automatic generation of non-uniform HMM topologies based on the MDL criterion," IEICE Trans. Inf. & Syst., vol.E87-D, no.8, pp.2121-2129, Aug. 2004.
    • (2004) IEICE Trans. Inf. & Syst. , vol.E87-D , Issue.8 , pp. 2121-2129
    • Jitsuhiro, T.1    Matsui, T.2    Nakamura, S.3
  • 18
    • 0038373395
    • Multi-class composite N-gram language model
    • Oct.
    • H. Yamamoto, S. Isogai, and Y. Sagisaka, "Multi-class composite N-gram language model," Speech Commun., vol.41, no.2-3, pp.369-379, Oct. 2003.
    • (2003) Speech Commun. , vol.41 , Issue.2-3 , pp. 369-379
    • Yamamoto, H.1    Isogai, S.2    Sagisaka, Y.3
  • 19
  • 20
    • 84863704138
    • Toward a broad-coverage bilingual corpus for speech translation of travel conversations in the real world
    • T. Takezawa, E. Sumita, F. Sugaya, H. Yamamoto, and S. Yamamoto, "Toward a broad-coverage bilingual corpus for speech translation of travel conversations in the real world," Proc. LREC, vol.1, pp. 147-152, 2002.
    • (2002) Proc. LREC , vol.1 , pp. 147-152
    • Takezawa, T.1    Sumita, E.2    Sugaya, F.3    Yamamoto, H.4    Yamamoto, S.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.