Volume 41, Issue 3 SPEC. ISS., 2005, Pages 245-254

Toward robust speech recognition and understanding

Author keywords

Acoustic models; Adaptation; Corpus; Dialogue; Language models; Multi modal; Robustness; Speech recognition; Speech understanding; Spontaneous speech; Summarization

Indexed keywords

ACOUSTIC MODELS; ADAPTATION; CORPUS; DIALOGUE; LANGUAGE MODELS; MULTI-MODAL; SPEECH UNDERSTANDING; SPONTANEOUS SPEECH; SUMMARIZATION

EID: 29344435134     PISSN: 13875485     EISSN: None     Source Type: Journal    
DOI: 10.1007/s11265-005-4149-x     Document Type: Conference Paper
Times cited: 3

References (26)
  • 1
    • 0000763574
    • Automatic recognition and understanding of spoken language - A first step towards natural human-machine communication
    • B.-H. Juang and S. Furui, "Automatic Recognition and Understanding of Spoken Language - A First Step Towards Natural Human-Machine Communication," Proc. IEEE, vol. 88, no. 8, 2000, pp. 1142-1165.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1142-1165
    • Juang, B.-H.1    Furui, S.2
  • 4
    • 0012078715
    • Corpus-based statistical methods in speech and language processing
    • S. Young and G. Bloothooft (Eds.), Kluwer
    • H. Ney, "Corpus-Based Statistical Methods in Speech and Language Processing," in Corpus-based Methods in Language and Speech Processing, S. Young and G. Bloothooft (Eds.), Kluwer, 1997, pp. 1-26.
    • (1997) Corpus-based Methods in Language and Speech Processing , pp. 1-26
    • Ney, H.1
  • 6
    • 0003779990
    • Steps toward natural human-machine communication in the 21st century
    • Ghent
    • S. Furui, "Steps Toward Natural Human-Machine Communication in the 21st Century," in Proc. ISCA Workshop on Voice Operated Telecom Services, Ghent, 2000, pp. 17-24.
    • (2000) Proc. ISCA Workshop on Voice Operated Telecom Services , pp. 17-24
    • Furui, S.1
  • 7
    • 84963901011
    • The AT&T-DARPA communicator mixed-initiative spoken dialogue system
    • Beijing
    • E. Levin et al., "The AT&T-DARPA Communicator Mixed-Initiative Spoken Dialogue System," in Proc. ICSLP, Beijing, 2000, pp. II-122-125.
    • (2000) Proc. ICSLP
    • Levin, E.1
  • 8
    • 0002517880
    • Audio-visual large vocabulary continuous speech recognition in the broadcast domain
    • Copenhagen
    • S. Basu et al., "Audio-Visual Large Vocabulary Continuous Speech Recognition in the Broadcast Domain," in Proc. IEEE Multimedia Signal Processing (MMSP), Copenhagen, 1999, pp. 475-481.
    • (1999) Proc. IEEE Multimedia Signal Processing (MMSP) , pp. 475-481
    • Basu, S.1
  • 9
    • 85009205170
    • Toward spontaneous speech recognition and understanding
    • W. Chou and B.-H. Juang (Eds.), CRC Press
    • S. Furui, "Toward Spontaneous Speech Recognition and Understanding," in Pattern Recognition in Speech and Language Processing, W. Chou and B.-H. Juang (Eds.), CRC Press, 2003, pp. 191-227.
    • (2003) Pattern Recognition in Speech and Language Processing , pp. 191-227
    • Furui, S.1
  • 10
    • 85009062702
    • Towards automatic transcription of spontaneous presentations
    • Aalborg
    • T. Shinozaki et al., "Towards Automatic Transcription of Spontaneous Presentations," in Proc. Eurospeech, Aalborg, vol. 1, 2001, pp. 491-494.
    • (2001) Proc. Eurospeech , vol.1 , pp. 491-494
    • Shinozaki, T.1
  • 11
    • 0036298775
    • Analysis on individual differences in automatic transcription of spontaneous presentations
    • Orlando
    • T. Shinozaki and S. Furui, "Analysis on Individual Differences in Automatic Transcription of Spontaneous Presentations," in Proc. ICASSP, Orlando, 2002, pp. I-729-732.
    • (2002) Proc. ICASSP
    • Shinozaki, T.1    Furui, S.2
  • 12
    • 0036642566
    • On-line incremental speaker adaptation for broadcast news transcription
    • Z. Zhang et al., "On-Line Incremental Speaker Adaptation for Broadcast News Transcription," in Speech Communication, vol. 37, 2002, pp. 271-281.
    • (2002) Speech Communication , vol.37 , pp. 271-281
    • Zhang, Z.1
  • 13
    • 84946806722
    • An online incremental speaker adaptation method using speaker-clustered initial models
    • Beijing
    • Z. Zhang et al., "An Online Incremental Speaker Adaptation Method Using Speaker-Clustered Initial Models," in Proc. ICSLP, Beijing, 2000, pp. III-694-697.
    • (2000) Proc. ICSLP
    • Zhang, Z.1
  • 14
    • 85017310148
    • An improved approach to the hidden Markov model decomposition of speech and noise
    • San Francisco
    • M.J.F. Gales et al., "An Improved Approach to the Hidden Markov Model Decomposition of Speech and Noise," in Proc. ICASSP, San Francisco, 1992, pp. 233-236.
    • (1992) Proc. ICASSP , pp. 233-236
    • Gales, M.J.F.1
  • 15
    • 85135371131
    • Recognition of noisy speech by composition of hidden Markov models
    • Berlin
    • F. Martin et al., "Recognition of Noisy Speech by Composition of Hidden Markov Models," in Proc. Eurospeech, Berlin, 1993, pp. 1031-1034.
    • (1993) Proc. Eurospeech , pp. 1031-1034
    • Martin, F.1
  • 17
    • 85009230988
    • Tree-structured noise-adapted HMM modeling for piecewise linear-transformation-based adaptation
    • Geneva
    • Z. Zhang et al., "Tree-Structured Noise-Adapted HMM Modeling for Piecewise Linear-Transformation-Based Adaptation," in Proc. Eurospeech, Geneva, 2003.
    • (2003) Proc. Eurospeech
    • Zhang, Z.1
  • 18
    • 85009168871
    • Time adjustable mixture weights for speaking rate fluctuation
    • Geneva
    • T. Shinozaki and S. Furui, "Time Adjustable Mixture Weights for Speaking Rate Fluctuation," in Proc. Eurospeech, Geneva, 2003.
    • (2003) Proc. Eurospeech
    • Shinozaki, T.1    Furui, S.2
  • 19
    • 0038784279
    • Bayesian network structures and inference techniques for automatic speech recognition
    • G. Zweig, "Bayesian Network Structures and Inference Techniques for Automatic Speech Recognition," Computer Speech and Language, vol. 17, 2003, pp. 173-193.
    • (2003) Computer Speech and Language , vol.17 , pp. 173-193
    • Zweig, G.1
  • 20
    • 9444287310
    • Unsupervised language model adaptation using word classes for spontaneous speech recognition
    • Tokyo
    • Y. Yokoyama et al., "Unsupervised Language Model Adaptation Using Word Classes for Spontaneous Speech Recognition," in Proc. IEEE-ISCA Workshop on Spontaneous Speech Processing and Recognition, Tokyo, 2003, pp. 71-74.
    • (2003) Proc. IEEE-ISCA Workshop on Spontaneous Speech Processing and Recognition , pp. 71-74
    • Yokoyama, Y.1
  • 21
    • 4544373699
    • Parallel computing-based architecture for mixed-initiative spoken dialogue
    • Pittsburgh
    • R. Taguma et al., "Parallel Computing-Based Architecture for Mixed-Initiative Spoken Dialogue," in Proc. IEEE Int. Conf. on Multimodal Interfaces (ICMI), Pittsburgh, 2002, pp. 53-58.
    • (2002) Proc. IEEE Int. Conf. on Multimodal Interfaces (ICMI) , pp. 53-58
    • Taguma, R.1
  • 23
    • 9444283479
    • Audio-visual speech recognition using lip movement extracted from side-face images
    • Geneva
    • T. Yoshinaga et al., "Audio-Visual Speech Recognition Using Lip Movement Extracted from Side-Face Images," in Proc. Eurospeech, Geneva, 2003.
    • (2003) Proc. Eurospeech
    • Yoshinaga, T.1
  • 26
    • 0037301124
    • A statistical approach to automatic speech summarization
    • C. Hori et al., "A Statistical Approach to Automatic Speech Summarization," EURASIP Journal on Applied Signal Processing, 2003, pp. 128-139.
    • (2003) EURASIP Journal on Applied Signal Processing , pp. 128-139
    • Hori, C.1


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.