SCOPUS Information Search Platform

Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology

Volume 41, Issue 3 SPEC. ISS., 2005, Pages 245-254

Toward robust speech recognition and understanding

(1) Furui, Sadaoki a,b,c

a TOKYO INSTITUTE OF TECHNOLOGY (Japan)

b IEEE (Japan)

c International Speech Communication Association ISCA (Japan)

Author keywords

Acoustic models; Adaptation; Corpus; Dialogue; Language models; Multi modal; Robustness; Speech recognition; Speech understanding; Spontaneous speech; Summarization

Indexed keywords

ACOUSTIC MODELS; ADAPTATION; CORPUS; DIALOGUE; LANGUAGE MODELS; MULTI-MODAL; SPEECH UNDERSTANDING; SPONTANEOUS SPEECH; SUMMARIZATION;

COMPUTATIONAL METHODS; ERROR DETECTION; MATHEMATICAL MODELS; MICROPHONES; PROBLEM SOLVING; ROBUSTNESS (CONTROL SYSTEMS);

SPEECH RECOGNITION;

EID: 29344435134 PISSN: 13875485 EISSN: None Source Type: Journal
DOI: 10.1007/s11265-005-4149-x Document Type: Conference Paper

Times cited : (3)

References (26)

1
- 0000763574
- Automatic recognition and understanding of spoken language - A first step towards natural human-machine communication
- B.-H. Juang and S. Furui, "Automatic Recognition and Understanding of Spoken Language - A First Step Towards Natural Human-Machine Communication," Proc. IEEE, vol. 88, no. 8, 2000, pp. 1142-1165.
- (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1142-1165
- Juang, B.-H.¹ Furui, S.²

2
- 0004244302
- Prentice-Hall
- L.R. Rabiner and B.H. Juang, Fundamentals of Speech Recognition, Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.H.²

3
- 0004072715
- Marcel Dekker
- S. Furui, Digital Speech Processing, Synthesis, and Recognition, 2nd edition, Marcel Dekker, 2000.
- (2000) Digital Speech Processing, Synthesis, and Recognition, 2nd Edition
- Furui, S.¹

4
- 0012078715
- Corpus-based statistical methods in speech and language processing
- S. Young and G. Bloothooft (Eds.), Kluwer
- H. Ney, "Corpus-Based Statistical Methods in Speech and Language Processing," in Corpus-based Methods in Language and Speech Processing, S. Young and G. Bloothooft (Eds.), Kluwer, 1997, pp. 1-26.
- (1997) Corpus-based Methods in Language and Speech Processing , pp. 1-26
- Ney, H.¹

5
- 3042730370
- Recent advances in spontaneous speech recognition and understanding
- Tokyo
- S. Furui, "Recent Advances in Spontaneous Speech Recognition and Understanding," in Proc. IEEE-ISCA Workshop on Spontaneous Speech Processing and Recognition (SSPR), Tokyo, 2003, pp. 1-6.
- (2003) Proc. IEEE-ISCA Workshop on Spontaneous Speech Processing and Recognition (SSPR) , pp. 1-6
- Furui, S.¹

6
- 0003779990
- Steps toward natural human-machine communication in the 21st century
- Ghent
- S. Furui, "Steps Toward Natural Human-Machine Communication in the 21st Century," in Proc. ISCA Workshop on Voice Operated Telecom Services, Ghent, 2000, pp. 17-24.
- (2000) Proc. ISCA Workshop on Voice Operated Telecom Services , pp. 17-24
- Furui, S.¹

7
- 84963901011
- The AT&T-DARPA communicator mixed-initiative spoken dialogue system
- Beijing
- E. Levin et al., "The AT&T-DARPA Communicator Mixed-Initiative Spoken Dialogue System," in Proc. ICSLP, Beijing, 2000, pp. II-122-125.
- (2000) Proc. ICSLP
- Levin, E.¹

8
- 0002517880
- Audio-visual large vocabulary continuous speech recognition in the broadcast domain
- Copenhagen
- S. Basu et al., "Audio-Visual Large Vocabulary Continuous Speech Recognition in the Broadcast Domain," in Proc. IEEE Multimedia Signal Processing (MMSP), Copenhagen, 1999, pp. 475-481.
- (1999) Proc. IEEE Multimedia Signal Processing (MMSP) , pp. 475-481
- Basu, S.¹

9
- 85009205170
- Toward spontaneous speech recognition and understanding
- W. Chou and B.-H. Juang (Eds.), CRC Press
- S. Furui, "Toward Spontaneous Speech Recognition and Understanding," in Pattern Recognition in Speech and Language Processing, W. Chou and B.-H. Juang (Eds.), CRC Press, 2003, pp. 191-227.
- (2003) Pattern Recognition in Speech and Language Processing , pp. 191-227
- Furui, S.¹

10
- 85009062702
- Towards automatic transcription of spontaneous presentations
- Aalborg
- T. Shinozaki et al., "Towards Automatic Transcription of Spontaneous Presentations," in Proc. Eurospeech, Aalborg, vol. 1, 2001, pp. 491-494.
- (2001) Proc. Eurospeech , vol.1 , pp. 491-494
- Shinozaki, T.¹

11
- 0036298775
- Analysis on individual differences in automatic transcription of spontaneous presentations
- Orlando
- T. Shinozaki and S. Furui, "Analysis on Individual Differences in Automatic Transcription of Spontaneous Presentations," in Proc. ICASSP, Orlando, 2002, pp. I-729-732.
- (2002) Proc. ICASSP
- Shinozaki, T.¹ Furui, S.²

12
- 0036642566
- On-line incremental speaker adaptation for broadcast news transcription
- Z. Zhang et al., "On-Line Incremental Speaker Adaptation for Broadcast News Transcription," in Speech Communication, vol. 37, 2002, pp. 271-281.
- (2002) Speech Communication , vol.37 , pp. 271-281
- Zhang, Z.¹

13
- 84946806722
- An online incremental speaker adaptation method using speaker-clustered initial models
- Beijing
- Z. Zhang et al., "An Online Incremental Speaker Adaptation Method Using Speaker-Clustered Initial Models," in Proc. ICSLP, Beijing, 2000, pp. III-694-697.
- (2000) Proc. ICSLP
- Zhang, Z.¹

14
- 85017310148
- An improved approach to the hidden Markov model decomposition of speech and noise
- San Francisco
- M.J.F. Gales et al., "An Improved Approach to the Hidden Markov Model Decomposition of Speech and Noise," in Proc. ICASSP, San Francisco, 1992, pp. 233-236.
- (1992) Proc. ICASSP , pp. 233-236
- Gales, M.J.F.¹

15
- 85135371131
- Recognition of noisy speech by composition of hidden Markov models
- Berlin
- F. Martin et al., "Recognition of Noisy Speech by Composition of Hidden Markov Models," in Proc. Eurospeech, Berlin, 1993, pp. 1031-1034.
- (1993) Proc. Eurospeech , pp. 1031-1034
- Martin, F.¹

16
- 0003735142
- Noise adaptation of HMMs using neural networks
- Paris
- S. Furui et al., "Noise Adaptation of HMMs Using Neural Networks," in Proc. ISCA Workshop on Automatic Speech Recognition, Paris, 2000, pp. 160-167.
- (2000) Proc. ISCA Workshop on Automatic Speech Recognition , pp. 160-167
- Furui, S.¹

17
- 85009230988
- Tree-structured noise-adapted HMM modeling for piecewise linear-transformation-based adaptation
- Geneva
- Z. Zhang et al., "Tree-Structured Noise-Adapted HMM Modeling for Piecewise Linear-Transformation-Based Adaptation," in Proc. Eurospeech, Geneva, 2003.
- (2003) Proc. Eurospeech
- Zhang, Z.¹

18
- 85009168871
- Time adjustable mixture weights for speaking rate fluctuation
- Geneva
- T. Shinozaki and S. Furui, "Time Adjustable Mixture Weights for Speaking Rate Fluctuation," in Proc. Eurospeech, Geneva, 2003.
- (2003) Proc. Eurospeech
- Shinozaki, T.¹ Furui, S.²

19
- 0038784279
- Bayesian network structures and inference techniques for automatic speech recognition
- G. Zweig, "Bayesian Network Structures and Inference Techniques for Automatic Speech Recognition," Computer Speech and Language, vol. 17, 2003, pp. 173-193.
- (2003) Computer Speech and Language , vol.17 , pp. 173-193
- Zweig, G.¹

20
- 9444287310
- Unsupervised language model adaptation using word classes for spontaneous speech recognition
- Tokyo
- Y. Yokoyama et al., "Unsupervised Language Model Adaptation Using Word Classes for Spontaneous Speech Recognition," in Proc. IEEE-ISCA Workshop on Spontaneous Speech Processing and Recognition, Tokyo, 2003, pp. 71-74.
- (2003) Proc. IEEE-ISCA Workshop on Spontaneous Speech Processing and Recognition , pp. 71-74
- Yokoyama, Y.¹

21
- 4544373699
- Parallel computing-based architecture for mixed-initiative spoken dialogue
- Pittsburgh
- R. Taguma et al., "Parallel Computing-Based Architecture for Mixed-Initiative Spoken Dialogue," in Proc. IEEE Int. Conf. on Multimodal Interfaces (ICMI), Pittsburgh, 2002, pp. 53-58.
- (2002) Proc. IEEE Int. Conf. on Multimodal Interfaces (ICMI) , pp. 53-58
- Taguma, R.¹

22
- 9444270671
- A robust multi-modal speech recognition method using optical-flow analysis
- Kloster Irsee
- S. Tamura et al., "A Robust Multi-Modal Speech Recognition Method Using Optical-Flow Analysis," in Proc. ISCA Workshop on Multi-modal Dialogue in Mobile Environments, Kloster Irsee, 2002.
- (2002) Proc. ISCA Workshop on Multi-modal Dialogue in Mobile Environments
- Tamura, S.¹

23
- 9444283479
- Audio-visual speech recognition using lip movement extracted from side-face images
- Geneva
- T. Yoshinaga et al., "Audio-Visual Speech Recognition Using Lip Movement Extracted from Side-Face Images," in Proc. Eurospeech, Geneva, 2003.
- (2003) Proc. Eurospeech
- Yoshinaga, T.¹

24
- 9444296230
- Speech-to-speech and speech-to-text summarization
- Sapporo
- S. Furui et al., "Speech-to-Speech and Speech-to-Text Summarization," in Proc. Int. Workshop on Language Understanding and Agents for Real World Interaction, Sapporo, 2003, pp. 100-106.
- (2003) Proc. Int. Workshop on Language Understanding and Agents for Real World Interaction , pp. 100-106
- Furui, S.¹

25
- 9444262173
- Two-stage automatic speech summarization by sentence extraction and compaction
- Tokyo
- T. Kikuchi et al., "Two-Stage Automatic Speech Summarization by Sentence Extraction and Compaction," in Proc. IEEE-ISCA Workshop on Spontaneous Speech Processing and Recognition (SSPR), Tokyo, 2003, pp. 207-210.
- (2003) Proc. IEEE-ISCA Workshop on Spontaneous Speech Processing and Recognition (SSPR) , pp. 207-210
- Kikuchi, T.¹

26
- 0037301124
- A statistical approach to automatic speech summarization
- C. Hori et al., "A Statistical Approach to Automatic Speech Summarization," EURASIP Journal on Applied Signal Processing, 2003, pp. 128-139.
- (2003) EURASIP Journal on Applied Signal Processing , pp. 128-139
- Hori, C.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.