-
1
-
-
0000763574
-
Automatic recognition and understanding of spoken language - A first step towards natural human-machine communication
-
B.-H. Juang and S. Furui, "Automatic Recognition and Understanding of Spoken Language - A First Step Towards Natural Human-Machine Communication," Proc. IEEE, vol. 88, no. 8, 2000, pp. 1142-1165.
-
(2000)
Proc. IEEE
, vol.88
, Issue.8
, pp. 1142-1165
-
-
Juang, B.-H.1
Furui, S.2
-
3
-
-
0004072715
-
-
Marcel Dekker
-
S. Furui, Digital Speech Processing, Synthesis, and Recognition, 2nd edition, Marcel Dekker, 2000.
-
(2000)
Digital Speech Processing, Synthesis, and Recognition, 2nd Edition
-
-
Furui, S.1
-
4
-
-
0012078715
-
Corpus-based statistical methods in speech and language processing
-
S. Young and G. Bloothooft (Eds.), Kluwer
-
H. Ney, "Corpus-Based Statistical Methods in Speech and Language Processing," in Corpus-based Methods in Language and Speech Processing, S. Young and G. Bloothooft (Eds.), Kluwer, 1997, pp. 1-26.
-
(1997)
Corpus-based Methods in Language and Speech Processing
, pp. 1-26
-
-
Ney, H.1
-
6
-
-
0003779990
-
Steps toward natural human-machine communication in the 21st century
-
Ghent
-
S. Furui, "Steps Toward Natural Human-Machine Communication in the 21st Century," in Proc. ISCA Workshop on Voice Operated Telecom Services, Ghent, 2000, pp. 17-24.
-
(2000)
Proc. ISCA Workshop on Voice Operated Telecom Services
, pp. 17-24
-
-
Furui, S.1
-
7
-
-
84963901011
-
The AT&T-DARPA communicator mixed-initiative spoken dialogue system
-
Beijing
-
E. Levin et al., "The AT&T-DARPA Communicator Mixed-Initiative Spoken Dialogue System," in Proc. ICSLP, Beijing, 2000, pp. II-122-125.
-
(2000)
Proc. ICSLP
-
-
Levin, E.1
-
8
-
-
0002517880
-
Audio-visual large vocabulary continuous speech recognition in the broadcast domain
-
Copenhagen
-
S. Basu et al., "Audio-Visual Large Vocabulary Continuous Speech Recognition in the Broadcast Domain," in Proc. IEEE Multimedia Signal Processing (MMSP), Copenhagen, 1999, pp. 475-481.
-
(1999)
Proc. IEEE Multimedia Signal Processing (MMSP)
, pp. 475-481
-
-
Basu, S.1
-
9
-
-
85009205170
-
Toward spontaneous speech recognition and understanding
-
W. Chou and B.-H. Juang (Eds.), CRC Press
-
S. Furui, "Toward Spontaneous Speech Recognition and Understanding," in Pattern Recognition in Speech and language Processing, W. Chou and B.-H. Juang (Eds.), CRC Press, 2003, pp. 191-227.
-
(2003)
Pattern Recognition in Speech and Language Processing
, pp. 191-227
-
-
Furui, S.1
-
10
-
-
85009062702
-
Towards automatic transcription of spontaneous presentations
-
Aalborg
-
T. Shinozaki et al., "Towards Automatic Transcription of Spontaneous Presentations," in Proc. Eurospeech, Aalborg, vol. 1, 2001, pp. 491-494.
-
(2001)
Proc. Eurospeech
, vol.1
, pp. 491-494
-
-
Shinozaki, T.1
-
11
-
-
0036298775
-
Analysis on individual differences in automatic transcription of spontaneous presentations
-
Orlando
-
T. Shinozaki and S. Furui, "Analysis on Individual Differences in Automatic Transcription of Spontaneous Presentations," in Proc. ICASSP, Orlando, 2002, pp. 1-729-732.
-
(2002)
Proc. ICASSP
-
-
Shinozaki, T.1
Furui, S.2
-
12
-
-
0036642566
-
On-line incremental speaker adaptation for broadcast news transcription
-
Z. Zhang et al., "On-Line Incremental Speaker Adaptation for Broadcast News Transcription," in Speech Communication, vol. 37, 2002, pp. 271-281.
-
(2002)
Speech Communication
, vol.37
, pp. 271-281
-
-
Zhang, Z.1
-
13
-
-
84946806722
-
An online incremental speaker adaptation method using speaker-clustered initial models
-
Beijing
-
Z. Zhang et al., "An Online Incremental Speaker Adaptation Method Using Speaker-Clustered Initial Models," in Proc. ICSLP, Beijing, 2000, pp. III-694-697.
-
(2000)
Proc. ICSLP
-
-
Zhang, Z.1
-
14
-
-
85017310148
-
An improved approach to the hidden markov model decomposition of speech and noise
-
San Francisco
-
M.J.F. Gales et al., "An Improved Approach to the Hidden Markov Model Decomposition of Speech and Noise," in Proc. ICASSP, San Francisco, 1992, pp. 233-236.
-
(1992)
Proc. ICASSP
, pp. 233-236
-
-
Gales, M.J.F.1
-
15
-
-
85135371131
-
Recognition of noisy speech by composition of hidden markov models
-
Berlin
-
F. Martin et al., "Recognition of Noisy Speech by Composition of Hidden Markov Models," in Proc. Eurospeech, Berlin, 1993, pp. 1031-1034.
-
(1993)
Proc. Eurospeech
, pp. 1031-1034
-
-
Martin, F.1
-
17
-
-
85009230988
-
Tree-structured noise-adapted HMM modeling for piecewise linear-transformation-based adaptation
-
Geneva
-
Z. Zhang et al., "Tree-Structured Noise-Adapted HMM Modeling for Piecewise Linear-Transformation-Based Adaptation," in Proc. Eurospeech, Geneva, 2003.
-
(2003)
Proc. Eurospeech
-
-
Zhang, Z.1
-
18
-
-
85009168871
-
Time adjustable mixture weights for speaking rate fluctuation
-
Geneva
-
T. Shinozaki and S. Furui, "Time Adjustable Mixture Weights for Speaking Rate Fluctuation," in Proc. Eurospeech, Geneva, 2003.
-
(2003)
Proc. Eurospeech
-
-
Shinozaki, T.1
Furui, S.2
-
19
-
-
0038784279
-
Bayesian network structures and inference techniques for automatic speech recognition
-
G. Zweig, "Bayesian Network Structures and Inference Techniques for Automatic Speech Recognition," Computer Speech and Language, vol. 17, 2003, pp. 173-193.
-
(2003)
Computer Speech and Language
, vol.17
, pp. 173-193
-
-
Zweig, G.1
-
20
-
-
9444287310
-
Unsupervised language model adaptation using word classes for spontaneous speech recognition
-
Tokyo
-
Y. Yokoyama et al., "Unsupervised Language Model Adaptation Using Word Classes for Spontaneous Speech Recognition," in Proc. IEEE-ISCA Workshop on Spontaneous Speech Processing and Recognition, Tokyo, 2003, pp. 71-74.
-
(2003)
Proc. IEEE-ISCA Workshop on Spontaneous Speech Processing and Recognition
, pp. 71-74
-
-
Yokoyama, Y.1
-
21
-
-
4544373699
-
Parallel computing-based architecture for mixed-initiative spoken dialogue
-
Pittsburgh
-
R. Taguma et al., "Parallel Computing-Based Architecture for Mixed-Initiative Spoken Dialogue," in Proc. IEEE Int. Conf. on Multimodal Interfaces (ICMI), Pittsburgh, 2002, pp. 53-58.
-
(2002)
Proc. IEEE Int. Conf. on Multimodal Interfaces (ICMI)
, pp. 53-58
-
-
Taguma, R.1
-
23
-
-
9444283479
-
Audio-visual speech recognition using lip movement extracted from side-face images
-
Geneva
-
T. Yoshinaga et al., "Audio-Visual Speech Recognition Using Lip Movement Extracted from Side-Face Images," in Proc. Eurospeech, Geneva, 2003.
-
(2003)
Proc. Eurospeech
-
-
Yoshinaga, T.1
-
26
-
-
0037301124
-
A statistical approach to automatic speech summarization
-
C. Hori et al., "A Statistical Approach to Automatic Speech Summarization," EURASIP Journal on Applied Signal Processing, 2003, pp. 128-139.
-
(2003)
EURASIP Journal on Applied Signal Processing
, pp. 128-139
-
-
Hori, C.1
|