-
4
-
-
0028996857
-
Microsoft Windows Highly Intelligent Speech Recognizer: Whisper
-
May
-
X. Huang, A. Acero, F. Alleva, M.Y. Hwang, L. Jiang, and M. Mahajan, "Microsoft Windows Highly Intelligent Speech Recognizer: Whisper," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '95), May 1995.
-
(1995)
Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '95)
-
-
Huang, X.1
Acero, A.2
Alleva, F.3
Hwang, M.Y.4
Jiang, L.5
Mahajan, M.6
-
5
-
-
4243109553
-
Challenges in Adopting Speech Recognition
-
Jan
-
L. Deng and X. Huang, "Challenges in Adopting Speech Recognition," Comm. ACM, vol. 47, no. 1, pp. 69-75, Jan. 2004.
-
(2004)
Comm. ACM
, vol.47
, Issue.1
, pp. 69-75
-
-
Deng, L.1
Huang, X.2
-
7
-
-
0004291417
-
The SPHINX-II Speech Recognition System: An Overview
-
Technical Report CMU-CS-92-112, Carnegie Mellon Univ, Jan
-
X. Huang, F. Alleva, H.-W. Hon, M.-Y. Hwang, and R. Rosenfeld, "The SPHINX-II Speech Recognition System: An Overview," Technical Report CMU-CS-92-112, Carnegie Mellon Univ., Jan. 1992.
-
(1992)
-
-
Huang, X.1
Alleva, F.2
Hon, H.-W.3
Hwang, M.-Y.4
Rosenfeld, R.5
-
8
-
-
0005215927
-
Talk to Your Computer and Have It Answer Back with the Microsoft Speech API
-
Jan
-
M. Rozak, "Talk to Your Computer and Have It Answer Back with the Microsoft Speech API," Microsoft Systems J., Jan. 1996.
-
(1996)
Microsoft Systems J
-
-
Rozak, M.1
-
9
-
-
0000383720
-
From Sphinx-II to Whisper: Making Speech Recognition Usable
-
C. Lee, F. Soong, and K. Paliwal, eds, Kluwer Academic
-
X. Huang, A. Acero, F. Alleva, M. Hwang, L. Jiang, and M. Mahajan, "From Sphinx-II to Whisper: Making Speech Recognition Usable," Automatic Speech and Speaker Recognition, Advanced Topics, C. Lee, F. Soong, and K. Paliwal, eds., Kluwer Academic, 1996.
-
(1996)
Automatic Speech and Speaker Recognition, Advanced Topics
-
-
Huang, X.1
Acero, A.2
Alleva, F.3
Hwang, M.4
Jiang, L.5
Mahajan, M.6
-
11
-
-
34548254708
-
-
S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book Version 2.2, Entropic Cambridge Research Laboratory, Dec. 1999
-
S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, "The HTK Book Version 2.2," Entropic Cambridge Research Laboratory, Dec. 1999.
-
-
-
-
12
-
-
84926060821
-
Large Vocabulary Continuous Speech Recognition Using HTK
-
Apr
-
P.C. Woodland, J.J. Odell, V. Valtchev, and S.J. Young, "Large Vocabulary Continuous Speech Recognition Using HTK," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '94), vol. 2, pp. 125-128, Apr. 1994.
-
(1994)
Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '94)
, vol.2
, pp. 125-128
-
-
Woodland, P.C.1
Odell, J.J.2
Valtchev, V.3
Young, S.J.4
-
13
-
-
0002997416
-
The 1997 HTK Broadcast News Transcription System
-
P.C. Woodland, T. Hain, S.E. Johnson, T.R. Niesler, A. Tuerk, E.W.D. Whittaker, and S.J. Young, "The 1997 HTK Broadcast News Transcription System," Proc. DARPA Broadcast News Transcription and Understanding Workshop, pp. 41-48, 1998.
-
(1998)
Proc. DARPA Broadcast News Transcription and Understanding Workshop
, pp. 41-48
-
-
Woodland, P.C.1
Hain, T.2
Johnson, S.E.3
Niesler, T.R.4
Tuerk, A.5
Whittaker, E.W.D.6
Young, S.J.7
-
14
-
-
0000642642
-
The CUHTKEntropic 10xRT Broadcast News Transcription System
-
J.J. Odell, P.C. Woodland, and T. Hain, "The CUHTKEntropic 10xRT Broadcast News Transcription System," Proc. DARPA Broadcast News Workshop, pp. 271-275, 1999.
-
(1999)
Proc. DARPA Broadcast News Workshop
, pp. 271-275
-
-
Odell, J.J.1
Woodland, P.C.2
Hain, T.3
-
15
-
-
0034847002
-
The 1998 HTK System for Transcription of Conversational Telephone Speech
-
T. Hain, P.C. Woodland, T.R. Niesler, and E.W.D. Whittaker, "The 1998 HTK System for Transcription of Conversational Telephone Speech," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '99), pp. 57-60, 1999.
-
(1999)
Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '99)
, pp. 57-60
-
-
Hain, T.1
Woodland, P.C.2
Niesler, T.R.3
Whittaker, E.W.D.4
-
16
-
-
34548212992
-
CU-HTK March 2001 Hub5 System
-
May
-
P.C. Woodland, T. Hain, G. Evermann, and D. Povey, "CU-HTK March 2001 Hub5 System," Proc. Large Vocabulary Continuous Speech Recognition Hub5 Workshop, May 2001.
-
(2001)
Proc. Large Vocabulary Continuous Speech Recognition Hub5 Workshop
-
-
Woodland, P.C.1
Hain, T.2
Evermann, G.3
Povey, D.4
-
17
-
-
34047250740
-
SuperEARS: Multi-Site Broadcast News System
-
Nov
-
P.C. Woodland, H.Y. Chan, G. Evermann, M.J.F. Gales, D.Y. Kim, X.A. Liu, D. Mrva, K.C. Sim, L. Wang, K. Yu, J. Makhoul, R. Schwartz, L. Nguyen, S. Matsoukas, B. Xiang, M. Afify, S. Abdou, J.-L. Gauvain, L. Lamel, H. Schwenk, G. Adda, F. Lefevre, D. Vergyri, W. Wang, J. Zheng, A. Venkataraman, R.R. Gadde, and A. Stolcke, "SuperEARS: Multi-Site Broadcast News System," Proc. Fall Rich Transcription Workshop (RT '04), Nov. 2004.
-
(2004)
Proc. Fall Rich Transcription Workshop (RT '04)
-
-
Woodland, P.C.1
Chan, H.Y.2
Evermann, G.3
Gales, M.J.F.4
Kim, D.Y.5
Liu, X.A.6
Mrva, D.7
Sim, K.C.8
Wang, L.9
Yu, K.10
Makhoul, J.11
Schwartz, R.12
Nguyen, L.13
Matsoukas, S.14
Xiang, B.15
Afify, M.16
Abdou, S.17
Gauvain, J.-L.18
Lamel, L.19
Schwenk, H.20
Adda, G.21
Lefevre, F.22
Vergyri, D.23
Wang, W.24
Zheng, J.25
Venkataraman, A.26
Gadde, R.R.27
Stolcke, A.28
more..
-
18
-
-
34047266379
-
Progress in the CU-HTK Broadcast News Transcription System
-
Sept
-
M.J.F. Gales, Y.K. Do, P.C. Woodland, Y.C. Ho, D. Mrva, R. Sinha, and S.E. Tranter, "Progress in the CU-HTK Broadcast News Transcription System," IEEE Trans. Audio, Speech and Language Processing, vol. 14, no. 5, pp. 1513-1525, Sept. 2006.
-
(2006)
IEEE Trans. Audio, Speech and Language Processing
, vol.14
, Issue.5
, pp. 1513-1525
-
-
Gales, M.J.F.1
Do, Y.K.2
Woodland, P.C.3
Ho, Y.C.4
Mrva, D.5
Sinha, R.6
Tranter, S.E.7
-
20
-
-
85039383593
-
Software Driving Software: Active Accessibility-Compliant Apps Give Programmers New Tools to Manipulate Software
-
Apr
-
D. Klementiev, "Software Driving Software: Active Accessibility-Compliant Apps Give Programmers New Tools to Manipulate Software," MSDN Magazine, Apr. 2000.
-
(2000)
MSDN Magazine
-
-
Klementiev, D.1
-
21
-
-
84944902727
-
-
second ed, chapter 23. Microsoft Press
-
Developing International Software, second ed., chapter 23. Microsoft Press, 2003.
-
(2003)
Developing International Software
-
-
-
22
-
-
34547501897
-
Use of Incrementally Regulated Discriminative Margins in MCE Training for Speech Recognition
-
Sept
-
D. Yu, L. Deng, X. He, and A. Acero, "Use of Incrementally Regulated Discriminative Margins in MCE Training for Speech Recognition," Proc. Interspeech Conf., Sept. 2006.
-
(2006)
Proc. Interspeech Conf
-
-
Yu, D.1
Deng, L.2
He, X.3
Acero, A.4
-
23
-
-
0031187171
-
Speech Recognition by Machines and Humans
-
R.P. Lippmann, "Speech Recognition by Machines and Humans," Speech Comm., vol. 22, pp. 1-15, 1997.
-
(1997)
Speech Comm
, vol.22
, pp. 1-15
-
-
Lippmann, R.P.1
-
24
-
-
0019053271
-
Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences
-
S.B. Davis and P. Mermelstein, "Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences," IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357-366, 1980.
-
(1980)
IEEE Trans. Acoustics, Speech, and Signal Processing
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.B.1
Mermelstein, P.2
-
25
-
-
34548286207
-
-
Aug
-
Documentation for .Net Framework, http://msdn.microsoft.com/en-us/ netframework/default.aspx, Aug. 2006.
-
(2006)
Documentation for .Net Framework
-
-
-
26
-
-
34548292275
-
Speech Recognition Grammar Specification (SRGS) v1.0" and "Speech Synthesis Markup Language (SSML) v1.0
-
World Wide Web Consortium (W3C) recommendation
-
"Speech Recognition Grammar Specification (SRGS) v1.0" and "Speech Synthesis Markup Language (SSML) v1.0," World Wide Web Consortium (W3C) recommendation, 2004.
-
(2004)
-
-
-
27
-
-
34548222933
-
-
MSDN, MSDN Library Platform SDK, Aug
-
MSDN, "What Is the Indexing Service," MSDN Library Platform SDK, Aug. 2006.
-
(2006)
What Is the Indexing Service
-
-
-
29
-
-
33646805439
-
Maximum Entropy Based Generic Filter for Language Model Adaptation
-
Mar
-
D. Yu, M. Mahajan, P. Mau, and A. Acero, "Maximum Entropy Based Generic Filter for Language Model Adaptation," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '05), Mar. 2005.
-
(2005)
Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '05)
-
-
Yu, D.1
Mahajan, M.2
Mau, P.3
Acero, A.4
-
30
-
-
0004067829
-
Improved Acoustic Modeling for HMMs Using Linear Transformations,
-
PhD dissertation, Dept. of Eng, Univ. of Cambridge, Feb
-
C.J. Leggetter, "Improved Acoustic Modeling for HMMs Using Linear Transformations," PhD dissertation, Dept. of Eng., Univ. of Cambridge, Feb. 1995.
-
(1995)
-
-
Leggetter, C.J.1
-
31
-
-
0028419019
-
Maximum A Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains
-
J.L. Gauvain and C.H. Lee, "Maximum A Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains," IEEE Trans. Speech and Audio Processing, vol. 2, pp. 291-298, 1994.
-
(1994)
IEEE Trans. Speech and Audio Processing
, vol.2
, pp. 291-298
-
-
Gauvain, J.L.1
Lee, C.H.2
|