SCOPUS 정보 검색 플랫폼

IEEE Transactions on Computers

Volumn 56, Issue 9, 2007, Pages 1156-1168

Architecture, user interface, and enabling technology in Windows Vista's speech systems

(2) Odell, Julian a Mukerjee, Kunal a

a MICROSOFT (United States)

Author keywords

Adaptation; Operating systems; Speech recognition and synthesis; User interfaces

Indexed keywords

COMPUTER ARCHITECTURE; SPEECH RECOGNITION; SPEECH SYNTHESIS; TECHNOLOGY; USER INTERFACES;

ADAPTATION SYSTEMS; RECOGNITION TECHNOLOGY; TECHNOLOGY DEVELOPMENTS;

WINDOWS OPERATING SYSTEM;

EID: 34548284043 PISSN: 00189340 EISSN: None Source Type: Journal
DOI: 10.1109/TC.2007.1065 Document Type: Article

Times cited : (11)

References (31)

1
- 85037531614
- Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages
- S. Burger, Z.A. Sloane, and J. Yang, "Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages," Proc. Language Resource and Evaluation Conf. (LREC '06), 2006.
- (2006) Proc. Language Resource and Evaluation Conf. (LREC '06)
- Burger, S.¹ Sloane, Z.A.² Yang, J.³

2
- 34548264482
- Discontinued Products Information in the Comp. Speech FAQ, http://www.speech.cs.cmu.edu/comp.speech/FAQ6.html, 2007.
- (2007) Discontinued Products Information in the Comp. Speech FAQ

3
- 34548291755
- Detailed Product Information for Both Products, http://www.nuance.com/ products, 2007.
- (2007) Detailed Product Information for Both Products

4
- 0028996857
- Microsoft Windows Highly Intelligent Speech Recognizer: Whisper
- May
- X. Huang, A. Acero, F. Alleva, M.Y. Hwang, L. Jiang, and M. Mahajan, "Microsoft Windows Highly Intelligent Speech Recognizer: Whisper," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '95), May 1995.
- (1995) Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '95)
- Huang, X.¹ Acero, A.² Alleva, F.³ Hwang, M.Y.⁴ Jiang, L.⁵ Mahajan, M.⁶

5
- 4243109553
- Challenges in Adopting Speech Recognition
- Jan
- L. Deng and X. Huang, "Challenges in Adopting Speech Recognition," Comm. ACM, vol. 47, no. 1, pp. 69-75, Jan. 2004.
- (2004) Comm. ACM , vol.47 , Issue.1 , pp. 69-75
- Deng, L.¹ Huang, X.²

6
- 0004056285
- Prentice Hall
- X. Huang, A. Acero, and H. Hon, Spoken Language Processing. Prentice Hall, 2001.
- (2001) Spoken Language Processing
- Huang, X.¹ Acero, A.² Hon, H.³

7
- 0004291417
- The SPHINX-II Speech Recognition System: An Overview
- Technical Report CMU-CS-92-112, Carnegie Mellon Univ, Jan
- X. Huang, F. Alleva, H.-W. Hon, M.-Y. Hwang, and R. Rosenfeld, "The SPHINX-II Speech Recognition System: An Overview," Technical Report CMU-CS-92-112, Carnegie Mellon Univ., Jan. 1992.
- (1992)
- Huang, X.¹ Alleva, F.² Hon, H.-W.³ Hwang, M.-Y.⁴ Rosenfeld, R.⁵

8
- 0005215927
- Talk to Your Computer and Have It Answer Back with the Microsoft Speech API
- Jan
- M. Rozak, "Talk to Your Computer and Have It Answer Back with the Microsoft Speech API," Microsoft Systems J., Jan. 1996.
- (1996) Microsoft Systems J
- Rozak, M.¹

9
- 0000383720
- From Sphinx-II to Whisper: Making Speech Recognition Usable
- C. Lee, F. Soong, and K. Paliwal, eds, Kluwer Academic
- X. Huang, A. Acero, F. Alleva, M. Hwang, L. Jiang, and M. Mahajan, "From Sphinx-II to Whisper: Making Speech Recognition Usable," Automatic Speech and Speaker Recognition, Advanced Topics, C. Lee, F. Soong, and K. Paliwal, eds., Kluwer Academic, 1996.
- (1996) Automatic Speech and Speaker Recognition, Advanced Topics
- Huang, X.¹ Acero, A.² Alleva, F.³ Hwang, M.⁴ Jiang, L.⁵ Mahajan, M.⁶

10
- 34548232051
- SAPI Information from the Microsoft Speech Site, http://www.microsoft.com/speech/download/old/sapi5.asp, 2007.
- (2007) SAPI Information from the Microsoft Speech Site

11
- 34548254708
- S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book Version 2.2, Entropic Cambridge Research Laboratory, Dec. 1999
- S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, "The HTK Book Version 2.2," Entropic Cambridge Research Laboratory, Dec. 1999.

12
- 84926060821
- Large Vocabulary Continuous Speech Recognition Using HTK
- Apr
- P.C. Woodland, J.J. Odell, V. Valtchev, and S.J. Young, "Large Vocabulary Continuous Speech Recognition Using HTK," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '94), vol. 2, pp. 125-128, Apr. 1994.
- (1994) Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '94) , vol.2 , pp. 125-128
- Woodland, P.C.¹ Odell, J.J.² Valtchev, V.³ Young, S.J.⁴

13
- 0002997416
- The 1997 HTK Broadcast News Transcription System
- P.C. Woodland, T. Hain, S.E. Johnson, T.R. Niesler, A. Tuerk, E.W.D. Whittaker, and S.J. Young, "The 1997 HTK Broadcast News Transcription System," Proc. DARPA Broadcast News Transcription and Understanding Workshop, pp. 41-48, 1998.
- (1998) Proc. DARPA Broadcast News Transcription and Understanding Workshop , pp. 41-48
- Woodland, P.C.¹ Hain, T.² Johnson, S.E.³ Niesler, T.R.⁴ Tuerk, A.⁵ Whittaker, E.W.D.⁶ Young, S.J.⁷

14
- 0000642642
- The CUHTKEntropic 10xRT Broadcast News Transcription System
- J.J. Odell, P.C. Woodland, and T. Hain, "The CUHTKEntropic 10xRT Broadcast News Transcription System," Proc. DARPA Broadcast News Workshop, pp. 271-275, 1999.
- (1999) Proc. DARPA Broadcast News Workshop , pp. 271-275
- Odell, J.J.¹ Woodland, P.C.² Hain, T.³

15
- 0034847002
- The 1998 HTK System for Transcription of Conversational Telephone Speech
- T. Hain, P.C. Woodland, T.R. Niesler, and E.W.D. Whittaker, "The 1998 HTK System for Transcription of Conversational Telephone Speech," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '99), pp. 57-60, 1999.
- (1999) Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '99) , pp. 57-60
- Hain, T.¹ Woodland, P.C.² Niesler, T.R.³ Whittaker, E.W.D.⁴

16
- 34548212992
- CU-HTK March 2001 Hub5 System
- May
- P.C. Woodland, T. Hain, G. Evermann, and D. Povey, "CU-HTK March 2001 Hub5 System," Proc. Large Vocabulary Continuous Speech Recognition Hub5 Workshop, May 2001.
- (2001) Proc. Large Vocabulary Continuous Speech Recognition Hub5 Workshop
- Woodland, P.C.¹ Hain, T.² Evermann, G.³ Povey, D.⁴

17
- 34047250740
- SuperEARS: Multi-Site Broadcast News System
- Nov
- P.C. Woodland, H.Y. Chan, G. Evermann, M.J.F. Gales, D.Y. Kim, X.A. Liu, D. Mrva, K.C. Sim, L. Wang, K. Yu, J. Makhoul, R. Schwartz, L. Nguyen, S. Matsoukas, B. Xiang, M. Afify, S. Abdou, J.-L. Gauvain, L. Lamel, H. Schwenk, G. Adda, F. Lefevre, D. Vergyri, W. Wang, J. Zheng, A. Venkataraman, R.R. Gadde, and A. Stolcke, "SuperEARS: Multi-Site Broadcast News System," Proc. Fall Rich Transcription Workshop (RT '04), Nov. 2004.
- (2004) Proc. Fall Rich Transcription Workshop (RT '04)
- Woodland, P.C.¹ Chan, H.Y.² Evermann, G.³ Gales, M.J.F.⁴ Kim, D.Y.⁵ Liu, X.A.⁶ Mrva, D.⁷ Sim, K.C.⁸ Wang, L.⁹ Yu, K.¹⁰ Makhoul, J.¹¹ Schwartz, R.¹² Nguyen, L.¹³ Matsoukas, S.¹⁴ Xiang, B.¹⁵ Afify, M.¹⁶ Abdou, S.¹⁷ Gauvain, J.-L.¹⁸ Lamel, L.¹⁹ Schwenk, H.²⁰ more..

18
- 34047266379
- Progress in the CU-HTK Broadcast News Transcription System
- Sept
- M.J.F. Gales, Y.K. Do, P.C. Woodland, Y.C. Ho, D. Mrva, R. Sinha, and S.E. Tranter, "Progress in the CU-HTK Broadcast News Transcription System," IEEE Trans. Audio, Speech and Language Processing, vol. 14, no. 5, pp. 1513-1525, Sept. 2006.
- (2006) IEEE Trans. Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1513-1525
- Gales, M.J.F.¹ Do, Y.K.² Woodland, P.C.³ Ho, Y.C.⁴ Mrva, D.⁵ Sinha, R.⁶ Tranter, S.E.⁷

19
- 34548283859
- Microsoft Press
- J.V. West, Tablet PC Quick Reference. Microsoft Press, 2002.
- (2002) Tablet PC Quick Reference
- West, J.V.¹

20
- 85039383593
- Software Driving Software: Active Accessibility-Compliant Apps Give Programmers New Tools to Manipulate Software
- Apr
- D. Klementiev, "Software Driving Software: Active Accessibility-Compliant Apps Give Programmers New Tools to Manipulate Software," MSDN Magazine, Apr. 2000.
- (2000) MSDN Magazine
- Klementiev, D.¹

21
- 84944902727
- second ed, chapter 23. Microsoft Press
- Developing International Software, second ed., chapter 23. Microsoft Press, 2003.
- (2003) Developing International Software

22
- 34547501897
- Use of Incrementally Regulated Discriminative Margins in MCE Training for Speech Recognition
- Sept
- D. Yu, L. Deng, X. He, and A. Acero, "Use of Incrementally Regulated Discriminative Margins in MCE Training for Speech Recognition," Proc. Interspeech Conf., Sept. 2006.
- (2006) Proc. Interspeech Conf
- Yu, D.¹ Deng, L.² He, X.³ Acero, A.⁴

23
- 0031187171
- Speech Recognition by Machines and Humans
- R.P. Lippmann, "Speech Recognition by Machines and Humans," Speech Comm., vol. 22, pp. 1-15, 1997.
- (1997) Speech Comm , vol.22 , pp. 1-15
- Lippmann, R.P.¹

24
- 0019053271
- Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences
- S.B. Davis and P. Mermelstein, "Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences," IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357-366, 1980.
- (1980) IEEE Trans. Acoustics, Speech, and Signal Processing , vol.28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

25
- 34548286207
- Aug
- Documentation for .Net Framework, http://msdn.microsoft.com/en-us/ netframework/default.aspx, Aug. 2006.
- (2006) Documentation for .Net Framework

26
- 34548292275
- Speech Recognition Grammar Specification (SRGS) v1.0" and "Speech Synthesis Markup Language (SSML) v1.0
- World Wide Web Consortium (W3C) recommendation
- "Speech Recognition Grammar Specification (SRGS) v1.0" and "Speech Synthesis Markup Language (SSML) v1.0," World Wide Web Consortium (W3C) recommendation, 2004.
- (2004)

27
- 34548222933
- MSDN, MSDN Library Platform SDK, Aug
- MSDN, "What Is the Indexing Service," MSDN Library Platform SDK, Aug. 2006.
- (2006) What Is the Indexing Service

28
- 0004064853
- Addison-Wesley
- R.C. Dorf, Modern Control Systems. Addison-Wesley, 1992.
- (1992) Modern Control Systems
- Dorf, R.C.¹

29
- 33646805439
- Maximum Entropy Based Generic Filter for Language Model Adaptation
- Mar
- D. Yu, M. Mahajan, P. Mau, and A. Acero, "Maximum Entropy Based Generic Filter for Language Model Adaptation," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '05), Mar. 2005.
- (2005) Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '05)
- Yu, D.¹ Mahajan, M.² Mau, P.³ Acero, A.⁴

30
- 0004067829
- Improved Acoustic Modeling for HMMs Using Linear Transformations,
- PhD dissertation, Dept. of Eng, Univ. of Cambridge, Feb
- C.J. Leggetter, "Improved Acoustic Modeling for HMMs Using Linear Transformations," PhD dissertation, Dept. of Eng., Univ. of Cambridge, Feb. 1995.
- (1995)
- Leggetter, C.J.¹

31
- 0028419019
- Maximum A Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains
- J.L. Gauvain and C.H. Lee, "Maximum A Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains," IEEE Trans. Speech and Audio Processing, vol. 2, pp. 291-298, 1994.
- (1994) IEEE Trans. Speech and Audio Processing , vol.2 , pp. 291-298
- Gauvain, J.L.¹ Lee, C.H.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.