SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2014, Pages 5597-5601

Standalone training of context-dependent deep neural network acoustic models

(2) Zhang, C a Woodland, P C a

a UNIVERSITY OF CAMBRIDGE (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

HIDDEN MARKOV MODELS; ITERATIVE METHODS; SPEECH RECOGNITION;

CONTEXT DEPENDENT; CONTEXT INDEPENDENT; DECISION TREE BASED STATE TYING; DEEP NEURAL NETWORKS; GAUSSIAN MIXTURE MODEL; HIDDEN MARKOV MODELS (HMMS); OUTPUT DISTRIBUTION; WALL STREET JOURNAL;

SIGNAL PROCESSING;

EID: 84905222971 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6854674 Document Type: Conference Paper

Times cited : (25)

References (22)

1
- 0003444646
- Foundations, MIT Press, Cambridge, MA, USA, Jul
- D. E. Rumelhart, J. L. McClelland, and the PDP Research Group, Parallel Distributed Processing: Explorations in the Microstructure of Cognition, Volume 1: Foundations, MIT Press, Cambridge, MA, USA, Jul. 1986.
- (1986) The PDP Research Group, Parallel Distributed Processing: Explorations in the Microstructure of Cognition , vol.1
- Rumelhart, D.E.¹ McClelland, J.L.²

2
- 0003573244
- Kluwer Academic Publishers, Norwell, MA, USA
- H. A. Bourlard and N. Morgan, Connectionist Speech Recognition: A Hybrid Approach, Kluwer Academic Publishers, Norwell, MA, USA, 1993.
- (1993) Connectionist Speech Recognition: A Hybrid Approach
- Bourlard, H.A.¹ Morgan, N.²

3
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- Florence, Italy, Sep
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech'11, Florence, Italy, Sep. 2011, pp. 437-440.
- (2011) Proc. Interspeech'11 , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

4
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- Jan
- G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, Jan. 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

5
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition
- Nov
- G. E. Hinton, L. Deng, D. Yu, G. E. Dahl, A.-R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath, and B. Kinsbury, "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Processing Magazine, pp. 2-17, Nov. 2012.
- (2012) IEEE Signal Processing Magazine , pp. 2-17
- Hinton, G.E.¹ Deng, L.² Yu, D.³ Dahl, G.E.⁴ Mohamed, A.-R.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.N.¹⁰ Kinsbury, B.¹¹

6
- 84890492030
- An investigation of deep neural networks for noise robust speech recognition
- Vancouver, Canada
- M. L. Seltzer, D. Yu, and Y.-Q. Wang, "An investigation of deep neural networks for noise robust speech recognition," in Proc. ICASSP'13, Vancouver, Canada, 2013, pp. 7398-7402.
- (2013) Proc. ICASSP'13 , pp. 7398-7402
- Seltzer, M.L.¹ Yu, D.² Wang, Y.-Q.³

7
- 84890474716
- Deep neural network features and semi-supervised training for low resource speech recognition
- Vancouver, Canada
- S. Thomas, M. L. Seltzer, K. Church, and H. Hermansky, "Deep neural network features and semi-supervised training for low resource speech recognition," in Proc. ICASSP'13, Vancouver, Canada, 2013, pp. 6704-6708.
- (2013) Proc. ICASSP'13 , pp. 6704-6708
- Thomas, S.¹ Seltzer, M.L.² Church, K.³ Hermansky, H.⁴

8
- 84890537527
- Multi-level adaptive networks in tandem and hybrid ASR systems
- Vancouver, Canada
- P. Bell, P. Swietojanski, and S. Renals, "Multi-level adaptive networks in tandem and hybrid ASR systems," in Proc. ICASSP'13, Vancouver, Canada, 2013, pp. 7947-7951.
- (2013) Proc. ICASSP'13 , pp. 7947-7951
- Bell, P.¹ Swietojanski, P.² Renals, S.³

9
- 84893668957
- Investigation of multilingual deep neural networks for spoken term detection
- Olomouc, Czech Republic
- K. M. Knill, M. J. F. Gales, S. P. Rath, P. C. Woodland, C. Zhang, and S.-X. Zhang, "Investigation of multilingual deep neural networks for spoken term detection," in Proc. ASRU'13, Olomouc, Czech Republic, 2013, pp. 138-143.
- (2013) Proc. ASRU'13 , pp. 138-143
- Knill, K.M.¹ Gales, M.J.F.² Rath, S.P.³ Woodland, P.C.⁴ Zhang, C.⁵ Zhang, S.-X.⁶

10
- 0030648426
- Speech recognition using neural networks with forward-backward probability generated targets
- Munich, Germany
- Y.-H. Yan, M. Fanty, and R. Cole, "Speech recognition using neural networks with forward-backward probability generated targets," in Proc. ICASSP'97, Munich, Germany, 1997, pp. 3241-3244.
- (1997) Proc. ICASSP'97 , pp. 3241-3244
- Yan, Y.-H.¹ Fanty, M.² Cole, R.³

11
- 0002144369
- Tree-based state tying for high accuracy acoustic modelling
- Plainsboro, NJ, USA
- S. J. Young, J. J. Odell, and P. C. Woodland, "Tree-based state tying for high accuracy acoustic modelling," in Proc. Human Language Technology Workshop, Plainsboro, NJ, USA, 1994, pp. 307-312.
- (1994) Proc. Human Language Technology Workshop , pp. 307-312
- Young, S.J.¹ Odell, J.J.² Woodland, P.C.³

12
- 84926060821
- Large vocabulary continuous speech recognition using HTK
- Adelaide, Australia
- P. C. Woodland, J. J. Odell, V. Valtchev, and S. J. Young, "Large vocabulary continuous speech recognition using HTK," in Proc. ICASSP'94, Adelaide, Australia, 1994, vol. 2, pp. 125-128.
- (1994) Proc. ICASSP'94 , vol.2 , pp. 125-128
- Woodland, P.C.¹ Odell, J.J.² Valtchev, V.³ Young, S.J.⁴

13
- 0003487601
- Oxford University Press, Oxford, UK, Nov
- C. M. Bishop, Neural Networks for Pattern Recognition, Oxford University Press, Oxford, UK, Nov. 1995.
- (1995) Neural Networks for Pattern Recognition
- Bishop, C.M.¹

14
- 0003571976
- Cambridge University Engineering Department, Cambridge, UK
- S. J. Young, G. Evermann, M. J. F. Gales, T. Hain., D. Kershaw, X.-Y. Liu, G. Moore, J. J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. C. Woodland, The HTK book (for HTK version 3.4), Cambridge University Engineering Department, Cambridge, UK, 2006.
- (2006) The HTK Book (For HTK Version 3.4)
- Young, S.J.¹ Evermann, G.² Gales, M.J.F.³ Hain, T.⁴ Kershaw, D.⁵ Liu, X.-Y.⁶ Moore, G.⁷ Odell, J.J.⁸ Ollason, D.⁹ Povey, D.¹⁰ Valtchev, V.¹¹ Woodland, P.C.¹²

15
- 0742286348
- Robust combination of neural networks and hidden Markov models for speech recognition
- Nov
- E. Trentin and M. Gori, "Robust combination of neural networks and hidden Markov models for speech recognition," IEEE Transactions on Neural Networks, vol. 14, no. 6, pp. 1519-1531, Nov. 2003.
- (2003) IEEE Transactions on Neural Networks , vol.14 , Issue.6 , pp. 1519-1531
- Trentin, E.¹ Gori, M.²

16
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- Waikoloa, HI, USA
- F. Seide, G. Li, X. Chen, and Y. Dong, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU'11, Waikoloa, HI, USA, 2011, pp. 24-29.
- (2011) Proc. ASRU'11 , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Dong, Y.⁴

17
- 0028996852
- The 1994 HTK large vocabulary speech recognition system
- Detroit, MI, USA
- P. C. Woodland, C. J. Leggetter, J. J. Odell, V. Valtchev, and S. J. Young, "The 1994 HTK large vocabulary speech recognition system," in Proc. ICASSP'95, Detroit, MI, USA, 1995, vol. 1, pp. 73-76.
- (1995) Proc. ICASSP'95 , vol.1 , pp. 73-76
- Woodland, P.C.¹ Leggetter, C.J.² Odell, J.J.³ Valtchev, V.⁴ Young, S.J.⁵

18
- 0003871508
- Ph.D. thesis, John Hopkins University, Baltimore, MD, USA
- N. Kumar, Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition, Ph.D. thesis, John Hopkins University, Baltimore, MD, USA, 1997.
- (1997) Investigation of Silicon-auditory Models and Generalization of Linear Discriminant Analysis for Improved Speech Recognition
- Kumar, N.¹

19
- 0141703325
- Automatic complexity control for HLDA systems
- Hong Kong, Hong Kong, Apr
- X.-Y. Liu, M. J. F. Gales, and P. C. Woodland, "Automatic complexity control for HLDA systems," in Proc. ICASSP'03, Hong Kong, Hong Kong, Apr. 2003, vol. 1, pp. 132-135.
- (2003) Proc. ICASSP'03 , vol.1 , pp. 132-135
- Liu, X.-Y.¹ Gales, M.J.F.² Woodland, P.C.³

20
- 0036296863
- Minimum phone error and I-smoothing for improved discriminative training
- Orlando, FL, USA
- D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP'02, Orlando, FL, USA, 2002, vol. 1, pp. 105-108.
- (2002) Proc. ICASSP'02 , vol.1 , pp. 105-108
- Povey, D.¹ Woodland, P.C.²

21
- 84893712779
- D. Johnson, "QuickNet," www1.icsi.berkeley.edu/ speech/qn.html.
- QuickNet
- Johnson, D.¹

22
- 4544265717
- Ph.D. thesis, Cambridge University Engineering Department, Cambridge, UK
- D. Povey, Discriminative Training for Large Vocabulary Speech Recognition, Ph.D. thesis, Cambridge University Engineering Department, Cambridge, UK, 2003.
- (2003) Discriminative Training for Large Vocabulary Speech Recognition
- Povey, D.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.