SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2014, Pages 830-834

Distributed learning of multilingual DNN feature extractors using GPUs

(3) Miao, Yajie a Zhang, Hao a Metze, Florian a

a Carnegie Mellon University (United States)

Author keywords

Automatic speech recognition; Deep neural networks; Distributed learning

Indexed keywords

PROGRAM PROCESSORS; SPEECH COMMUNICATION; STOCHASTIC SYSTEMS;

AUTOMATIC SPEECH RECOGNITION; CROSS LANGUAGES; DEEP NEURAL NETWORKS; DISTRIBUTED LEARNING; FEATURE EXTRACTOR; LEARNING PROCESS; MULTILINGUAL TRAININGS; STOCHASTIC GRADIENT DESCENT;

SPEECH RECOGNITION;

EID: 84910068044 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (19)

References (32)

1
- 84055222005
- Contextdependent pre-trained deep neural networks for large vocabulary speech recognition
- G. Dahl, D. Yu, L. Deng, and A. Acero, "Contextdependent pre-trained deep neural networks for large vocabulary speech recognition, " IEEE Transactions on Audio, Speech and Language Processing, vol. 20(1), pp. 30-42, 2012.
- (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.¹ Yu, D.² Deng, L.³ Acero, A.⁴

2
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription, " in Proc. ASRU, pp. 24-29, 2011.
- (2011) Proc. ASRU , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

3
- 84910031119
- Towards speaker adaptive training of deep neural network acoustic models
- Y. Miao, H. Zhang, and F. Metze, "Towards speaker adaptive training of deep neural network acoustic models, " to appear in Proc. Interspeech, 2014.
- (2014) Proc. Interspeech
- Miao, Y.¹ Zhang, H.² Metze, F.³

4
- 84906273501
- Improving low-resource CDDNN- HMM using dropout and multilingual DNN training
- Y. Miao, and F. Metze, "Improving low-resource CDDNN- HMM using dropout and multilingual DNN training, " in Proc. Interspeech, pp. 2237-2241, 2013.
- (2013) Proc. Interspeech , pp. 2237-2241
- Miao, Y.¹ Metze, F.²

5
- 84893701756
- Deep maxout networks for low-resource speech recognition
- Y. Miao, F. Metze, and S. Rawat, "Deep maxout networks for low-resource speech recognition, " in Proc. ASRU, 2013.
- (2013) Proc. ASRU
- Miao, Y.¹ Metze, F.² Rawat, S.³

6
- 84874278045
- Unsupervised cross-lingual knowledge transfer in DNNbased LVCSR
- P. Swietojanski, A. Ghoshal, and S. Renals, "Unsupervised cross-lingual knowledge transfer in DNNbased LVCSR, " in Proc. SLT, pp. 246-251, 2012.
- (2012) Proc. SLT , pp. 246-251
- Swietojanski, P.¹ Ghoshal, A.² Renals, S.³

7
- 84878559540
- An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance
- N. T. Vu, W. Breiter, F. Metze, and T. Schultz, "An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance, " in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Vu, N.T.¹ Breiter, W.² Metze, F.³ Schultz, T.⁴

8
- 85083953021
- Feature learning in deep neural networks - studies on speech recognition tasks
- D. Yu, M. L. Seltzer, J. Li, J. Huang, and F. Seide, "Feature learning in deep neural networks - studies on speech recognition tasks, " in International Conference on Learning Representations 2013.
- (2013) International Conference on Learning Representations
- Yu, D.¹ Seltzer, M.L.² Li, J.³ Huang, J.⁴ Seide, F.⁵

9
- 84890527497
- Crosslanguage knowledge transfer using multilingual deep neural network with shared hidden layers
- J.-T. Huang, J. Li, D. Yu, L. Deng, and Y. Gong, "Crosslanguage knowledge transfer using multilingual deep neural network with shared hidden layers, " in Proc. ICASSP, pp. 7304-7308, 2013.
- (2013) Proc. ICASSP , pp. 7304-7308
- Huang, J.-T.¹ Li, J.² Yu, D.³ Deng, L.⁴ Gong, Y.⁵

10
- 84905239342
- Improving deep neural network acoustic models using generalized maxout networks
- X. Zhang, J. Trmal, D. Povey, and S. Khudanpur, "Improving deep neural network acoustic models using generalized maxout networks, " in Proc. ICASSP, 2014.
- (2014) Proc. ICASSP
- Zhang, X.¹ Trmal, J.² Povey, D.³ Khudanpur, S.⁴

11
- 84886829539
- Optimization techniques to improve training speed of deep neural networks for large speech tasks
- T. N. Sainath, B. Kingsbury, H. Soltau, and B. Ramabhadran, "Optimization techniques to improve training speed of deep neural networks for large speech tasks, " IEEE Transactions on Audio, Speech and Language Processing, vol. 21, no. 11, pp. 2267-2276, 2013.
- (2013) IEEE Transactions on Audio, Speech and Language Processing , vol.21 , Issue.11 , pp. 2267-2276
- Sainath, T.N.¹ Kingsbury, B.² Soltau, H.³ Ramabhadran, B.⁴

12
- 84878397276
- Pipelined back-propagation for context-dependent deep neural networks
- X. Chen, A. Eversole, G. Li, D. Yu, and F. Seide, "Pipelined back-propagation for context-dependent deep neural networks, " in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Chen, X.¹ Eversole, A.² Li, G.³ Yu, D.⁴ Seide, F.⁵

13
- 84890512601
- Asynchronous stochastic gradient descent for DNN training
- S. Zhang, C. Zhang, Z. You, R. Zheng, and B. Xu, "Asynchronous stochastic gradient descent for DNN training, " in Proc. ICASSP, pp. 6660-6663, 2013.
- (2013) Proc. ICASSP , pp. 6660-6663
- Zhang, S.¹ Zhang, C.² You, Z.³ Zheng, R.⁴ Xu, B.⁵

14
- 84890539009
- Multilingual acoustic models using distributed deep neural networks
- G. Heigold, V. Vanhoucke, A. Senior, P. Nguyen, M. Ranzato, M. Devin, and J. Dean, "Multilingual acoustic models using distributed deep neural networks, " in Proc. ICASSP, pp. 8619-8623, 2013.
- (2013) Proc. ICASSP , pp. 8619-8623
- Heigold, G.¹ Vanhoucke, V.² Senior, A.³ Nguyen, P.⁴ Ranzato, M.⁵ Devin, M.⁶ Dean, J.⁷

15
- 84877760312
- Large scale distributed deep networks
- J. Dean, G. Corrado, R. Monga, K. Chen, M. Devin, Q. Le, M. Mao, M. Ranzato, A. Senior, P. Tucker, K. Yang, and A. Ng, "Large scale distributed deep networks, " in Proc. NIPS, 2012.
- (2012) Proc. NIPS
- Dean, J.¹ Corrado, G.² Monga, R.³ Chen, K.⁴ Devin, M.⁵ Le, Q.⁶ Mao, M.⁷ Ranzato, M.⁸ Senior, A.⁹ Tucker, P.¹⁰ Yang, K.¹¹ Ng, A.¹²

16
- 84867135575
- Building high-level features using large scale unsupervised learning
- Q. Le, M. Ranzato, R. Monga, M. Devin, K. Chen, G. Corrado, J. Dean, and A. Ng, "Building high-level features using large scale unsupervised learning, " in Proc. International Conference on Machine Learning, pp. 81- 88, 2012.
- (2012) Proc. International Conference on Machine Learning , pp. 81-88
- Le, Q.¹ Ranzato, M.² Monga, R.³ Devin, M.⁴ Chen, K.⁵ Corrado, G.⁶ Dean, J.⁷ Ng, A.⁸

17
- 84910028405
- Improving language-universal feature extraction with deep maxout and convolutional neural networks
- Y. Miao, and F. Metze, "Improving language-universal feature extraction with deep maxout and convolutional neural networks, " to appear in Proc. Interspeech, 2014.
- (2014) Proc. Interspeech
- Miao, Y.¹ Metze, F.²

18
- 84890495545
- Subspace mixture model for low-resource speech recognition in crosslingual settings
- Y. Miao, F. Metze, and A. Waibel, "Subspace mixture model for low-resource speech recognition in crosslingual settings, " in Proc. ICASSP, pp. 7339-7342, 2013.
- (2013) Proc. ICASSP , pp. 7339-7342
- Miao, Y.¹ Metze, F.² Waibel, A.³

19
- 80052652249
- Efficient large-scale distributed training of conditional maximum entropy models
- G. Mann, R. Mcdonald, M. Mohri, N. Silberman, and D. D. Walker, "Efficient large-scale distributed training of conditional maximum entropy models, " in Proc. NIPS, 2009.
- (2009) Proc. NIPS
- Mann, G.¹ Mcdonald, R.² Mohri, M.³ Silberman, N.⁴ Walker, D.D.⁵

20
- 84910080310
- Asynchronous distributed learning of topic models
- A. Asuncion, P. Smyth, and M. Welling, "Asynchronous distributed learning of topic models, " in Proc. NIPS, 2012.
- (2012) Proc. NIPS
- Asuncion, A.¹ Smyth, P.² Welling, M.³

21
- 85161967549
- Parallelized stochastic gradient descent
- M. A. Zinkevich, A. Smola, M. Weimer, L. Li, "Parallelized stochastic gradient descent, " in Proc. NIPS, 2010.
- (2010) Proc. NIPS
- Zinkevich, M.A.¹ Smola, A.² Weimer, M.³ Li, L.⁴

22
- 79251574977
- The efficient incorporation of MLP features into automatic speech recognition systems
- J. Park, F. Diehl, M.J.F. Gales, M. Tomalin, and P.C. Woodland, "The efficient incorporation of MLP features into automatic speech recognition systems, " Computer Speech and Language, volume 25, issue 3, pp. 519-534, 2011.
- (2011) Computer Speech and Language , vol.25 , Issue.3 , pp. 519-534
- Park, J.¹ Diehl, F.² Gales, M.J.F.³ Tomalin, M.⁴ Woodland, P.C.⁵

23
- 85008530698
- Transcribing Mandarin broadcast speech using multi-layer perceptron acoustic features
- F. Valente, M. M. Doss, C. Plahl, S. Ravuri, and W. Wang, "Transcribing Mandarin broadcast speech using multi-layer perceptron acoustic features, " IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 8, pp. 2439-2450, 2011.
- (2011) IEEE Transactions on Audio, Speech and Language Processing , vol.19 , Issue.8 , pp. 2439-2450
- Valente, F.¹ Doss, M.M.² Plahl, C.³ Ravuri, S.⁴ Wang, W.⁵

24
- 84878582419
- Cross-lingual and ensemble MLPs strategies for low-resource speech recognition
- Y. Qian, and J. Liu, "Cross-lingual and ensemble MLPs strategies for low-resource speech recognition, " in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Qian, Y.¹ Liu, J.²

25
- 84890482429
- Extracting deep bottleneck features using stacked autoencoders
- J. Gehring, Y. Miao, F. Metze, and A. Waibel, "Extracting deep bottleneck features using stacked autoencoders, " in Proc. ICASSP, pp. 3377-3381, 2013.
- (2013) Proc. ICASSP , pp. 3377-3381
- Gehring, J.¹ Miao, Y.² Metze, F.³ Waibel, A.⁴

26
- 84906273176
- Modular combination of deep neural networks for acoustic modeling
- J. Gehring, W. Lee, K. Kilgour, I. Lane, Y. Miao, and A. Waibel, "Modular Combination of Deep Neural Networks for Acoustic Modeling, " in Proc. Interspeech, pp. 94-98, 2013.
- (2013) Proc. Interspeech , pp. 94-98
- Gehring, J.¹ Lee, W.² Kilgour, K.³ Lane, I.⁴ Miao, Y.⁵ Waibel, A.⁶

27
- 84906283232
- Using conversational word bursts in spoken term detection
- J. Chiu, and A. Rudnicky, "Using conversational word bursts in spoken term detection, " in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Chiu, J.¹ Rudnicky, A.²

28
- 84910068915
- Combination of FST and CN search in spoken term detection
- J. Chiu, Y. Wang, J. Trmal, D. Povey, G. Chen, and A. Rudnicky, "Combination of FST and CN search in spoken term detection, " to appear in Proc. Interspeech, 2014.
- (2014) Proc. Interspeech
- Chiu, J.¹ Wang, Y.² Trmal, J.³ Povey, D.⁴ Chen, G.⁵ Rudnicky, A.⁶

29
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition, " Computer Speech and Language, vol. 12, pp. 75-98, 1998.
- (1998) Computer Speech and Language , vol.12 , pp. 75-98
- Gales, M.¹

30
- 84910038371
- arXiv:1401.6984
- Y. Miao, "Kaldi+PDNN: Building DNN-based ASR systems with Kaldi and PDNN, " arXiv:1401.6984, 2014.
- (2014) Kaldi+PDNN: Building DNN-based ASR Systems with Kaldi and PDNN
- Miao, Y.¹

31
- 84867605836
- Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition
- O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition, " in Proc. ICASSP, pp. 4277-4280, 2012.
- (2012) Proc. ICASSP , pp. 4277-4280
- Abdel-Hamid, O.¹ Mohamed, A.² Jiang, H.³ Penn, G.⁴

32
- 84890525984
- Deep convolutional neural networks for LVCSR
- T. N. Sainath, A. Mohamed, B. Kingsbury, and B. Ramabhadran, "Deep convolutional neural networks for LVCSR, " in Proc. ICASSP, pp. 8614-8618, 2013.
- (2013) Proc. ICASSP , pp. 8614-8618
- Sainath, T.N.¹ Mohamed, A.² Kingsbury, B.³ Ramabhadran, B.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.