SCOPUS 정보 검색 플랫폼

Foundations and Trends in Signal Processing

Volumn 7, Issue 3-4, 2013, Pages 197-387

Deep learning: Methods and applications

(2) Deng, Li a Yu, Dong a

a MICROSOFT RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

SPEECH RECOGNITION; TEXT PROCESSING;

APPLICATION AREA; DEEP LEARNING; ITS APPLICATIONS; MULTI-MODAL INFORMATION; NATURAL LANGUAGES; SIGNAL AND INFORMATION PROCESSING;

DATA PROCESSING;

EID: 84903724014 PISSN: 19328346 EISSN: 19328354 Source Type: Journal
DOI: 10.1561/2000000039 Document Type: Review

Times cited : (3127)

References (446)

1
- 84906214784
- Exploring convolutional neural network structures and optimization for speech recognition
- O. Abdel-Hamid, L. Deng, and D. Yu. Exploring convolutional neural network structures and optimization for speech recognition. Proceedings of Interspeech, 2013.
- (2013) Proceedings of Interspeech
- Abdel-Hamid, O.¹ Deng, L.² Yu, D.³

2
- 84906282118
- Deep segmental neural networks for speech recognition
- O. Abdel-Hamid, L. Deng, D. Yu, and H. Jiang. Deep segmental neural networks for speech recognition. In Proceedings of Interspeech. 2013.
- (2013) Proceedings of Interspeech
- Abdel-Hamid, O.¹ Deng, L.² Yu, D.³ Jiang, H.⁴

3
- 84867605836
- Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition
- O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn. Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Abdel-Hamid, O.¹ Mohamed, A.² Jiang, H.³ Penn, G.⁴

4
- 0000130170
- HMM adaptation using vector taylor series for noisy speech recognition
- A. Acero, L. Deng, T. Kristjansson, and J. Zhang. HMM adaptation using vector taylor series for noisy speech recognition. In Proceedings of Interspeech. 2000.
- (2000) Proceedings of Interspeech
- Acero, A.¹ Deng, L.² Kristjansson, T.³ Zhang, J.⁴

5
- 85083953791
- What regularized autoencoders learn from the data generating distribution
- G. Alain and Y. Bengio. What regularized autoencoders learn from the data generating distribution. In Proceedings of International Conference on Learning Representations (ICLR). 2013.
- (2013) Proceedings of International Conference on Learning Representations (ICLR)
- Alain, G.¹ Bengio, Y.²

6
- 85015872485
- Deep learning comes of age
- June
- G. Anthes. Deep learning comes of age. Communications of the Association for Computing Machinery (ACM), 56(6):13-15, June 2013.
- (2013) Communications of the Association for Computing Machinery (ACM) , vol.56 , Issue.6 , pp. 13-15
- Anthes, G.¹

7
- 77958488310
- Deep machine learning - A new frontier in artificial intelligence
- November
- I. Arel, C. Rose, and T. Karnowski. Deep machine learning - a new frontier in artificial intelligence. IEEE Computational Intelligence Magazine, 5:13-18, November 2010.
- (2010) IEEE Computational Intelligence Magazine , vol.5 , pp. 13-18
- Arel, I.¹ Rose, C.² Karnowski, T.³

8
- 85075436378
- Deep neural network language models
- E. Arisoy, T. Sainath, B. Kingsbury, and B. Ramabhadran. Deep neural network language models. In Proceedings of the Joint Human Language Technology Conference and the North American Chapter of the Association of Computational Linguistics (HLT-NAACL) Workshop. 2012.
- (2012) Proceedings of the Joint Human Language Technology Conference and the North American Chapter of the Association of Computational Linguistics (HLT-NAACL) Workshop
- Arisoy, E.¹ Sainath, T.² Kingsbury, B.³ Ramabhadran, B.⁴

9
- 84898982419
- Convex two-layer modeling
- O. Aslan, H. Cheng, D. Schuurmans, and X. Zhang. Convex two-layer modeling. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Aslan, O.¹ Cheng, H.² Schuurmans, D.³ Zhang, X.⁴

10
- 84896515095
- Adaptive dropout for training deep neural networks
- J. Ba and B. Frey. Adaptive dropout for training deep neural networks. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Ba, J.¹ Frey, B.²

11
- 85032751593
- Research developments and directions in speech recognition and understanding
- May
- J. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O'Shaughnessy. Research developments and directions in speech recognition and understanding. IEEE Signal Processing Magazine, 26(3):75-80, May 2009.
- (2009) IEEE Signal Processing Magazine , vol.26 , Issue.3 , pp. 75-80
- Baker, J.¹ Deng, L.² Glass, J.³ Khudanpur, S.⁴ Lee, C.-H.⁵ Morgan, N.⁶ O'shaughnessy, D.⁷

12
- 85032759066
- Updated MINS report on speech recognition and understanding
- July
- J. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O'Shaughnessy. Updated MINS report on speech recognition and understanding. IEEE Signal Processing Magazine, 26(4), July 2009.
- (2009) IEEE Signal Processing Magazine , vol.26 , Issue.4
- Baker, J.¹ Deng, L.² Glass, J.³ Khudanpur, S.⁴ Lee, C.-H.⁵ Morgan, N.⁶ O'shaughnessy, D.⁷

13
- 84896497059
- Understanding dropout
- P. Baldi and P. Sadowski. Understanding dropout. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Baldi, P.¹ Sadowski, P.²

14
- 84903729003
- E. Battenberg, E. Schmidt, and J. Bello. Deep learning for music, special session at International Conference on Acoustics Speech and Signal Processing (ICASSP) (http://www.icassp2014.org/ special-sections.html#ss8), 2014.
- (2014) Deep Learning for Music, Special Session at International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Battenberg, E.¹ Schmidt, E.² Bello, J.³

15
- 84873426072
- Analyzing drum patterns using conditional deep belief networks
- E. Batternberg and D. Wessel. Analyzing drum patterns using conditional deep belief networks. In Proceedings of International Symposium on Music Information Retrieval (ISMIR). 2012.
- (2012) Proceedings of International Symposium on Music Information Retrieval (ISMIR)
- Batternberg, E.¹ Wessel, D.²

16
- 84890537527
- Multi-level adaptive networks in tandem and hybrid ASR systems
- P. Bell, P. Swietojanski, and S. Renals. Multi-level adaptive networks in tandem and hybrid ASR systems. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Bell, P.¹ Swietojanski, P.² Renals, S.³

17
- 0013269553
- Ph.D. Thesis, McGill University, Montreal, Canada
- Y. Bengio. Artificial neural networks and their application to sequence recognition. Ph.D. Thesis, McGill University, Montreal, Canada, 1991.
- (1991) Artificial Neural Networks and Their Application to Sequence Recognition
- Bengio, Y.¹

18
- 0142192256
- Technical Report, University of Montreal
- Y. Bengio. New distributed probabilistic language models. Technical Report, University of Montreal, 2002.
- (2002) New Distributed Probabilistic Language Models
- Bengio, Y.¹

19
- 79959407847
- Neural net language models
- Y. Bengio. Neural net language models. Scholarpedia, 3, 2008.
- (2008) Scholarpedia , vol.3
- Bengio, Y.¹

20
- 69349090197
- Learning deep architectures for AI
- Y. Bengio. Learning deep architectures for AI. in Foundations and Trends in Machine Learning, 2(1):1-127, 2009.
- (2009) Foundations and Trends in Machine Learning , vol.2 , Issue.1 , pp. 1-127
- Bengio, Y.¹

21
- 84904548965
- Deep learning of representations for unsupervised and transfer learning
- Y. Bengio. Deep learning of representations for unsupervised and transfer learning. Journal of Machine Learning Research Workshop and Conference Proceedings, 27:17-37, 2012.
- (2012) Journal of Machine Learning Research Workshop and Conference Proceedings , vol.27 , pp. 17-37
- Bengio, Y.¹

22
- 84883201530
- Deep learning of representations: Looking forward
- Springer
- Y. Bengio. Deep learning of representations: Looking forward. In Statistical Language and Speech Processing, pages 1-37. Springer, 2013.
- (2013) Statistical Language and Speech Processing , pp. 1-37
- Bengio, Y.¹

23
- 84890543516
- Advances in optimizing recurrent networks
- Y. Bengio, N. Boulanger, and R. Pascanu. Advances in optimizing recurrent networks. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Bengio, Y.¹ Boulanger, N.² Pascanu, R.³

24
- 84879854889
- Representation learning: A review and new perspectives
- Y. Bengio, A. Courville, and P. Vincent. Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 38:1798-1828, 2013.
- (2013) IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) , vol.38 , pp. 1798-1828
- Bengio, Y.¹ Courville, A.² Vincent, P.³

25
- 0026835134
- Global optimization of a neural network-hidden markov model hybrid
- Y. Bengio, R. De Mori, G. Flammia, and R. Kompe. Global optimization of a neural network-hidden markov model hybrid. IEEE Transactions on Neural Networks, 3:252-259, 1992.
- (1992) IEEE Transactions on Neural Networks , vol.3 , pp. 252-259
- Bengio, Y.¹ De Mori, R.² Flammia, G.³ Kompe, R.⁴

26
- 0009577944
- A neural probabilistic language model
- Y. Bengio, R. Ducharme, P. Vincent, and C. Jauvin. A neural probabilistic language model. In Proceedings of Neural Information Processing Systems (NIPS). 2000.
- (2000) Proceedings of Neural Information Processing Systems (NIPS)
- Bengio, Y.¹ Ducharme, R.² Vincent, P.³ Jauvin, C.⁴

27
- 0142166851
- A neural probabilistic language model
- Y. Bengio, R. Ducharme, P. Vincent, and C. Jauvin. A neural probabilistic language model. Journal of Machine Learning Research, 3:1137-1155, 2003.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 1137-1155
- Bengio, Y.¹ Ducharme, R.² Vincent, P.³ Jauvin, C.⁴

28
- 85150229208
- Greedy layerwise training of deep networks
- Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle. Greedy layerwise training of deep networks. In Proceedings of Neural Information Processing Systems (NIPS). 2006.
- (2006) Proceedings of Neural Information Processing Systems (NIPS)
- Bengio, Y.¹ Lamblin, P.² Popovici, D.³ Larochelle, H.⁴

29
- 0028392483
- Learning long-term dependencies with gradient descent is difficult
- Y. Bengio, P. Simard, and P. Frasconi. Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks, 5:157-166, 1994.
- (1994) IEEE Transactions on Neural Networks , vol.5 , pp. 157-166
- Bengio, Y.¹ Simard, P.² Frasconi, P.³

30
- 84893376517
- Deep generative stochastic networks trainable by backprop. ArXiv 1306:1091, 2013. Also accepted to appear
- Y. Bengio, E. Thibodeau-Laufer, and J. Yosinski. Deep generative stochastic networks trainable by backprop. arXiv 1306:1091, 2013. also accepted to appear in Proceedings of International Conference on Machine Learning (ICML), 2014.
- (2014) Proceedings of International Conference on Machine Learning (ICML)
- Bengio, Y.¹ Thibodeau-Laufer, E.² Yosinski, J.³

31
- 84899017362
- Generalized denoising autoencoders as generative models
- Y. Bengio, L. Yao, G. Alain, and P. Vincent. Generalized denoising autoencoders as generative models. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Bengio, Y.¹ Yao, L.² Alain, G.³ Vincent, P.⁴

32
- 84857855190
- Random search for hyper-parameter optimization
- J. Bergstra and Y. Bengio. Random search for hyper-parameter optimization. Journal on Machine Learning Research, 3:281-305, 2012.
- (2012) Journal on Machine Learning Research , vol.3 , pp. 281-305
- Bergstra, J.¹ Bengio, Y.²

33
- 0035250280
- An application of discriminative feature extraction to filter-bank-based speech recognition
- A. Biem, S. Katagiri, E. McDermott, and B. Juang. An application of discriminative feature extraction to filter-bank-based speech recognition. IEEE Transactions on Speech and Audio Processing, 9:96-110, 2001.
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , pp. 96-110
- Biem, A.¹ Katagiri, S.² McDermott, E.³ Juang, B.⁴

34
- 85032751937
- Dynamic graphical models
- J. Bilmes. Dynamic graphical models. IEEE Signal Processing Magazine, 33:29-42, 2010.
- (2010) IEEE Signal Processing Magazine , vol.33 , pp. 29-42
- Bilmes, J.¹

35
- 85032752364
- Graphical model architectures for speech recognition
- J. Bilmes and C. Bartels. Graphical model architectures for speech recognition. IEEE Signal Processing Magazine, 22:89-100, 2005.
- (2005) IEEE Signal Processing Magazine , vol.22 , pp. 89-100
- Bilmes, J.¹ Bartels, C.²

36
- 84877727208
- A semantic matching energy function for learning with multi-relational data - Application to word-sense disambiguation
- May
- A. Bordes, X. Glorot, J. Weston, and Y. Bengio. A semantic matching energy function for learning with multi-relational data - application to word-sense disambiguation. Machine Learning, May 2013.
- (2013) Machine Learning
- Bordes, A.¹ Glorot, X.² Weston, J.³ Bengio, Y.⁴

37
- 80055048322
- Learning structured embeddings of knowledge bases
- A. Bordes, J. Weston, R. Collobert, and Y. Bengio. Learning structured embeddings of knowledge bases. In Proceedings of Association for the Advancement of Artificial Intelligence (AAAI). 2011.
- (2011) Proceedings of Association for the Advancement of Artificial Intelligence (AAAI)
- Bordes, A.¹ Weston, J.² Collobert, R.³ Bengio, Y.⁴

38
- 84890014676
- From machine learning to machine reasoning: An essay
- L. Bottou. From machine learning to machine reasoning: An essay. Journal of Machine Learning Research, 14:3207-3260, 2013.
- (2013) Journal of Machine Learning Research , vol.14 , pp. 3207-3260
- Bottou, L.¹

39
- 84899022736
- Large scale online learning
- L. Bottou and Y. LeCun. Large scale online learning. In Proceedings of Neural Information Processing Systems (NIPS). 2004.
- (2004) Proceedings of Neural Information Processing Systems (NIPS)
- Bottou, L.¹ Lecun, Y.²

40
- 84867129058
- Modeling Temporal dependencies in high-dimensional sequences: Application to polyphonic music generation and transcription
- N. Boulanger-Lewandowski, Y. Bengio, and P. Vincent. Modeling Temporal dependencies in high-dimensional sequences: Application to polyphonic music generation and transcription. In Proceedings of International Conference on Machine Learning (ICML). 2012.
- (2012) Proceedings of International Conference on Machine Learning (ICML)
- Boulanger-Lewandowski, N.¹ Bengio, Y.² Vincent, P.³

41
- 85054275611
- Audio chord recognition with recurrent neural networks
- N. Boulanger-Lewandowski, Y. Bengio, and P. Vincent. Audio chord recognition with recurrent neural networks. In Proceedings of International Symposium on Music Information Retrieval (ISMIR). 2013.
- (2013) Proceedings of International Symposium on Music Information Retrieval (ISMIR)
- Boulanger-Lewandowski, N.¹ Bengio, Y.² Vincent, P.³

42
- 0003573244
- Kluwer, Norwell, MA
- H. Bourlard and N. Morgan. Connectionist Speech Recognition: A Hybrid Approach. Kluwer, Norwell, MA, 1993.
- (1993) Connectionist Speech Recognition: A Hybrid Approach
- Bourlard, H.¹ Morgan, N.²

43
- 80055043740
- Ph.D. thesis, MIT
- J. Bouvrie. Hierarchical learning: Theory with applications in speech and vision. Ph.D. thesis, MIT, 2009.
- (2009) Hierarchical Learning: Theory with Applications in Speech and Vision
- Bouvrie, J.¹

44
- 0030196364
- Stacked regression
- L. Breiman. Stacked regression. Machine Learning, 24:49-64, 1996.
- (1996) Machine Learning , vol.24 , pp. 49-64
- Breiman, L.¹

45
- 84903690898
- Final Report for 1998 Workshop on Language Engineering, CLSP, Johns Hopkins
- J. Bridle, L. Deng, J. Picone, H. Richards, J. Ma, T. Kamm, M. Schuster, S. Pike, and R. Reagan. An investigation of segmental hidden dynamic models of speech coarticulation for automatic speech recognition. Final Report for 1998 Workshop on Language Engineering, CLSP, Johns Hopkins, 1998.
- (1998) An Investigation of Segmental Hidden Dynamic Models of Speech Coarticulation for Automatic Speech Recognition
- Bridle, J.¹ Deng, L.² Picone, J.³ Richards, H.⁴ Ma, J.⁵ Kamm, T.⁶ Schuster, M.⁷ Pike, S.⁸ Reagan, R.⁹

46
- 84886675337
- Large vocabulary speech recognition on parallel architectures
- November
- P. Cardinal, P. Dumouchel, and G. Boulianne. Large vocabulary speech recognition on parallel architectures. IEEE Transactions on Audio, Speech, and Language Processing, 21(11):2290-2300, November 2013.
- (2013) IEEE Transactions on Audio Speech, and Language Processing , vol.21 , Issue.11 , pp. 2290-2300
- Cardinal, P.¹ Dumouchel, P.² Boulianne, G.³

47
- 0031189914
- Multitask learning
- R. Caruana. Multitask learning. Machine Learning, 28:41-75, 1997.
- (1997) Machine Learning , vol.28 , pp. 41-75
- Caruana, R.¹

48
- 85083950550
- A primal-dual method for training recurrent neural networks constrained by the echo-state property
- April
- J. Chen and L. Deng. A primal-dual method for training recurrent neural networks constrained by the echo-state property. In Proceedings of International Conference on Learning Representations. April 2014.
- (2014) Proceedings of International Conference on Learning Representations
- Chen, J.¹ Deng, L.²

49
- 84878397276
- Pipelined backpropagation for context-dependent deep neural networks
- X. Chen, A. Eversole, G. Li, D. Yu, and F. Seide. Pipelined backpropagation for context-dependent deep neural networks. In Proceedings of Interspeech. 2012.
- (2012) Proceedings of Interspeech
- Chen, X.¹ Eversole, A.² Li, G.³ Yu, D.⁴ Seide, F.⁵

50
- 0031146514
- Hmm-based speech recognition using state-dependent, discriminatively derived transforms on Mel-warped DFT features
- R. Chengalvarayan and L. Deng. Hmm-based speech recognition using state-dependent, discriminatively derived transforms on Mel-warped DFT features. IEEE Transactions on Speech and Audio Processing, pages 243-256, 1997.
- (1997) IEEE Transactions on Speech and Audio Processing , pp. 243-256
- Chengalvarayan, R.¹ Deng, L.²

51
- 0031139776
- Use of generalized dynamic feature parameters for speech recognition
- R. Chengalvarayan and L. Deng. Use of generalized dynamic feature parameters for speech recognition. IEEE Transactions on Speech and Audio Processing, pages 232-242, 1997a.
- (1997) IEEE Transactions on Speech and Audio Processing , pp. 232-242
- Chengalvarayan, R.¹ Deng, L.²

52
- 0032206267
- Speech trajectory discrimination using the minimum classification error learning
- R. Chengalvarayan and L. Deng. Speech trajectory discrimination using the minimum classification error learning. IEEE Transactions on Speech and Audio Processing, 6(6):505-515, 1998.
- (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , Issue.6 , pp. 505-515
- Chengalvarayan, R.¹ Deng, L.²

53
- 78149327741
- Kernel methods for deep learning
- Y. Cho and L. Saul. Kernel methods for deep learning. In Proceedings of Neural Information Processing Systems (NIPS), pages 342-350. 2009.
- (2009) Proceedings of Neural Information Processing Systems (NIPS) , pp. 342-350
- Cho, Y.¹ Saul, L.²

54
- 84877789057
- Deep neural networks segment neuronal membranes in electron microscopy images
- D. Ciresan, A. Giusti, L. Gambardella, and J. Schmidhuber. Deep neural networks segment neuronal membranes in electron microscopy images. In Proceedings of Neural Information Processing Systems (NIPS). 2012.
- (2012) Proceedings of Neural Information Processing Systems (NIPS)
- Ciresan, D.¹ Giusti, A.² Gambardella, L.³ Schmidhuber, J.⁴

55
- 78649669320
- Deep, big, simple neural nets for handwritten digit recognition
- December
- D. Ciresan, U. Meier, L. Gambardella, and J. Schmidhuber. Deep, big, simple neural nets for handwritten digit recognition. Neural Computation, December 2010.
- (2010) Neural Computation
- Ciresan, D.¹ Meier, U.² Gambardella, L.³ Schmidhuber, J.⁴

56
- 80054740693
- A committee of neural networks for traffic sign classification
- D. Ciresan, U. Meier, J. Masci, and J. Schmidhuber. A committee of neural networks for traffic sign classification. In Proceedings of International Joint Conference on Neural Networks (IJCNN). 2011.
- (2011) Proceedings of International Joint Conference on Neural Networks (IJCNN)
- Ciresan, D.¹ Meier, U.² Masci, J.³ Schmidhuber, J.⁴

57
- 84866714584
- Multi-column deep neural networks for image classification
- D. Ciresan, U. Meier, and J. Schmidhuber. Multi-column deep neural networks for image classification. In Proceedings of Computer Vision and Pattern Recognition (CVPR). 2012.
- (2012) Proceedings of Computer Vision and Pattern Recognition (CVPR)
- Ciresan, D.¹ Meier, U.² Schmidhuber, J.³

58
- 84865094974
- Transfer learning for Latin and Chinese characters with deep neural networks
- D. C. Ciresan, U. Meier, and J. Schmidhuber. Transfer learning for Latin and Chinese characters with deep neural networks. In Proceedings of International Joint Conference on Neural Networks (IJCNN). 2012.
- (2012) Proceedings of International Joint Conference on Neural Networks (IJCNN)
- Ciresan, D.C.¹ Meier, U.² Schmidhuber, J.³

59
- 84897484337
- Deep learning with COTS HPC
- A. Coates, B. Huval, T. Wang, D. Wu, A. Ng, and B. Catanzaro. Deep learning with COTS HPC. In Proceedings of International Conference on Machine Learning (ICML). 2013.
- (2013) Proceedings of International Conference on Machine Learning (ICML)
- Coates, A.¹ Huval, B.² Wang, T.³ Wu, D.⁴ Ng, A.⁵ Catanzaro, B.⁶

60
- 84880708659
- Stacked sequential learning
- W. Cohen and R. V. de Carvalho. Stacked sequential learning. In Proceedings of International Joint Conference on Artificial Intelligence (IJCAI), pages 671-676. 2005.
- (2005) Proceedings of International Joint Conference on Artificial Intelligence (IJCAI) , pp. 671-676
- Cohen, W.¹ De Carvalho, R.V.²

61
- 84876808666
- Deep learning for efficient discriminative parsing
- R. Collobert. Deep learning for efficient discriminative parsing. In Proceedings of Artificial Intelligence and Statistics (AISTATS). 2011.
- (2011) Proceedings of Artificial Intelligence and Statistics (AISTATS)
- Collobert, R.¹

62
- 56449095373
- A unified architecture for natural language processing: Deep neural networks with multitask learning
- R. Collobert and J. Weston. A unified architecture for natural language processing: Deep neural networks with multitask learning. In Proceedings of International Conference on Machine Learning (ICML). 2008.
- (2008) Proceedings of International Conference on Machine Learning (ICML)
- Collobert, R.¹ Weston, J.²

63
- 80053558787
- Natural language processing (almost) from scratch
- R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu, and P. Kuksa. Natural language processing (almost) from scratch. Journal on Machine Learning Research, 12:2493-2537, 2011.
- (2011) Journal on Machine Learning Research , vol.12 , pp. 2493-2537
- Collobert, R.¹ Weston, J.² Bottou, L.³ Karlen, M.⁴ Kavukcuoglu, K.⁵ Kuksa, P.⁶

64
- 85162069624
- Phone recognition with the mean-covariance restricted boltzmann machine
- G. Dahl, M. Ranzato, A. Mohamed, and G. Hinton. Phone recognition with the mean-covariance restricted boltzmann machine. In Proceedings of Neural Information Processing Systems (NIPS), volume 23, pages 469-477. 2010.
- (2010) Proceedings of Neural Information Processing Systems (NIPS) , vol.23 , pp. 469-477
- Dahl, G.¹ Ranzato, M.² Mohamed, A.³ Hinton, G.⁴

65
- 84890527827
- Improving deep neural networks for LVCSR using rectified linear units and dropout
- G. Dahl, T. Sainath, and G. Hinton. Improving deep neural networks for LVCSR using rectified linear units and dropout. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Dahl, G.¹ Sainath, T.² Hinton, G.³

66
- 84890516914
- Large-scale malware classification using random projections and neural networks
- G. Dahl, J. Stokes, L. Deng, and D. Yu. Large-scale malware classification using random projections and neural networks. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Dahl, G.¹ Stokes, J.² Deng, L.³ Yu, D.⁴

67
- 84905237729
- Context-dependent DBNHMMs in large vocabulary continuous speech recognition
- G. Dahl, D. Yu, L. Deng, and A. Acero. Context-dependent DBNHMMs in large vocabulary continuous speech recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2011.
- (2011) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Dahl, G.¹ Yu, D.² Deng, L.³ Acero, A.⁴

68
- 84055222005
- Context-dependent, pre-trained deep neural networks for large vocabulary speech recognition
- January
- G. Dahl, D. Yu, L. Deng, and A. Acero. Context-dependent, pre-trained deep neural networks for large vocabulary speech recognition. IEEE Transactions on Audio, Speech, & Language Processing, 20(1):30-42, January 2012.
- (2012) IEEE Transactions on Audio Speech, & Language Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.¹ Yu, D.² Deng, L.³ Acero, A.⁴

69
- 84877760312
- Large scale distributed deep networks
- J. Dean, G. Corrado, R. Monga, K. Chen, M. Devin, Q. Le, M. Mao, M. Ranzato, A. Senior, P. Tucker, K. Yang, and A. Ng. Large scale distributed deep networks. In Proceedings of Neural Information Processing Systems (NIPS). 2012.
- (2012) Proceedings of Neural Information Processing Systems (NIPS)
- Dean, J.¹ Corrado, G.² Monga, R.³ Chen, K.⁴ Devin, M.⁵ Le, Q.⁶ Mao, M.⁷ Ranzato, M.⁸ Senior, A.⁹ Tucker, P.¹⁰ Yang, K.¹¹ Ng, A.¹²

70
- 84893695632
- Porting concepts from DNNs back to GMMs
- K. Demuynck and F. Triefenbach. Porting concepts from DNNs back to GMMs. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Demuynck, K.¹ Triefenbach, F.²

71
- 0026854213
- A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
- L. Deng. A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal. Signal Processing, 27(1):65-78, 1992.
- (1992) Signal Processing , vol.27 , Issue.1 , pp. 65-78
- Deng, L.¹

72
- 0027678649
- A stochastic model of speech incorporating hierarchical nonstationarity
- L. Deng. A stochastic model of speech incorporating hierarchical nonstationarity. IEEE Transactions on Speech and Audio Processing, 1(4):471-475, 1993.
- (1993) IEEE Transactions on Speech and Audio Processing , vol.1 , Issue.4 , pp. 471-475
- Deng, L.¹

73
- 0032119268
- A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition
- L. Deng. A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition. Speech Communication, 24(4):299-323, 1998.
- (1998) Speech Communication , vol.24 , Issue.4 , pp. 299-323
- Deng, L.¹

74
- 0039503389
- Computational models for speech production
- Springer Verlag
- L. Deng. Computational models for speech production. In Computational Models of Speech Pattern Processing, pages 199-213. Springer Verlag, 1999.
- (1999) Computational Models of Speech Pattern Processing , pp. 199-213
- Deng, L.¹

75
- 33744966595
- Switching dynamic system models for speech articulation and acoustics
- Springer-Verlag, New York
- L. Deng. Switching dynamic system models for speech articulation and acoustics. In Mathematical Foundations of Speech and Language Processing, pages 115-134. Springer-Verlag, New York, 2003.
- (2003) Mathematical Foundations of Speech and Language Processing , pp. 115-134
- Deng, L.¹

76
- 34547507549
- Morgan & Claypool, December
- L. Deng. Dynamic Speech Models - Theory, Algorithm, and Application. Morgan & Claypool, December 2006.
- (2006) Dynamic Speech Models - Theory, Algorithm, and Application
- Deng, L.¹

77
- 84866857547
- An overview of deep-structured learning for information processing
- October
- L. Deng. An overview of deep-structured learning for information processing. In Proceedings of Asian-Pacific Signal & Information Processing Annual Summit and Conference (APSIPA-ASC). October 2011.
- (2011) Proceedings of Asian-Pacific Signal & Information Processing Annual Summit and Conference (APSIPA-ASC)
- Den, L.¹

78
- 85032752689
- The MNIST database of handwritten digit images for machine learning research
- November
- L. Deng. The MNIST database of handwritten digit images for machine learning research. IEEE Signal Processing Magazine, 29(6), November 2012.
- (2012) IEEE Signal Processing Magazine , vol.29 , Issue.6
- Deng, L.¹

79
- 85048991523
- Design and learning of output representations for speech recognition
- December
- L. Deng. Design and learning of output representations for speech recognition. In Neural Information Processing Systems (NIPS) Workshop on Learning Output Representations. December 2013.
- (2013) Neural Information Processing Systems (NIPS) Workshop on Learning Output Representations
- Deng, L.¹

80
- 84903716586
- A tutorial survey of architectures, algorithms, and applications for deep learning
- L. Deng. A tutorial survey of architectures, algorithms, and applications for deep learning. In Asian-Pacific Signal & Information Processing Association Transactions on Signal and Information Processing. 2013.
- (2013) Asian-Pacific Signal & Information Processing Association Transactions on Signal and Information Processing
- Deng, L.¹

81
- 84890545163
- A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion
- L. Deng, O. Abdel-Hamid, and D. Yu. A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Deng, L.¹ Abdel-Hamid, O.² Yu, D.³

82
- 0034855352
- High performance robust speech recognition using stereo training data
- L. Deng, A. Acero, L. Jiang, J. Droppo, and X. Huang. High performance robust speech recognition using stereo training data. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2001.
- (2001) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Deng, L.¹ Acero, A.² Jiang, L.³ Droppo, J.⁴ Huang, X.⁵

83
- 0031185482
- Speaker-independent phonetic classification using hidden markov models with state-conditioned mixtures of trend functions
- L. Deng and M. Aksmanovic. Speaker-independent phonetic classification using hidden markov models with state-conditioned mixtures of trend functions. IEEE Transactions on Speech and Audio Processing, 5:319-324, 1997.
- (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , pp. 319-324
- Deng, L.¹ Aksmanovic, M.²

84
- 0028516022
- Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
- L. Deng, M. Aksmanovic, D. Sun, and J. Wu. Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states. IEEE Transactions on Speech and Audio Processing, 2(4):507-520, 1994.
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.4 , pp. 507-520
- Deng, L.¹ Aksmanovic, M.² Sun, D.³ Wu, J.⁴

85
- 84905280906
- Sequence classification using the high-level features extracted from deep neural networks
- L. Deng and J. Chen. Sequence classification using the high-level features extracted from deep neural networks. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2014.
- (2014) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Deng, L.¹ Chen, J.²

86
- 0026458724
- Structural design of a hidden Markov model based speech recognizer using multi-valued phonetic features: Comparison with segmental speech units
- L. Deng and K. Erler. Structural design of a hidden Markov model based speech recognizer using multi-valued phonetic features: Comparison with segmental speech units. Journal of the Acoustical Society of America, 92(6):3058-3067, 1992.
- (1992) Journal of the Acoustical Society of America , vol.92 , Issue.6 , pp. 3058-3067
- Deng, L.¹ Erler, K.²

87
- 0028256706
- Analysis of correlation structure for a neural predictive model with application to speech recognition
- L. Deng, K. Hassanein, and M. Elmasry. Analysis of correlation structure for a neural predictive model with application to speech recognition. Neural Networks, 7(2):331-339, 1994.
- (1994) Neural Networks , vol.7 , Issue.2 , pp. 331-339
- Deng, L.¹ Hassanein, K.² Elmasry, M.³

88
- 84890494546
- Deep stacking networks for information retrieval
- L. Deng, X. He, and J. Gao. Deep stacking networks for information retrieval. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013c.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Deng, L.¹ He, X.² Gao, J.³

89
- 84890526837
- New types of deep neural network learning for speech recognition and related applications: An overview
- L. Deng, G. Hinton, and B. Kingsbury. New types of deep neural network learning for speech recognition and related applications: An overview. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013b.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Deng, L.¹ Hinton, G.² Kingsbury, B.³

90
- 4243109553
- Challenges in adopting speech recognition
- January
- L. Deng and X. D. Huang. Challenges in adopting speech recognition. Communications of the Association for Computing Machinery (ACM), 47(1):11-13, January 2004.
- (2004) Communications of the Association for Computing Machinery (ACM) , vol.47 , Issue.1 , pp. 11-13
- Deng, L.¹ Huang, X.D.²

91
- 84878540767
- Parallel training of deep stacking networks
- L. Deng, B. Hutchinson, and D. Yu. Parallel training of deep stacking networks. In Proceedings of Interspeech. 2012b.
- (2012) Proceedings of Interspeech
- Deng, L.¹ Hutchinson, B.² Yu, D.³

92
- 0026189555
- Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition
- L. Deng, M. Lennig, V. Gupta, F. Seitz, P. Mermelstein, and P. Kenny. Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition. IEEE Transactions on Signal Processing, 39(7):1677-1681, 1991.
- (1991) IEEE Transactions on Signal Processing , vol.39 , Issue.7 , pp. 1677-1681
- Deng, L.¹ Lennig, M.² Gupta, V.³ Seitz, F.⁴ Mermelstein, P.⁵ Kenny, P.⁶

93
- 10244257175
- Large vocabulary word recognition using context-dependent allophonic hidden Markov models
- L. Deng, M. Lennig, F. Seitz, and P. Mermelstein. Large vocabulary word recognition using context-dependent allophonic hidden Markov models. Computer Speech and Language, 4(4):345-357, 1990.
- (1990) Computer Speech and Language , vol.4 , Issue.4 , pp. 345-357
- Deng, L.¹ Lennig, M.² Seitz, F.³ Mermelstein, P.⁴

94
- 84890491198
- Recent advances in deep learning for speech research at Microsoft
- L. Deng, J. Li, K. Huang, Yao, D. Yu, F. Seide, M. Seltzer, G. Zweig, X. He, J. Williams, Y. Gong, and A. Acero. Recent advances in deep learning for speech research at Microsoft. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013a.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Deng, L.¹ Li, J.² Huang, K.³ Yao⁴ Yu, D.⁵ Seide, F.⁶ Seltzer, M.⁷ Zweig, G.⁸ He, X.⁹ Williams, J.¹⁰ Gong, Y.¹¹ Acero, A.¹²

95
- 84876672166
- Machine learning paradigms in speech recognition: An overview
- May
- L. Deng and X. Li. Machine learning paradigms in speech recognition: An overview. IEEE Transactions on Audio, Speech, & Language, 21:1060-1089, May 2013.
- (2013) IEEE Transactions on Audio, Speech, & Language , vol.21 , pp. 1060-1089
- Deng, L.¹ Li, X.²

96
- 0033623527
- Spontaneous speech recognition using a statistical coarticulatory model for the vocal tract resonance dynamics
- L. Deng and J. Ma. Spontaneous speech recognition using a statistical coarticulatory model for the vocal tract resonance dynamics. Journal of the Acoustical Society America, 108:3036-3048, 2000.
- (2000) Journal of the Acoustical Society America , vol.108 , pp. 3036-3048
- Deng, L.¹ Ma, J.²

97
- 4243117872
- Marcel Dekker
- L. Deng and D. O'Shaughnessy. Speech Processing - A Dynamic and Optimization-Oriented Approach. Marcel Dekker, 2003.
- (2003) Speech Processing - A Dynamic and Optimization-oriented Approach
- Deng, L.¹ O'shaughnessy, D.²

98
- 0031198059
- Production models as a structural basis for automatic speech recognition
- August
- L. Deng, G. Ramsay, and D. Sun. Production models as a structural basis for automatic speech recognition. Speech Communication, 33(2-3):93-111, August 1997.
- (1997) Speech Communication , vol.33 , Issue.2-3 , pp. 93-111
- Deng, L.¹ Ramsay, G.² Sun, D.³

99
- 0030190520
- Transitional speech units and their representation by regressive Markov states: Applications to speech recognition
- July
- L. Deng and H. Sameti. Transitional speech units and their representation by regressive Markov states: Applications to speech recognition. IEEE Transactions on speech and audio processing, 4(4):301-306, July 1996.
- (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.4 , pp. 301-306
- Deng, L.¹ Sameti, H.²

100
- 79959842828
- Binary coding of speech spectrograms using a deep autoencoder
- L. Deng, M. Seltzer, D. Yu, A. Acero, A. Mohamed, and G. Hinton. Binary coding of speech spectrograms using a deep autoencoder. In Proceedings of Interspeech. 2010.
- (2010) Proceedings of Interspeech
- Deng, L.¹ Seltzer, M.² Yu, D.³ Acero, A.⁴ Mohamed, A.⁵ Hinton, G.⁶

101
- 0028234947
- A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features
- L. Deng and D. Sun. A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features. Journal of the Acoustical Society of America, 85(5):2702-2719, 1994.
- (1994) Journal of the Acoustical Society of America , vol.85 , Issue.5 , pp. 2702-2719
- Deng, L.¹ Sun, D.²

102
- 84874256530
- Use of kernel deep convex networks and end-to-end learning for spoken language understanding
- December
- L. Deng, G. Tur, X. He, and D. Hakkani-Tur. Use of kernel deep convex networks and end-to-end learning for spoken language understanding. In Proceedings of IEEE Workshop on Spoken Language Technologies. December 2012.
- (2012) Proceedings of IEEE Workshop on Spoken Language Technologies
- Deng, L.¹ Tur, G.² He, X.³ Hakkani-Tur, D.⁴

103
- 0036880074
- Distributed speech processing in mipad's multimodal user interface
- L. Deng, K.Wang, A. Acero, H.W. Hon, J. Droppo, C. Boulis, Y.Wang, D. Jacoby, M. Mahajan, C. Chelba, and X. Huang. Distributed speech processing in mipad's multimodal user interface. IEEE Transactions on Speech and Audio Processing, 10(8):605-619, 2002.
- (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.8 , pp. 605-619
- Deng, L.¹ Wang, K.² Acero, A.³ Hon, H.W.⁴ Droppo, J.⁵ Boulis, C.⁶ Wang, Y.⁷ Jacoby, D.⁸ Mahajan, M.⁹ Chelba, C.¹⁰ Huang, X.¹¹

104
- 18744401086
- Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
- L. Deng, J. Wu, J. Droppo, and A. Acero. Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion. IEEE Transactions on Speech and Audio Processing, 13(3):412-421, 2005.
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.3 , pp. 412-421
- Deng, L.¹ Wu, J.² Droppo, J.³ Acero, A.⁴

105
- 34547551709
- Use of differential cepstra as acoustic features in hidden trajectory modeling for phonetic recognition
- L. Deng and D. Yu. Use of differential cepstra as acoustic features in hidden trajectory modeling for phonetic recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2007.
- (2007) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Deng, L.¹ Yu, D.²

106
- 84865768819
- Deep convex network: A scalable architecture for speech pattern classification
- L. Deng and D. Yu. Deep convex network: A scalable architecture for speech pattern classification. In Proceedings of Interspeech. 2011.
- (2011) Proceedings of Interspeech
- Deng, L.¹ Yu, D.²

107
- 33744966561
- A bidirectional target filtering model of speech coarticulation: Two-stage implementation for phonetic recognition
- January
- L. Deng, D. Yu, and A. Acero. A bidirectional target filtering model of speech coarticulation: Two-stage implementation for phonetic recognition. IEEE Transactions on Audio and Speech Processing, 14(1):256-265, January 2006.
- (2006) IEEE Transactions on Audio and Speech Processing , vol.14 , Issue.1 , pp. 256-265
- Deng, L.¹ Yu, D.² Acero, A.³

108
- 34047266395
- Structured speech modeling
- September
- L. Deng, D. Yu, and A. Acero. Structured speech modeling. IEEE Transactions on Audio, Speech and Language Processing, 14(5):1492-1504, September 2006.
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1492-1504
- Deng, L.¹ Yu, D.² Acero, A.³

109
- 84890468916
- Deep learning for speech recognition and related applications
- L. Deng, D. Yu, and G. Hinton. Deep learning for speech recognition and related applications. Neural Information Processing Systems (NIPS) Workshop, 2009.
- (2009) Neural Information Processing Systems (NIPS) Workshop
- Deng, L.¹ Yu, D.² Hinton, G.³

110
- 84867614591
- Scalable stacking and learning for building deep architectures
- L. Deng, D. Yu, and J. Platt. Scalable stacking and learning for building deep architectures. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012a.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Deng, L.¹ Yu, D.² Platt, J.³

111
- 84991233704
- A deep learning approach to machine transliteration
- Athens, Greece, March
- T. Deselaers, S. Hasan, O. Bender, and H. Ney. A deep learning approach to machine transliteration. In Proceedings of 4th Workshop on Statistical Machine Translation, pages 233-241. Athens, Greece, March 2009.
- (2009) Proceedings of 4th Workshop on Statistical Machine Translation , pp. 233-241
- Deselaers, T.¹ Hasan, S.² Bender, O.³ Ney, H.⁴

112
- 84903704484
- Thesis, Universidad Autonoma de Madrid, SPAIN, September
- A. Diez. Automatic language recognition using deep neural networks. Thesis, Universidad Autonoma de Madrid, SPAIN, September 2013.
- (2013) Automatic Language Recognition Using Deep Neural Networks
- Diez, A.¹

113
- 84893690218
- Combining stochastic average gradient and hessian-free optimization for sequence training of deep neural networks
- P. Dognin and V. Goel. Combining stochastic average gradient and hessian-free optimization for sequence training of deep neural networks. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Dognin, P.¹ Goel, V.²

114
- 80055055551
- Why does unsupervised pre-training help deep learning?
- D. Erhan, Y. Bengio, A. Courvelle, P.Manzagol, P. Vencent, and S. Bengio. Why does unsupervised pre-training help deep learning? Journal on Machine Learning Research, pages 201-208, 2010.
- (2010) Journal on Machine Learning Research , pp. 201-208
- Erhan, D.¹ Bengio, Y.² Courvelle, A.³ Manzagol, P.⁴ Vencent, P.⁵ Bengio, S.⁶

115
- 84890522099
- F0 contour prediction with a deep belief network-gaussian process hybrid model
- R. Fernandez, A. Rendel, B. Ramabhadran, and R. Hoory. F0 contour prediction with a deep belief network-gaussian process hybrid model. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP), pages 6885-6889. 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP) , pp. 6885-6889
- Fernandez, R.¹ Rendel, A.² Ramabhadran, B.³ Hoory, R.⁴

116
- 0032119668
- The hierarchical hidden Markov model: Analysis and applications
- S. Fine, Y. Singer, and N. Tishby. The hierarchical hidden Markov model: Analysis and applications. Machine Learning, 32:41-62, 1998.
- (1998) Machine Learning , vol.32 , pp. 41-62
- Fine, S.¹ Singer, Y.² Tishby, N.³

117
- 84898958665
- Devise: A deep visual-semantic embedding model
- A. Frome, G. Corrado, J. Shlens, S. Bengio, J. Dean, M. Ranzato, and T. Mikolov. Devise: A deep visual-semantic embedding model. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Frome, A.¹ Corrado, G.² Shlens, J.³ Bengio, S.⁴ Dean, J.⁵ Ranzato, M.⁶ Mikolov, T.⁷

118
- 44849099965
- Phone-discriminating minimum classification error (p-mce) training for phonetic recognition
- Q. Fu, X. He, and L. Deng. Phone-discriminating minimum classification error (p-mce) training for phonetic recognition. In Proceedings of Interspeech. 2007.
- (2007) Proceedings of Interspeech
- Fu, Q.¹ He, X.² Deng, L.³

119
- 84893675167
- Model-based approaches to handling uncertainty
- Springer
- M. Gales. Model-based approaches to handling uncertainty. In Robust Speech Recognition of Uncertain or Missing Data: Theory and Application, pages 101-125. Springer, 2011.
- (2011) Robust Speech Recognition of Uncertain or Missing Data: Theory and Application , pp. 101-125
- Gales, M.¹

120
- 78651327667
- Clickthrough-based translation models for web search: From word models to phrase models
- J. Gao, X. He, and J.-Y. Nie. Clickthrough-based translation models for web search: From word models to phrase models. In Proceedings of Conference on Information and Knowledge Management (CIKM). 2010.
- (2010) Proceedings of Conference on Information and Knowledge Management (CIKM)
- Gao, J.¹ He, X.² Nie, J.-Y.³

121
- 84955635544
- Learning semantic representations for the phrase translation model
- December
- J. Gao, X. He, W. Yih, and L. Deng. Learning semantic representations for the phrase translation model. In Proceedings of Neural Information Processing Systems (NIPS) Workshop on Deep Learning. December 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS) Workshop on Deep Learning
- Gao, J.¹ He, X.² Yih, W.³ Deng, L.⁴

122
- 84903689430
- MSR-TR-2013-88, September
- J. Gao, X. He, W. Yih, and L. Deng. Learning semantic representations for the phrase translation model. MSR-TR-2013-88, September 2013.
- (2013) Learning Semantic Representations for the Phrase Translation Model
- Gao, J.¹ He, X.² Yih, W.³ Deng, L.⁴

123
- 84906932220
- Learning continuous phrase representations for translation modeling
- J. Gao, X. He, W. Yih, and L. Deng. Learning continuous phrase representations for translation modeling. In Proceedings of Association for Computational Linguistics (ACL). 2014.
- (2014) Proceedings of Association for Computational Linguistics (ACL)
- Gao, J.¹ He, X.² Yih, W.³ Deng, L.⁴

124
- 80052118865
- Clickthrough-based latent semantic models for web search
- J. Gao, K. Toutanova, andW.-T. Yih. Clickthrough-based latent semantic models for web search. In Proceedings of Special Interest Group on Information Retrieval (SIGIR). 2011.
- (2011) Proceedings of Special Interest Group on Information Retrieval (SIGIR)
- Gao, J.¹ Toutanova, K.² Yih, W.-T.³

125
- 84877731706
- Discriminative learning of sum-product networks
- R. Gens and P. Domingo. Discriminative learning of sum-product networks. Neural Information Processing Systems (NIPS), 2012.
- (2012) Neural Information Processing Systems (NIPS)
- Gens, R.¹ Domingo, P.²

126
- 66149086672
- Ph.D. thesis, Stanford University
- D. George. How the brain might work: A hierarchical and temporal model for learning and recognition. Ph.D. thesis, Stanford University, 2008.
- (2008) How the Brain Might Work: A Hierarchical and Temporal Model for Learning and Recognition
- George, D.¹

127
- 77955783938
- Error approximation and minimum phone error acoustic model estimation
- August
- M. Gibson and T. Hain. Error approximation and minimum phone error acoustic model estimation. IEEE Transactions on Audio, Speech, and Language Processing, 18(6):1269-1279, August 2010.
- (2010) IEEE Transactions on Audio Speech, and Language Processing , vol.18 , Issue.6 , pp. 1269-1279
- Gibson, M.¹ Hain, T.²

128
- 84906343066
- arXiv:1311.2524v1
- R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. arXiv:1311.2524v1, 2013.
- (2013) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
- Girshick, R.¹ Donahue, J.² Darrell, T.³ Malik, J.⁴

129
- 79951563340
- Understanding the difficulty of training deep feed-forward neural networks
- X. Glorot and Y. Bengio. Understanding the difficulty of training deep feed-forward neural networks. In Proceedings of Artificial Intelligence and Statistics (AISTATS). 2010.
- (2010) Proceedings of Artificial Intelligence and Statistics (AISTATS)
- Glorot, X.¹ Bengio, Y.²

130
- 84872555593
- Deep sparse rectifier neural networks
- April
- X. Glorot, A. Bordes, and Y. Bengio. Deep sparse rectifier neural networks. In Proceedings of Artificial Intelligence and Statistics (AISTATS). April 2011.
- (2011) Proceedings of Artificial Intelligence and Statistics (AISTATS)
- Glorot, X.¹ Bordes, A.² Bengio, Y.³

131
- 84898988737
- Multi-prediction deep boltzmann machines
- I. Goodfellow, M. Mirza, A. Courville, and Y. Bengio. Multi-prediction deep boltzmann machines. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Goodfellow, I.¹ Mirza, M.² Courville, A.³ Bengio, Y.⁴

132
- 84903705932
- arXiv:1311.2746v1
- E. Grais, M. Sen, and H. Erdogan. Deep neural networks for single channel source separation. arXiv:1311.2746v1, 2013.
- (2013) Deep Neural Networks for Single Channel Source Separation
- Grais, E.¹ Sen, M.² Erdogan, H.³

133
- 84897549167
- Sequence transduction with recurrent neural networks
- A. Graves. Sequence transduction with recurrent neural networks. Representation Learning Workshop, International Conference on Machine Learning (ICML), 2012.
- (2012) Representation Learning Workshop, International Conference on Machine Learning (ICML)
- Graves, A.¹

134
- 34250704813
- Connectionist temporal classification: Labeling unsegmented sequence data with recurrent neural networks
- A. Graves, S. Fernandez, F. Gomez, and J. Schmidhuber. Connectionist temporal classification: Labeling unsegmented sequence data with recurrent neural networks. In Proceedings of International Conference on Machine Learning (ICML). 2006.
- (2006) Proceedings of International Conference on Machine Learning (ICML)
- Graves, A.¹ Fernandez, S.² Gomez, F.³ Schmidhuber, J.⁴

135
- 84893701254
- Hybrid speech recognition with deep bidirectional LSTM
- A. Graves, N. Jaitly, and A. Mohamed. Hybrid speech recognition with deep bidirectional LSTM. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Graves, A.¹ Jaitly, N.² Mohamed, A.³

136
- 84890543083
- Speech recognition with deep recurrent neural networks
- A. Graves, A. Mohamed, and G. Hinton. Speech recognition with deep recurrent neural networks. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Graves, A.¹ Mohamed, A.² Hinton, G.³

137
- 51449103447
- Optimizing bottle-neck features for LVCSR
- F. Grezl and P. Fousek. Optimizing bottle-neck features for LVCSR. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2008.
- (2008) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Grezl, F.¹ Fousek, P.²

138
- 84903743611
- C. Gulcehre, K. Cho, R. Pascanu, and Y. Bengio. Learnednorm pooling for deep feedforward and recurrent neural networks. http://arxiv.org/abs/1311.1780, 2014.
- (2014) Learnednorm Pooling for Deep Feedforward and Recurrent Neural Networks
- Gulcehre, C.¹ Cho, K.² Pascanu, R.³ Bengio, Y.⁴

139
- 84857892556
- Noise-contrastive estimation of unnormalized statistical models, with applications to natural image statistics
- M. Gutmann and A. Hyvarinen. Noise-contrastive estimation of unnormalized statistical models, with applications to natural image statistics. Journal of Machine Learning Research, 13:307-361, 2012.
- (2012) Journal of Machine Learning Research , vol.13 , pp. 307-361
- Gutmann, M.¹ Hyvarinen, A.²

140
- 85008520364
- Transcribing meetings with the AMIDA systems
- T. Hain, L. Burget, J. Dines, P. Garner, F. Grezl, A. Hannani, M. Huijbregts, M. Karafiat, M. Lincoln, and V. Wan. Transcribing meetings with the AMIDA systems. IEEE Transactions on Audio, Speech, and Language Processing, 20:486-498, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , pp. 486-498
- Hain, T.¹ Burget, L.² Dines, J.³ Garner, P.⁴ Grezl, F.⁵ Hannani, A.⁶ Huijbregts, M.⁷ Karafiat, M.⁸ Lincoln, M.⁹ Wan, V.¹⁰

141
- 84873584268
- Learning features from music audio with deep belief networks
- P. Hamel and D. Eck. Learning features from music audio with deep belief networks. In Proceedings of International Symposium on Music Information Retrieval (ISMIR). 2010.
- (2010) Proceedings of International Symposium on Music Information Retrieval (ISMIR)
- Hamel, P.¹ Eck, D.²

142
- 84855358050
- Numenta Technical Report, December 10 2010
- G. Hawkins, S. Ahmad, and D. Dubinsky. Hierarchical temporal memory including HTM cortical learning algorithms. Numenta Technical Report, December 10 2010.
- Hierarchical Temporal Memory Including HTM Cortical Learning Algorithms
- Hawkins, G.¹ Ahmad, S.² Dubinsky, D.³

143
- 33748168088
- Times Books, New York
- J. Hawkins and S. Blakeslee. On Intelligence: How a New Understanding of the Brain will lead to the Creation of Truly Intelligent Machines. Times Books, New York, 2004.
- (2004) On Intelligence: How A New Understanding of the Brain Will Lead to the Creation of Truly Intelligent Machines
- Hawkins, J.¹ Blakeslee, S.²

144
- 85032751114
- Speech recognition, machine translation, and speech translation - A unifying discriminative framework
- November 2011
- X. He and L. Deng. Speech recognition, machine translation, and speech translation - a unifying discriminative framework. IEEE Signal Processing Magazine, 28, November 2011.
- IEEE Signal Processing Magazine , vol.28
- He, X.¹ Deng, L.²

145
- 84867608216
- Optimization in speech-centric information processing: Criteria and techniques
- X. He and L. Deng. Optimization in speech-centric information processing: Criteria and techniques. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- He, X.¹ Deng, L.²

146
- 84876669905
- Speech-centric information processing: An optimization-oriented approach
- X. He and L. Deng. Speech-centric information processing: An optimization-oriented approach. In Proceedings of the IEEE. 2013.
- (2013) Proceedings of the IEEE
- He, X.¹ Deng, L.²

147
- 85032750905
- Discriminative learning in sequential pattern recognition - A unifying review for optimization-oriented speech recognition
- X. He, L. Deng, andW. Chou. Discriminative learning in sequential pattern recognition - a unifying review for optimization-oriented speech recognition. IEEE Signal Processing Magazine, 25:14-36, 2008.
- (2008) IEEE Signal Processing Magazine , vol.25 , pp. 14-36
- He, X.¹ Deng, L.² Chou, W.³

148
- 85008035419
- Equivalence of generative and log-liner models
- February
- G. Heigold, H. Ney, P. Lehnen, T. Gass, and R. Schluter. Equivalence of generative and log-liner models. IEEE Transactions on Audio, Speech, and Language Processing, 19(5):1138-1148, February 2011.
- (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.5 , pp. 1138-1148
- Heigold, G.¹ Ney, H.² Lehnen, P.³ Gass, T.⁴ Schluter, R.⁵

149
- 84887376734
- Investigations on an EM-style optimization algorithm for discriminative training of HMMs
- December
- G. Heigold, H. Ney, and R. Schluter. Investigations on an EM-style optimization algorithm for discriminative training of HMMs. IEEE Transactions on Audio, Speech, and Language Processing, 21(12):2616-2626, December 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.12 , pp. 2616-2626
- Heigold, G.¹ Ney, H.² Schluter, R.³

150
- 84890539009
- Multilingual acoustic models using distributed deep neural networks
- G. Heigold, V. Vanhoucke, A. Senior, P. Nguyen, M. Ranzato,M. Devin, and J. Dean. Multilingual acoustic models using distributed deep neural networks. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Heigold, G.¹ Vanhoucke, V.² Senior, A.³ Nguyen, P.⁴ Ranzatom. Devin, M.⁵ Dean, J.⁶

151
- 69249105007
- Discriminative input stream combination for conditional random field phone recognition
- November
- I. Heintz, E. Fosler-Lussier, and C. Brew. Discriminative input stream combination for conditional random field phone recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17(8):1533-1546, November 2009.
- (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , Issue.8 , pp. 1533-1546
- Heintz, I.¹ Fosler-Lussier, E.² Brew, C.³

152
- 84987895197
- Deep neural network approach for the dialog state tracking challenge
- M. Henderson, B. Thomson, and S. Young. Deep neural network approach for the dialog state tracking challenge. In Proceedings of Special Interest Group on Disclosure and Dialogue (SIGDIAL). 2013.
- (2013) Proceedings of Special Interest Group on Disclosure and Dialogue (SIGDIAL)
- Henderson, M.¹ Thomson, B.² Young, S.³

153
- 84898931970
- Training and analysing deep recurrent neural networks
- M. Hermans and B. Schrauwen. Training and analysing deep recurrent neural networks. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Hermans, M.¹ Schrauwen, B.²

154
- 0033709098
- Tandem connectionist feature extraction for conventional HMM systems
- H. Hermansky, D. Ellis, and S. Sharma. Tandem connectionist feature extraction for conventional HMM systems. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2000.
- (2000) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Hermansky, H.¹ Ellis, D.² Sharma, S.³

155
- 70350435251
- Speech recognition using augmented conditional random fields
- February
- Y. Hifny and S. Renals. Speech recognition using augmented conditional random fields. IEEE Transactions on Audio, Speech, and Language Processing, 17(2):354-365, February 2009.
- (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , Issue.2 , pp. 354-365
- Hifny, Y.¹ Renals, S.²

156
- 0025519204
- Mapping part-whole hierarchies into connectionist networks
- G. Hinton. Mapping part-whole hierarchies into connectionist networks. Artificial Intelligence, 46:47-75, 1990.
- (1990) Artificial Intelligence , vol.46 , pp. 47-75
- Hinton, G.¹

157
- 0009438133
- Preface to the special issue on connectionist symbol processing
- G. Hinton. Preface to the special issue on connectionist symbol processing. Artificial Intelligence, 46:1-4, 1990.
- (1990) Artificial Intelligence , vol.46 , pp. 1-4
- Hinton, G.¹

158
- 0037327724
- The ups and downs of Hebb synapses
- G. Hinton. The ups and downs of Hebb synapses. Canadian Psychology, 44:10-13, 2003.
- (2003) Canadian Psychology , vol.44 , pp. 10-13
- Hinton, G.¹

159
- 78650474133
- UTML Tech Report 2010-003, Univ. Toronto, August
- G. Hinton. A practical guide to training restricted boltzmann machines. UTML Tech Report 2010-003, Univ. Toronto, August 2010.
- (2010) A Practical Guide to Training Restricted Boltzmann Machines
- Hinton, G.¹

160
- 84903733700
- A better way to learn features
- October
- G. Hinton. A better way to learn features. Communications of the Association for Computing Machinery (ACM), 54(10), October 2011.
- (2011) Communications of the Association for Computing Machinery (ACM) , vol.54 , Issue.10
- Hinton, G.¹

161
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition
- November
- G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury. Deep neural networks for acoustic modeling in speech recognition. IEEE Signal Processing Magazine, 29(6):82-97, November 2012.
- (2012) IEEE Signal Processing Magazine , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

162
- 84883152678
- Transforming autoencoders
- G. Hinton, A. Krizhevsky, and S. Wang. Transforming autoencoders. In Proceedings of International Conference on Artificial Neural Networks. 2011.
- (2011) Proceedings of International Conference on Artificial Neural Networks
- Hinton, G.¹ Krizhevsky, A.² Wang, S.³

163
- 33745805403
- A fast learning algorithm for deep belief nets
- G. Hinton, S. Osindero, and Y. Teh. A fast learning algorithm for deep belief nets. Neural Computation, 18:1527-1554, 2006.
- (2006) Neural Computation , vol.18 , pp. 1527-1554
- Hinton, G.¹ Osindero, S.² Teh, Y.³

164
- 33746600649
- Reducing the dimensionality of data with neural networks
- July
- G. Hinton and R. Salakhutdinov. Reducing the dimensionality of data with neural networks. Science, 313(5786):504-507, July 2006.
- (2006) Science , vol.313 , Issue.5786 , pp. 504-507
- Hinton, G.¹ Salakhutdinov, R.²

165
- 79961245273
- Discovering binary codes for documents by learning deep generative models
- G. Hinton and R. Salakhutdinov. Discovering binary codes for documents by learning deep generative models. Topics in Cognitive Science, pages 1-18, 2010.
- (2010) Topics in Cognitive Science , pp. 1-18
- Hinton, G.¹ Salakhutdinov, R.²

166
- 84867720412
- arXiv: 1207.0580v1
- G. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Improving neural networks by preventing co-adaptation of feature detectors. arXiv: 1207.0580v1, 2012.
- (2012) Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
- Hinton, G.¹ Srivastava, N.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

167
- 0003575034
- Diploma thesis, Institut fur Informatik, Technische Universitat Munchen
- S. Hochreiter. Untersuchungen zu dynamischen neuronalen netzen. Diploma thesis, Institut fur Informatik, Technische Universitat Munchen, 1991.
- (1991) Untersuchungen zu Dynamischen Neuronalen Netzen
- Hochreiter, S.¹

168
- 0031573117
- Long short-term memory
- S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural Computation, 9:1735-1780, 1997.
- (1997) Neural Computation , vol.9 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

169
- 84878180089
- Improving word representations via global context and multiple word prototypes
- E. Huang, R. Socher, C. Manning, and A. Ng. Improving word representations via global context and multiple word prototypes. In Proceedings of Association for Computational Linguistics (ACL). 2012.
- (2012) Proceedings of Association for Computational Linguistics (ACL)
- Huang, E.¹ Socher, R.² Manning, C.³ Ng, A.⁴

170
- 84890527497
- Cross-language knowledge transfer using multilingual deep neural networks with shared hidden layers
- J. Huang, J. Li, L. Deng, and D. Yu. Cross-language knowledge transfer using multilingual deep neural networks with shared hidden layers. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Huang, J.¹ Li, J.² Deng, L.³ Yu, D.⁴

171
- 84890480288
- Random features for kernel deep convex network
- P. Huang, L. Deng, M. Hasegawa-Johnson, and X. He. Random features for kernel deep convex network. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Huang, P.¹ Deng, L.² Hasegawa-Johnson, M.³ He, X.⁴

172
- 84889566627
- Learning deep structured semantic models for web search using clickthrough data
- P. Huang, X. He, J. Gao, L. Deng, A. Acero, and L. Heck. Learning deep structured semantic models for web search using clickthrough data. Association for Computing Machinery (ACM) International Conference Information and Knowledge Management (CIKM), 2013.
- (2013) Association for Computing Machinery (ACM) International Conference Information and Knowledge Management (CIKM)
- Huang, P.¹ He, X.² Gao, J.³ Deng, L.⁴ Acero, A.⁵ Heck, L.⁶

173
- 84890479086
- Predicting speech recognition confidence using deep learning with word identity and score features
- P. Huang, K. Kumar, C. Liu, Y. Gong, and L. Deng. Predicting speech recognition confidence using deep learning with word identity and score features. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Huang, P.¹ Kumar, K.² Liu, C.³ Gong, Y.⁴ Deng, L.⁵

174
- 77956280276
- Hierarchical bayesian language models for conversational speech recognition
- November
- S. Huang and S. Renals. Hierarchical bayesian language models for conversational speech recognition. IEEE Transactions on Audio, Speech, and Language Processing, 18(8):1941-1954, November 2010.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , Issue.8 , pp. 1941-1954
- Huang, S.¹ Renals, S.²

175
- 0034842339
- Mipad: A multimodal interaction prototype
- X. Huang, A. Acero, C. Chelba, L. Deng, J. Droppo, D. Duchene, J. Goodman, and H. Hon. Mipad: A multimodal interaction prototype. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2001.
- (2001) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Huang, X.¹ Acero, A.² Chelba, C.³ Deng, L.⁴ Droppo, J.⁵ Duchene, D.⁶ Goodman, J.⁷ Hon, H.⁸

176
- 84906218045
- Semi-supervised GMMand DNN acoustic model training with multi-system combination and confidence re-calibration
- Y. Huang, D. Yu, Y. Gong, and C. Liu. Semi-supervised GMMand DNN acoustic model training with multi-system combination and confidence re-calibration. In Proceedings of Interspeech, pages 2360-2364. 2013.
- (2013) Proceedings of Interspeech , pp. 2360-2364
- Huang, Y.¹ Yu, D.² Gong, Y.³ Liu, C.⁴

177
- 84873577775
- Rethinking automatic chord recognition with convolutional neural networks
- E. Humphrey and J. Bello. Rethinking automatic chord recognition with convolutional neural networks. In Proceedings of International Conference on Machine Learning and Application (ICMLA). 2012a.
- (2012) Proceedings of International Conference on Machine Learning and Application (ICMLA)
- Humphrey, E.¹ Bello, J.²

178
- 84873453413
- Moving beyond feature design: Deep architectures and automatic feature learning in music informatics
- E. Humphrey, J. Bello, and Y. LeCun. Moving beyond feature design: Deep architectures and automatic feature learning in music informatics. In Proceedings of International Symposium on Music Information Retrieval (ISMIR). 2012.
- (2012) Proceedings of International Symposium on Music Information Retrieval (ISMIR)
- Humphrey, E.¹ Bello, J.² LeCun, Y.³

179
- 84888315556
- Feature learning and deep architectures: New directions for music informatics
- E. Humphrey, J. Bello, and Y. LeCun. Feature learning and deep architectures: New directions for music informatics. Journal of Intelligent Information Systems, 2013.
- (2013) Journal of Intelligent Information Systems
- Humphrey, E.¹ Bello, J.² Lecun, Y.³

180
- 84867606917
- A deep architecture with bilinear modeling of hidden representations: Applications to phonetic recognition
- B. Hutchinson, L. Deng, and D. Yu. A deep architecture with bilinear modeling of hidden representations: Applications to phonetic recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Hutchinson, B.¹ Deng, L.² Yu, D.³

181
- 84879301618
- Tensor deep stacking networks
- B. Hutchinson, L. Deng, and D. Yu. Tensor deep stacking networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35:1944-1957, 2013.
- (2013) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.35 , pp. 1944-1957
- Hutchinson, B.¹ Deng, L.² Yu, D.³

182
- 84893708321
- Impact of deep MLP architecture on different modeling techniques for under-resourced speech recognition
- D. Imseng, P. Motlicek, P. Garner, and H. Bourlard. Impact of deep MLP architecture on different modeling techniques for under-resourced speech recognition. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Imseng, D.¹ Motlicek, P.² Garner, P.³ Bourlard, H.⁴

183
- 80051609011
- Learning a better representation of speech sound waves using restricted boltzmann machines
- N. Jaitly and G. Hinton. Learning a better representation of speech sound waves using restricted boltzmann machines. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2011.
- (2011) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Jaitly, N.¹ Hinton, G.²

184
- 84878539964
- Application of pre-trained deep neural networks to large vocabulary speech recognition
- N. Jaitly, P. Nguyen, and V. Vanhoucke. Application of pre-trained deep
- (2012) Proceedings of Interspeech
- Jaitly, N.¹ Nguyen, P.² Vanhoucke, V.³

185
- 77953183471
- What is the best multistage architecture for object recognition?
- K. Jarrett, K. Kavukcuoglu, and Y. LeCun. What is the best multistage architecture for object recognition? In Proceedings of International Conference on Computer Vision, pages 2146-2153. 2009.
- (2009) Proceedings of International Conference on Computer Vision , pp. 2146-2153
- Jarrett, K.¹ Kavukcuoglu, K.² Lecun, Y.³

186
- 85032751120
- Parameter estimation of statistical models using convex optimization: An advanced method of discriminative training for speech and language processing
- H. Jiang and X. Li. Parameter estimation of statistical models using convex optimization: An advanced method of discriminative training for speech and language processing. IEEE Signal Processing Magazine, 27(3):115-127, 2010.
- (2010) IEEE Signal Processing Magazine , vol.27 , Issue.3 , pp. 115-127
- Jiang, H.¹ Li, X.²

187
- 0022691022
- Maximum likelihood estimation for multivariate mixture observations of Markov chains
- B. Juang, S. Levinson, and M. Sondhi. Maximum likelihood estimation for multivariate mixture observations of Markov chains. IEEE Transactions on Information Theory, 32:307-309, 1986.
- (1986) IEEE Transactions on Information Theory , vol.32 , pp. 307-309
- Juang, B.¹ Levinson, S.² Sondhi, M.³

188
- 0031139839
- Minimum classification error rate methods for speech recognition
- B.-H. Juang, W. Chou, and C.-H. Lee. Minimum classification error rate methods for speech recognition. IEEE Transactions On Speech and Audio Processing, 5:257-265, 1997.
- (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , pp. 257-265
- Juang, B.-H.¹ Chou, W.² Lee, C.-H.³

189
- 84892582758
- Combining modality specific deep neural networks for emotion recognition in video
- S. Kahou et al. Combining modality specific deep neural networks for emotion recognition in video. In Proceedings of International Conference on Multimodal Interaction (ICMI). 2013.
- (2013) Proceedings of International Conference on Multimodal Interaction (ICMI)
- Kahou, S.¹

190
- 84890527090
- Multi-distribution deep belief network for speech synthesis
- S. Kang, X. Qian, and H. Meng. Multi-distribution deep belief network for speech synthesis. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP), pages 8012-8016. 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP) , pp. 8012-8016
- Kang, S.¹ Qian, X.² Meng, H.³

191
- 84893709342
- Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition
- Y. Kashiwagi, D. Saito, N. Minematsu, and K. Hirose. Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Kashiwagi, Y.¹ Saito, D.² Minematsu, N.³ Hirose, K.⁴

192
- 85162460675
- Learning convolutional feature hierarchies for visual recognition
- K. Kavukcuoglu, P. Sermanet, Y. Boureau, K. Gregor, M. Mathieu, and Y. LeCun. Learning convolutional feature hierarchies for visual recognition. In Proceedings of Neural Information Processing Systems (NIPS). 2010.
- (2010) Proceedings of Neural Information Processing Systems (NIPS)
- Kavukcuoglu, K.¹ Sermanet, P.² Boureau, Y.³ Gregor, K.⁴ Mathieu, M.⁵ Lecun, Y.⁶

193
- 77955803591
- Enhanced phone posteriors for improving speech recognition systems
- August
- H. Ketabdar and H. Bourlard. Enhanced phone posteriors for improving speech recognition systems. IEEE Transactions on Audio, Speech, and Language Processing, 18(6):1094-1106, August 2010.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , Issue.6 , pp. 1094-1106
- Ketabdar, H.¹ Bourlard, H.²

194
- 70349213445
- Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
- B. Kingsbury. Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2009.
- (2009) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Kingsbury, B.¹

195
- 84878379108
- Scalable minimum bayes risk training of deep neural network acoustic models using distributed hessian-free optimization
- B. Kingsbury, T. Sainath, and H. Soltau. Scalable minimum bayes risk training of deep neural network acoustic models using distributed hessian-free optimization. In Proceedings of Interspeech. 2012.
- (2012) Proceedings of Interspeech
- Kingsbury, B.¹ Sainath, T.² Soltau, H.³

196
- 84906924453
- Multimodal neural language models
- R. Kiros, R. Zemel, and R. Salakhutdinov. Multimodal neural language models. In Proceedings of Neural Information Processing Systems (NIPS) Deep Learning Workshop. 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS) Deep Learning Workshop
- Kiros, R.¹ Zemel, R.² Salakhutdinov, R.³

197
- 84890495150
- Eigentriphones for context-dependent acoustic modeling
- T. Ko and B. Mak. Eigentriphones for context-dependent acoustic modeling. IEEE Transactions on Audio, Speech, and Language Processing, 21(6):1285-1294, 2013.
- (2013) IEEE Transactions on Audio Speech, and Language Processing , vol.21 , Issue.6 , pp. 1285-1294
- Ko, T.¹ Mak, B.²

198
- 84876231242
- Imagenet classification with deep convolutional neural networks
- A. Krizhevsky, I. Sutskever, and G. Hinton. Imagenet classification with deep convolutional neural networks. In Proceedings of Neural Information Processing Systems (NIPS). 2012.
- (2012) Proceedings of Neural Information Processing Systems (NIPS)
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.³

199
- 84878534913
- Integrating deep neural networks into structural classification approach based on weighted finite-state transducers
- Y. Kubo, T. Hori, and A. Nakamura. Integrating deep neural networks into structural classification approach based on weighted finite-state transducers. In Proceedings of Interspeech. 2012.
- (2012) Proceedings of Interspeech
- Kubo, Y.¹ Hori, T.² Nakamura, A.³

200
- 84876357166
- Viking Books, December
- R. Kurzweil. How to Create a Mind. Viking Books, December 2012.
- (2012) How to Create A Mind
- Kurzweil, R.¹

201
- 84887376692
- Cross-lingual automatic speech recognition using tandem features
- December
- P. Lal and S. King. Cross-lingual automatic speech recognition using tandem features. IEEE Transactions on Audio, Speech, and Language Processing, 21(12):2506-2515, December 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.12 , pp. 2506-2515
- Lal, P.¹ King, S.²

202
- 0025254722
- A time-delay neural network architecture for isolated word recognition
- K. Lang, A. Waibel, and G. Hinton. A time-delay neural network architecture for isolated word recognition. Neural Networks, 3(1):23-43, 1990.
- (1990) Neural Networks , vol.3 , Issue.1 , pp. 23-43
- Lang, K.¹ Waibel, A.² Hinton, G.³

203
- 56449110012
- Classification using discriminative restricted boltzmann machines
- H. Larochelle and Y. Bengio. Classification using discriminative restricted boltzmann machines. In Proceedings of International Conference on Machine Learning (ICML). 2008.
- (2008) Proceedings of International Conference on Machine Learning (ICML)
- Larochelle, H.¹ Bengio, Y.²

204
- 84893690785
- Emotion recognition from spontaneous speech using hidden markov models with deep belief networks
- D. Le and P. Mower. Emotion recognition from spontaneous speech using hidden markov models with deep belief networks. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Le, D.¹ Mower, P.²

205
- 80053276362
- Training continuous space language models: Some practical issues
- H. Le, A. Allauzen, G. Wisniewski, and F. Yvon. Training continuous space language models: Some practical issues. In Proceedings of Empirical Methods in Natural Language Processing (EMNLP), pages 778-788. 2010.
- (2010) Proceedings of Empirical Methods in Natural Language Processing (EMNLP) , pp. 778-788
- Le, H.¹ Allauzen, A.² Wisniewski, G.³ Yvon, F.⁴

206
- 80051619076
- Structured output layer neural network language model
- H. Le, I. Oparin, A. Allauzen, J. Gauvain, and F. Yvon. Structured output layer neural network language model. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2011.
- (2011) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Le, H.¹ Oparin, I.² Allauzen, A.³ Gauvain, J.⁴ Yvon, F.⁵

207
- 84869479578
- Structured output layer neural network language models for speech recognition
- January
- H. Le, I. Oparin, A. Allauzen, J.-L. Gauvain, and F. Yvon. Structured output layer neural network language models for speech recognition. IEEE Transactions on Audio, Speech, and Language Processing, 21(1):197-206, January 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.1 , pp. 197-206
- Le, H.¹ Oparin, I.² Allauzen, A.³ Gauvain, J.-L.⁴ Yvon, F.⁵

208
- 80053437034
- On optimization methods for deep learning
- Q. Le, J. Ngiam, A. Coates, A. Lahiri, B. Prochnow, and A. Ng. On optimization methods for deep learning. In Proceedings of International Conference on Machine Learning (ICML). 2011.
- (2011) Proceedings of International Conference on Machine Learning (ICML)
- Le, Q.¹ Ngiam, J.² Coates, A.³ Lahiri, A.⁴ Prochnow, B.⁵ Ng, A.⁶

209
- 84867135575
- Building high-level features using large scale unsupervised learning
- Q. Le, M. Ranzato, R. Monga, M. Devin, G. Corrado, K. Chen, J. Dean, and A. Ng. Building high-level features using large scale unsupervised learning. In Proceedings of International Conference on Machine Learning (ICML). 2012.
- (2012) Proceedings of International Conference on Machine Learning (ICML)
- Le, Q.¹ Ranzato, M.² Monga, R.³ Devin, M.⁴ Corrado, G.⁵ Chen, K.⁶ Dean, J.⁷ Ng, A.⁸

210
- 84873600957
- Learning invariant feature hierarchies
- Y. LeCun. Learning invariant feature hierarchies. In Proceedings of European Conference on Computer Vision (ECCV). 2012.
- (2012) Proceedings of European Conference on Computer Vision (ECCV)
- Lecun, Y.¹

211
- 0002263996
- Convolutional networks for images, speech, and time series
- In M. Arbib, editor MIT Press, Cambridge, Massachusetts
- Y. LeCun and Y. Bengio. Convolutional networks for images, speech, and time series. In M. Arbib, editor, The Handbook of Brain Theory and Neural Networks, pages 255-258. MIT Press, Cambridge, Massachusetts, 1995.
- (1995) The Handbook of Brain Theory and Neural Networks , pp. 255-258
- Lecun, Y.¹ Bengio, Y.²

212
- 0032203257
- Gradient-based learning applied to document recognition
- Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86:2278-2324, 1998.
- (1998) Proceedings of the IEEE , vol.86 , pp. 2278-2324
- Lecun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

213
- 51249093914
- Energy-based models in document recognition and computer vision
- Y. LeCun, S. Chopra, M. Ranzato, and F. Huang. Energy-based models in document recognition and computer vision. In Proceedings of International Conference on Document Analysis and Recognition (ICDAR). 2007.
- (2007) Proceedings of International Conference on Document Analysis and Recognition (ICDAR)
- Lecun, Y.¹ Chopra, S.² Ranzato, M.³ Huang, F.⁴

214
- 85009128804
- From knowledge-ignorant to knowledge-rich modeling: A new speech research paradigm for next-generation automatic speech recognition
- C.-H. Lee. From knowledge-ignorant to knowledge-rich modeling: A new speech research paradigm for next-generation automatic speech recognition. In Proceedings of International Conference on Spoken Language Processing (ICSLP), pages 109-111. 2004.
- (2004) Proceedings of International Conference on Spoken Language Processing (ICSLP) , pp. 109-111
- Lee, C.-H.¹

215
- 71149119164
- Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations
- H. Lee, R. Grosse, R. Ranganath, and A. Ng. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In Proceedings of International Conference on Machine Learning (ICML). 2009.
- (2009) Proceedings of International Conference on Machine Learning (ICML)
- Lee, H.¹ Grosse, R.² Ranganath, R.³ Ng, A.⁴

216
- 80053540444
- Unsupervised learning of hierarchical representations with convolutional deep belief networks
- October
- H. Lee, R. Grosse, R. Ranganath, and A. Ng. Unsupervised learning of hierarchical representations with convolutional deep belief networks. Communications of the Association for Computing Machinery (ACM), 54(10):95-103, October 2011.
- (2011) Communications of the Association for Computing Machinery (ACM) , vol.54 , Issue.10 , pp. 95-103
- Lee, H.¹ Grosse, R.² Ranganath, R.³ Ng, A.⁴

217
- 77956502334
- Unsupervised feature learning for audio classification using convolutional deep belief networks
- H. Lee, Y. Largman, P. Pham, and A. Ng. Unsupervised feature learning for audio classification using convolutional deep belief networks. In Proceedings of Neural Information Processing Systems (NIPS). 2010.
- (2010) Proceedings of Neural Information Processing Systems (NIPS)
- Lee, H.¹ Largman, Y.² Pham, P.³ Ng, A.⁴

218
- 84877785043
- Deep spatiotemporal architectures and learning for protein structure prediction
- P. Lena, K. Nagata, and P. Baldi. Deep spatiotemporal architectures and learning for protein structure prediction. In Proceedings of Neural Information Processing Systems (NIPS). 2012.
- (2012) Proceedings of Neural Information Processing Systems (NIPS)
- Lena, P.¹ Nagata, K.² Baldi, P.³

219
- 85015993006
- arXiv:1311.1761v1
- S. Levine. Exploring deep and recurrent architectures for optimal control. arXiv:1311.1761v1.
- Exploring Deep and Recurrent Architectures for Optimal Control
- Levine, S.¹

220
- 84897943848
- An overview of noise-robust automatic speech recognition
- J. Li, L. Deng, Y. Gong, and R. Haeb-Umbach. An overview of noise-robust automatic speech recognition. IEEE/Association for Computing Machinery (ACM) Transactions on Audio, Speech, and Language Processing, pages 1-33, 2014.
- (2014) IEEE/Association for Computing Machinery (ACM) Transactions on Audio, Speech, and Language Processing , pp. 1-33
- Li, J.¹ Deng, L.² Gong, Y.³ Haeb-Umbach, R.⁴

221
- 84874282188
- Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM
- J. Li, D. Yu, J. Huang, and Y. Gong. Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM. In Proceedings of IEEE Spoken Language Technology (SLT). 2012.
- (2012) Proceedings of IEEE Spoken Language Technology (SLT)
- Li, J.¹ Yu, D.² Huang, J.³ Gong, Y.⁴

222
- 84893307972
- Hybrid deep neural network- hidden markov model (DNN-HMM) based speech emotion recognition
- September 2013
- L. Li, Y. Zhao, D. Jiang, and Y. Zhang etc. Hybrid deep neural network- hidden markov model (DNN-HMM) based speech emotion recognition. In Proceedings Conference on Affective Computing and Intelligent Interaction (ACII), pages 312-317. September 2013.
- Proceedings Conference on Affective Computing and Intelligent Interaction (ACII) , pp. 312-317
- Li, L.¹ Zhao, Y.² Jiang, D.³ Zhang Etc, Y.⁴

223
- 84890521103
- Speaker adaptation of context dependent deep neural networks
- H. Liao. Speaker adaptation of context dependent deep neural networks. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Liao, H.¹

224
- 84893703162
- Large scale deep neural network acoustic modeling with semi-supervised training data for youtube video transcription
- H. Liao, E. McDermott, and A. Senior. Large scale deep neural network acoustic modeling with semi-supervised training data for youtube video transcription. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Liao, H.¹ McDermott, E.² Senior, A.³

225
- 70349220094
- A study on multilingual acoustic modeling for large vocabulary ASR
- H. Lin, L. Deng, D. Yu, Y. Gong, A. Acero, and C.-H. Lee. A study on multilingual acoustic modeling for large vocabulary ASR. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2009.
- (2009) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Lin, H.¹ Deng, L.² Yu, D.³ Gong, Y.⁴ Acero, A.⁵ Lee, C.-H.⁶

226
- 80052870284
- Large-scale image classification: Fast feature extraction and SVM training
- Y. Lin, F. Lv, S. Zhu, M. Yang, T. Cour, K. Yu, L. Cao, and T. Huang. Large-scale image classification: Fast feature extraction and SVM training. In Proceedings of Computer Vision and Pattern Recognition (CVPR). 2011.
- (2011) Proceedings of Computer Vision and Pattern Recognition (CVPR)
- Lin, Y.¹ Lv, F.² Zhu, S.³ Yang, M.⁴ Cour, T.⁵ Yu, K.⁶ Cao, L.⁷ Huang, T.⁸

227
- 84901237776
- Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis
- Z. Ling, L. Deng, and D. Yu. Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis. IEEE Transactions on Audio Speech Language Processing, 21(10):2129-2139, 2013.
- (2013) IEEE Transactions on Audio Speech Language Processing , vol.21 , Issue.10 , pp. 2129-2139
- Ling, Z.¹ Deng, L.² Yu, D.³

228
- 84890447002
- Modeling spectral envelopes using restricted boltzmann machines for statistical parametric speech synthesis
- Z. Ling, L. Deng, and D. Yu. Modeling spectral envelopes using restricted boltzmann machines for statistical parametric speech synthesis. In International Conference on Acoustics Speech and Signal Processing (ICASSP), pages 7825-7829. 2013.
- (2013) International Conference on Acoustics Speech and Signal Processing (ICASSP) , pp. 7825-7829
- Ling, Z.¹ Deng, L.² Yu, D.³

229
- 84869440340
- Articulatory control of HMMbased parametric speech synthesis using feature-space-switched multiple regression
- January
- Z. Ling, K. Richmond, and J. Yamagishi. Articulatory control of HMMbased parametric speech synthesis using feature-space-switched multiple regression. IEEE Transactions on Audio, Speech, and Language Processing, 21, January 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21
- Ling, Z.¹ Richmond, K.² Yamagishi, J.³

230
- 84880526709
- Joint uncertainty decoding for noise robust subspace gaussian mixture models
- L. Lu, K. Chin, A. Ghoshal, and S. Renals. Joint uncertainty decoding for noise robust subspace gaussian mixture models. IEEE Transactions on Audio, Speech, and Language Processing, 21(9):1791-1804, 2013.
- (2013) IEEE Transactions on Audio Speech, and Language Processing , vol.21 , Issue.9 , pp. 1791-1804
- Lu, L.¹ Chin, K.² Ghoshal, A.³ Renals, S.⁴

231
- 0001523807
- A path-stack algorithm for optimizing dynamic regimes in a statistical hidden dynamical model of speech
- J. Ma and L. Deng. A path-stack algorithm for optimizing dynamic regimes in a statistical hidden dynamical model of speech. Computer, Speech and Language, 2000.
- (2000) Computer, Speech and Language
- Ma, J.¹ Deng, L.²

232
- 0347968275
- Efficient decoding strategies for conversational speech recognition using a constrained nonlinear state-space model
- J. Ma and L. Deng. Efficient decoding strategies for conversational speech recognition using a constrained nonlinear state-space model. IEEE Transactions on Speech and Audio Processing, 11(6):590-602, 2003.
- (2003) IEEE Transactions on Speech and Audio Processing , vol.11 , Issue.6 , pp. 590-602
- Ma, J.¹ Deng, L.²

233
- 0742307392
- Target-directed mixture dynamic models for spontaneous speech recognition
- J. Ma and L. Deng. Target-directed mixture dynamic models for spontaneous speech recognition. IEEE Transactions on Speech and Audio Processing, 12(1):47-58, 2004.
- (2004) IEEE Transactions on Speech and Audio Processing , vol.12 , Issue.1 , pp. 47-58
- Ma, J.¹ Deng, L.²

234
- 84905286094
- Rectifier nonlinearities improve neural network acoustic models
- A. Maas, A. Hannun, and A. Ng. Rectifier nonlinearities improve neural network acoustic models. International Conference on Machine Learning (ICML) Workshop on Deep Learning for Audio, Speech, and Language Processing, 2013.
- (2013) International Conference on Machine Learning (ICML) Workshop on Deep Learning for Audio, Speech, and Language Processing
- Maas, A.¹ Hannun, A.² Ng, A.³

235
- 84878409063
- Recurrent neural networks for noise reduction in robust ASR
- A. Maas, Q. Le, T. O'Neil, O. Vinyals, P. Nguyen, and P. Ng. Recurrent neural networks for noise reduction in robust ASR. In Proceedings of Interspeech. 2012.
- (2012) Proceedings of Interspeech
- Maas, A.¹ Le, Q.² O'neil, T.³ Vinyals, O.⁴ Nguyen, P.⁵ Ng, P.⁶

236
- 34548080780
- Cambridge University Press
- C. Manning, P. Raghavan, and H. Schutze. Introduction to Information Retrieval. Cambridge University Press, 2009.
- (2009) Introduction to Information Retrieval
- Manning, C.¹ Raghavan, P.² Schutze, H.³

237
- 84903700854
- Scientists see promise in deep-learning programs
- November 24
- J. Markoff. Scientists see promise in deep-learning programs. New York Times, November 24 2012.
- (2012) New York Times
- Markoff, J.¹

238
- 77956541496
- Deep learning with hessian-free optimization
- J.Martens. Deep learning with hessian-free optimization. In Proceedings of International Conference on Machine Learning (ICML). 2010.
- (2010) Proceedings of International Conference on Machine Learning (ICML)
- Martens, J.¹

239
- 80053451847
- Learning recurrent neural networks with hessian-free optimization
- J. Martens and I. Sutskever. Learning recurrent neural networks with hessian-free optimization. In Proceedings of International Conference on Machine Learning (ICML). 2011.
- (2011) Proceedings of International Conference on Machine Learning (ICML)
- Martens, J.¹ Sutskever, I.²

240
- 84903702560
- ArXive1307.2118, July
- D. McAllester. A PAC-bayesian tutorial with a dropout bound. ArXive1307.2118, July 2013.
- (2013) A PAC-bayesian Tutorial with A Dropout Bound
- McAllester, D.¹

241
- 84871369973
- Learning lexicons from speech using a pronunciation mixture model
- February
- I. McGraw, I. Badr, and J. R. Glass. Learning lexicons from speech using a pronunciation mixture model. IEEE Transactions on Audio, Speech, and Language Processing, 21(2):357,366, February 2013.
- (2013) IEEE Transactions on Audio Speech, and Language Processing , vol.21 , Issue.2 , pp. 357-366
- McGraw, I.¹ Badr, I.² Glass, J.R.³

242
- 84906237242
- Investigation of recurrentneural- network architectures and learning methods for spoken language understanding
- G. Mesnil, X. He, L. Deng, and Y. Bengio. Investigation of recurrentneural- network architectures and learning methods for spoken language understanding. In Proceedings of Interspeech. 2013.
- (2013) Proceedings of Interspeech
- Mesnil, G.¹ He, X.² Deng, L.³ Bengio, Y.⁴

243
- 84906273501
- Improving low-resource CD-DNN-HMM using dropout and multilingual DNN training
- Y. Miao and F. Metze. Improving low-resource CD-DNN-HMM using dropout and multilingual DNN training. In Proceedings of Interspeech. 2013.
- (2013) Proceedings of Interspeech
- Miao, Y.¹ Metze, F.²

244
- 84893701756
- Deep maxout networks for low resource speech recognition
- Y. Miao, S. Rawat, and F. Metze. Deep maxout networks for low resource speech recognition. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Miao, Y.¹ Rawat, S.² Metze, F.³

245
- 84874250121
- Ph.D. thesis, Brno University of Technology
- T. Mikolov. Statistical language models based on neural networks. Ph.D. thesis, Brno University of Technology, 2012.
- (2012) Statistical Language Models Based on Neural Networks
- Mikolov, T.¹

246
- 85083951332
- Efficient estimation of word representations in vector space
- T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient estimation of word representations in vector space. In Proceedings of International Conference on Learning Representations (ICLR). 2013.
- (2013) Proceedings of International Conference on Learning Representations (ICLR)
- Mikolov, T.¹ Chen, K.² Corrado, G.³ Dean, J.⁴

247
- 84858966958
- Strategies for training large scale neural network language models
- T. Mikolov, A. Deoras, D. Povey, L. Burget, and J. Cernocky. Strategies for training large scale neural network language models. In Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). 2011.
- (2011) Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
- Mikolov, T.¹ Deoras, A.² Povey, D.³ Burget, L.⁴ Cernocky, J.⁵

248
- 79959829092
- Recurrent neural network based language model
- T. Mikolov, M. Karafiat, L. Burget, J. Cernocky, and S. Khudanpur. Recurrent neural network based language model. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP), pages 1045-1048. 2010.
- (2010) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP) , pp. 1045-1048
- Mikolov, T.¹ Karafiat, M.² Burget, L.³ Cernocky, J.⁴ Khudanpur, S.⁵

249
- 84899922637
- arXiv:1309.4168v1
- T. Mikolov, Q. Le, and I. Sutskever. Exploiting similarities among languages for machine translation. arXiv:1309.4168v1, 2013.
- (2013) Exploiting Similarities among Languages for Machine Translation
- Mikolov, T.¹ Le, Q.² Sutskever, I.³

250
- 84898956512
- Distributed representations of words and phrases and their compositionality
- T. Mikolov, I. Sutskever, K. Chen, G. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Mikolov, T.¹ Sutskever, I.² Chen, K.³ Corrado, G.⁴ Dean, J.⁵

251
- 0036293703
- A recognition method with parametric trajectory synthesized using direct relations between static and dynamic feature vector time series
- Y. Minami, E. McDermott, A. Nakamura, and S. Katagiri. A recognition method with parametric trajectory synthesized using direct relations between static and dynamic feature vector time series. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP), pages 957-960. 2002.
- (2002) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP) , pp. 957-960
- Minami, Y.¹ McDermott, E.² Nakamura, A.³ Katagiri, S.⁴

252
- 34547970628
- Three new graphical models for statistical language modeling
- A. Mnih and G. Hinton. Three new graphical models for statistical language modeling. In Proceedings of International Conference on Machine Learning (ICML), pages 641-648. 2007.
- (2007) Proceedings of International Conference on Machine Learning (ICML) , pp. 641-648
- Mnih, A.¹ Hinton, G.²

253
- 84858779990
- A scalable hierarchical distributed language model
- A. Mnih and G. Hinton. A scalable hierarchical distributed language model. In Proceedings of Neural Information Processing Systems (NIPS), pages 1081-1088. 2008.
- (2008) Proceedings of Neural Information Processing Systems (NIPS) , pp. 1081-1088
- Mnih, A.¹ Hinton, G.²

254
- 84898987069
- Learning word embeddings efficiently with noise-contrastive estimation
- A. Mnih and K. Kavukcuoglu. Learning word embeddings efficiently with noise-contrastive estimation. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Mnih, A.¹ Kavukcuoglu, K.²

255
- 84867118996
- A fast and simple algorithm for training neural probabilistic language models
- A. Mnih and W.-T. Teh. A fast and simple algorithm for training neural probabilistic language models. In Proceedings of International Conference on Machine Learning (ICML), pages 1751-1758. 2012.
- (2012) Proceedings of International Conference on Machine Learning (ICML) , pp. 1751-1758
- Mnih, A.¹ Teh, W.-T.²

256
- 84904867557
- Playing arari with deep reinforcement learning
- also arXiv:1312.5602v1
- V. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, and M. Riedmiller. Playing arari with deep reinforcement learning. Neural Information Processing Systems (NIPS) Deep Learning Workshop, 2013. also arXiv:1312.5602v1.
- (2013) Neural Information Processing Systems (NIPS) Deep Learning Workshop
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Graves, A.⁴ Antonoglou, I.⁵ Wierstra, D.⁶ Riedmiller, M.⁷

257
- 78649297301
- Deep belief networks for phone recognition
- A. Mohamed, G. Dahl, and G. Hinton. Deep belief networks for phone recognition. In Proceedings of Neural Information Processing Systems (NIPS) Workshop Deep Learning for Speech Recognition and Related Applications. 2009.
- (2009) Proceedings of Neural Information Processing Systems (NIPS) Workshop Deep Learning for Speech Recognition and Related Applications
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

258
- 84055211743
- Acoustic modeling using deep belief networks
- January
- A. Mohamed, G. Dahl, and G. Hinton. Acoustic modeling using deep belief networks. IEEE Transactions on Audio, Speech, & Language Processing, 20(1), January 2012.
- (2012) IEEE Transactions on Audio, Speech, & Language Processing , vol.20 , Issue.1
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

259
- 84867585919
- Understanding how deep belief networks perform acoustic modelling
- A. Mohamed, G. Hinton, and G. Penn. Understanding how deep belief networks perform acoustic modelling. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Mohamed, A.¹ Hinton, G.² Penn, G.³

260
- 79959840616
- Investigation of full-sequence training of deep belief networks for speech recognition
- A. Mohamed, D. Yu, and L. Deng. Investigation of full-sequence training of deep belief networks for speech recognition. In Proceedings of Interspeech. 2010.
- (2010) Proceedings of Interspeech
- Mohamed, A.¹ Yu, D.² Deng, L.³

261
- 84255177123
- Deep and wide: Multiple layers in automatic speech recognition
- January
- N. Morgan. Deep and wide: Multiple layers in automatic speech recognition. IEEE Transactions on Audio, Speech, & Language Processing, 20(1), January 2012.
- (2012) IEEE Transactions on Audio, Speech, & Language Processing , vol.20 , Issue.1
- Morgan, N.¹

262
- 85032751546
- Pushing the envelope- aside [speech recognition]
- September
- N. Morgan, Q. Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cretin, H. Bourlard, and M. Athineos. Pushing the envelope- aside [speech recognition]. IEEE Signal Processing Magazine, 22(5):81-88, September 2005.
- (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 81-88
- Morgan, N.¹ Zhu, Q.² Stolcke, A.³ Sonmez, K.⁴ Sivadas, S.⁵ Shinozaki, T.⁶ Ostendorf, M.⁷ Jain, P.⁸ Hermansky, H.⁹ Ellis, D.¹⁰ Doddington, G.¹¹ Chen, B.¹² Cretin, O.¹³ Bourlard, H.¹⁴ Athineos, M.¹⁵

263
- 34547997987
- Hierarchical probabilistic neural network language models
- F. Morin and Y. Bengio. Hierarchical probabilistic neural network language models. In Proceedings of Artificial Intelligence and Statistics (AISTATS). 2005.
- (2005) Proceedings of Artificial Intelligence and Statistics (AISTATS)
- Morin, F.¹ Bengio, Y.²

264
- 84857466151
- TheMIT Press
- K. Murphy. Machine Learning - A Probabilistic Perspective. TheMIT Press, 2012.
- (2012) Machine Learning - A Probabilistic Perspective
- Murphy, K.¹

265
- 78149306047
- 3-d object recognition with deep belief nets
- V. Nair and G. Hinton. 3-d object recognition with deep belief nets. In Proceedings of Neural Information Processing Systems (NIPS). 2009.
- (2009) Proceedings of Neural Information Processing Systems (NIPS)
- Nair, V.¹ Hinton, G.²

266
- 84906280857
- Voice conversion in high-order eigen space using deep belief nets
- T. Nakashika, R. Takashima, T. Takiguchi, and Y. Ariki. Voice conversion in high-order eigen space using deep belief nets. In Proceedings of Interspeech. 2013.
- (2013) Proceedings of Interspeech
- Nakashika, T.¹ Takashima, R.² Takiguchi, T.³ Ariki, Y.⁴

267
- 0032654483
- Speech translation: Coupling of recognition and translation
- H. Ney. Speech translation: Coupling of recognition and translation. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 1999.
- (1999) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Ney, H.¹

268
- 80053445973
- Learning deep energy models
- J. Ngiam, Z. Chen, P. Koh, and A. Ng. Learning deep energy models. In Proceedings of International Conference on Machine Learning (ICML). 2011.
- (2011) Proceedings of International Conference on Machine Learning (ICML)
- Ngiam, J.¹ Chen, Z.² Koh, P.³ Ng, A.⁴

269
- 80053437179
- Multimodal deep learning
- J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee, and A. Ng. Multimodal deep learning. In Proceedings of International Conference on Machine Learning (ICML). 2011.
- (2011) Proceedings of International Conference on Machine Learning (ICML)
- Ngiam, J.¹ Khosla, A.² Kim, M.³ Nam, J.⁴ Lee, H.⁵ Ng, A.⁶

270
- 84898979068
- arXiv:1312.5650v2
- M. Norouzi, T. Mikolov, S. Bengio, J. Shlens, A. Frome, G. Corrado, and J. Dean. Zero-shot learning by convex combination of semantic embeddings. arXiv:1312.5650v2, 2013.
- (2013) Zero-shot Learning by Convex Combination of Semantic Embeddings
- Norouzi, M.¹ Mikolov, T.² Bengio, S.³ Shlens, J.⁴ Frome, A.⁵ Corrado, G.⁶ Dean, J.⁷

271
- 4944221356
- Layered representations for learning and inferring office activity from multiple sensory channels
- N. Oliver, A. Garg, and E. Horvitz. Layered representations for learning and inferring office activity from multiple sensory channels. Computer Vision and Image Understanding, 96:163-180, 2004.
- (2004) Computer Vision and Image Understanding , vol.96 , pp. 163-180
- Oliver, N.¹ Garg, A.² Horvitz, E.³

272
- 84903723176
- Can 'deep learning' offer deep insights about visual representation?
- B. Olshausen. Can 'deep learning' offer deep insights about visual representation? Neural Information Processing Systems (NIPS) Workshop on Deep Learning and Unsupervised Feature Learning, 2012.
- (2012) Neural Information Processing Systems (NIPS) Workshop on Deep Learning and Unsupervised Feature Learning
- Olshausen, B.¹

273
- 4544293504
- Moving beyond the 'beads-on-a-string' model of speech
- M. Ostendorf. Moving beyond the 'beads-on-a-string' model of speech. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 1999.
- (1999) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Ostendorf, M.¹

274
- 0030245363
- From HMMs to segment models: A unified view of stochastic modeling for speech recognition
- September
- M. Ostendorf, V. Digalakis, and O. Kimball. From HMMs to segment models: A unified view of stochastic modeling for speech recognition. IEEE Transactions on Speech and Audio Processing, 4(5), September 1996.
- (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5
- Ostendorf, M.¹ Digalakis, V.² Kimball, O.³

275
- 80052069937
- Probabilistic template-based chord recognition
- November
- L. Oudre, C. Fevotte, and Y. Grenier. Probabilistic template-based chord recognition. IEEE Transactions on Audio, Speech, and Language Processing, 19(8):2249-2259, November 2011.
- (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.8 , pp. 2249-2259
- Oudre, L.¹ Fevotte, C.² Grenier, Y.³

276
- 84982839648
- Learning input and recurrent weight matrices in echo state networks
- December
- H. Palangi, L. Deng, and R. Ward. Learning input and recurrent weight matrices in echo state networks. Neural Information Processing Systems (NIPS) Deep Learning Workshop, December 2013.
- (2013) Neural Information Processing Systems (NIPS) Deep Learning Workshop
- Palangi, H.¹ Deng, L.² Ward, R.³

277
- 84890502600
- Using deep stacking network to improve structured compressive sensing with multiple measurement vectors
- H. Palangi, R. Ward, and L. Deng. Using deep stacking network to improve structured compressive sensing with multiple measurement vectors. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Palangi, H.¹ Ward, R.² Deng, L.³

278
- 69849103259
- Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition
- G. Papandreou, A. Katsamanis, V. Pitsikalis, and P. Maragos. Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17:423-435, 2009.
- (2009) IEEE Transactions on Audio Speech, and Language Processing , vol.17 , pp. 423-435
- Papandreou, G.¹ Katsamanis, A.² Pitsikalis, V.³ Maragos, P.⁴

279
- 85083951919
- How to construct deep recurrent neural networks
- R. Pascanu, C. Gulcehre, K. Cho, and Y. Bengio. How to construct deep recurrent neural networks. In Proceedings of International Conference on Learning Representations (ICLR). 2014.
- (2014) Proceedings of International Conference on Learning Representations (ICLR)
- Pascanu, R.¹ Gulcehre, C.² Cho, K.³ Bengio, Y.⁴

280
- 84897497795
- On the difficulty of training recurrent neural networks
- R. Pascanu, T. Mikolov, and Y. Bengio. On the difficulty of training recurrent neural networks. In Proceedings of International Conference on Machine Learning (ICML). 2013.
- (2013) Proceedings of International Conference on Machine Learning (ICML)
- Pascanu, R.¹ Mikolov, T.² Bengio, Y.³

281
- 84863373241
- Conditional neural fields
- J. Peng, L. Bo, and J. Xu. Conditional neural fields. In Proceedings of Neural Information Processing Systems (NIPS). 2009.
- (2009) Proceedings of Neural Information Processing Systems (NIPS)
- Peng, J.¹ Bo, L.² Xu, J.³

282
- 0032639922
- Initial evaluation of hidden dynamic models on conversational speech
- P. Picone, S. Pike, R. Regan, T. Kamm, J. bridle, L. Deng, Z. Ma, H. Richards, and M. Schuster. Initial evaluation of hidden dynamic models on conversational speech. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 1999.
- (1999) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Picone, P.¹ Pike, S.² Regan, R.³ Kamm, T.⁴ Bridle, J.⁵ Deng, L.⁶ Ma, Z.⁷ Richards, H.⁸ Schuster, M.⁹

283
- 78049251448
- Analysis of MLP-based hierarchical phone posterior probability estimators
- February
- J. Pinto, S. Garimella, M. Magimai-Doss, H. Hermansky, and H. Bourlard. Analysis of MLP-based hierarchical phone posterior probability estimators. IEEE Transactions on Audio, Speech, and Language Processing, 19(2), February 2011.
- (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.2
- Pinto, J.¹ Garimella, S.² Magimai-Doss, M.³ Hermansky, H.⁴ Bourlard, H.⁵

284
- 84867584652
- Improved pre-training of deep belief networks using sparse encoding symmetric machines
- C. Plahl, T. Sainath, B. Ramabhadran, and D. Nahamoo. Improved pre-training of deep belief networks using sparse encoding symmetric machines. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Plahl, C.¹ Sainath, T.² Ramabhadran, B.³ Nahamoo, D.⁴

285
- 79959844505
- Hierarchical bottleneck features for LVCSR
- C. Plahl, R. Schluter, and H. Ney. Hierarchical bottleneck features for LVCSR. In Proceedings of Interspeech. 2010.
- (2010) Proceedings of Interspeech
- Plahl, C.¹ Schluter, R.² Ney, H.³

286
- 0029310084
- Holographic reduced representations
- May
- T. Plate. Holographic reduced representations. IEEE Transactions on Neural Networks, 6(3):623-641, May 1995.
- (1995) IEEE Transactions on Neural Networks , vol.6 , Issue.3 , pp. 623-641
- Plate, T.¹

287
- 84903722546
- How the brain might work: The role of information and learning in understanding and replicating intelligence
- In G. Jacovitt, A. Pettorossi, R. Consolo, and V. Senni, editors Lateran University Press
- T. Poggio. How the brain might work: The role of information and learning in understanding and replicating intelligence. In G. Jacovitt, A. Pettorossi, R. Consolo, and V. Senni, editors, Information: Science and Technology for the New Century, pages 45-61. Lateran University Press, 2007.
- (2007) Information: Science and Technology for the New Century , pp. 45-61
- Poggio, T.¹

288
- 0025519291
- Recursive distributed representations
- J. Pollack. Recursive distributed representations. Artificial Intelligence, 46:77-105, 1990.
- (1990) Artificial Intelligence , vol.46 , pp. 77-105
- Pollack, J.¹

289
- 80053162579
- Sum-product networks: A new deep architecture
- H. Poon and P. Domingos. Sum-product networks: A new deep architecture. In Proceedings of Uncertainty in Artificial Intelligence. 2011.
- (2011) Proceedings of Uncertainty in Artificial Intelligence
- Poon, H.¹ Domingos, P.²

290
- 0036296863
- Minimum phone error and I-smoothing for improved discriminative training
- D. Povey and P. Woodland. Minimum phone error and I-smoothing for improved discriminative training. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2002.
- (2002) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Povey, D.¹ Woodland, P.²

291
- 78049406405
- Backpropagation training for multilayer conditional random field based phone recognition
- R. Prabhavalkar and E. Fosler-Lussier. Backpropagation training for multilayer conditional random field based phone recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2010.
- (2010) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Prabhavalkar, R.¹ Fosler-Lussier, E.²

292
- 0031003679
- Optimality: From neural networks to universal grammar
- A. Prince and P. Smolensky. Optimality: From neural networks to universal grammar. Science, 275:1604-1610, 1997.
- (1997) Science , vol.275 , pp. 1604-1610
- Prince, A.¹ Smolensky, P.²

293
- 0024610919
- A tutorial on hidden markov models and selected applications in speech recognition
- L. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. In Proceedings of the IEEE, pages 257-286. 1989.
- (1989) Proceedings of the IEEE , pp. 257-286
- Rabiner, L.¹

294
- 70049094447
- Sparse feature learning for deep belief networks
- M. Ranzato, Y. Boureau, and Y. LeCun. Sparse feature learning for deep belief networks. In Proceedings of Neural Information Processing Systems (NIPS). 2007.
- (2007) Proceedings of Neural Information Processing Systems (NIPS)
- Ranzato, M.¹ Boureau, Y.² Lecun, Y.³

295
- 51249093914
- Energy-based models in document recognition and computer vision
- M. Ranzato, S. Chopra, Y. LeCun, and F.-J. Huang. Energy-based models in document recognition and computer vision. In Proceedings of International Conference on Document Analysis and Recognition (ICDAR). 2007.
- (2007) Proceedings of International Conference on Document Analysis and Recognition (ICDAR)
- Ranzato, M.¹ Chopra, S.² Lecun, Y.³ Huang, F.-J.⁴

296
- 77955989954
- Modeling pixel means and covariances using factorized third-order boltzmann machines
- M. Ranzato and G. Hinton. Modeling pixel means and covariances using factorized third-order boltzmann machines. In Proceedings of Computer Vision and Pattern Recognition (CVPR). 2010.
- (2010) Proceedings of Computer Vision and Pattern Recognition (CVPR)
- Ranzato, M.¹ Hinton, G.²

297
- 85112276587
- Efficient learning of sparse representations with an energy-based model
- M. Ranzato, C. Poultney, S. Chopra, and Y. LeCun. Efficient learning of sparse representations with an energy-based model. In Proceedings of Neural Information Processing Systems (NIPS). 2006.
- (2006) Proceedings of Neural Information Processing Systems (NIPS)
- Ranzato, M.¹ Poultney, C.² Chopra, S.³ Lecun, Y.⁴

298
- 80052877144
- On deep generative models with applications to recognition
- M. Ranzato, J. Susskind, V. Mnih, and G. Hinton. On deep generative models with applications to recognition. In Proceedings of Computer Vision and Pattern Recognition (CVPR). 2011.
- (2011) Proceedings of Computer Vision and Pattern Recognition (CVPR)
- Ranzato, M.¹ Susskind, J.² Mnih, V.³ Hinton, G.⁴

299
- 0030419718
- Construction of state-dependent dynamic parameters by maximum likelihood: Applications to speech recognition
- C. Rathinavalu and L. Deng. Construction of state-dependent dynamic parameters by maximum likelihood: Applications to speech recognition. Signal Processing, 55(2):149-165, 1997.
- (1997) Signal Processing , vol.55 , Issue.2 , pp. 149-165
- Rathinavalu, C.¹ Deng, L.²

300
- 84867621164
- Factorial hidden restricted boltzmann machines for noise robust speech recognition
- S. Rennie, K. Fouset, and P. Dognin. Factorial hidden restricted boltzmann machines for noise robust speech recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Rennie, S.¹ Fouset, K.² Dognin, P.³

301
- 85032751986
- Single-channel multi-talker speech recognition - Graphical modeling approaches
- S. Rennie, H. Hershey, and P. Olsen. Single-channel multi-talker speech recognition - graphical modeling approaches. IEEE Signal Processing Magazine, 33:66-80, 2010.
- (2010) IEEE Signal Processing Magazine , vol.33 , pp. 66-80
- Rennie, S.¹ Hershey, H.² Olsen, P.³

302
- 84943274699
- A direct adaptive method for faster backpropagation learning: The RPROP algorithm
- M. Riedmiller and H. Braun. A direct adaptive method for faster backpropagation learning: The RPROP algorithm. In Proceedings of the IEEE International Conference on Neural Networks. 1993.
- (1993) Proceedings of the IEEE International Conference on Neural Networks
- Riedmiller, M.¹ Braun, H.²

303
- 80053460450
- Contractive autoencoders: Explicit invariance during feature extraction
- S. Rifai, P. Vincent, X. Muller, X. Glorot, and Y. Bengio. Contractive autoencoders: Explicit invariance during feature extraction. In Proceedings of International Conference on Machine Learning (ICML), pages 833-840. 2011.
- (2011) Proceedings of International Conference on Machine Learning (ICML) , pp. 833-840
- Rifai, S.¹ Vincent, P.² Muller, X.³ Glorot, X.⁴ Bengio, Y.⁵

304
- 0028392167
- An application of recurrent nets to phone probability estimation
- A. Robinson. An application of recurrent nets to phone probability estimation. IEEE Transactions on Neural Networks, 5:298-305, 1994.
- (1994) IEEE Transactions on Neural Networks , vol.5 , pp. 298-305
- Robinson, A.¹

305
- 84903710549
- arXiv: 1309.1508v3
- T. Sainath, L. Horesh, B. Kingsbury, A. Aravkin, and B. Ramabhadran. Accelerating hessian-free optimization for deep neural networks by implicit pre-conditioning and sampling. arXiv: 1309.1508v3, 2013.
- (2013) Accelerating Hessian-free Optimization for Deep Neural Networks by Implicit Pre-conditioning and Sampling
- Sainath, T.¹ Horesh, L.² Kingsbury, B.³ Aravkin, A.⁴ Ramabhadran, B.⁵

306
- 84893654379
- Improvements to deep convolutional neural networks for LVCSR
- T. Sainath, B. Kingsbury, A. Mohamed, G. Dahl, G. Saon, H. Soltau, T. Beran, A. Aravkin, and B. Ramabhadran. Improvements to deep convolutional neural networks for LVCSR. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Sainath, T.¹ Kingsbury, B.² Mohamed, A.³ Dahl, G.⁴ Saon, G.⁵ Soltau, H.⁶ Beran, T.⁷ Aravkin, A.⁸ Ramabhadran, B.⁹

307
- 84893688455
- Learning filter banks within a deep neural network framework
- T. Sainath, B. Kingsbury, A. Mohamed, and B. Ramabhadran. Learning filter banks within a deep neural network framework. In Proceedings of The Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Sainath, T.¹ Kingsbury, B.² Mohamed, A.³ Ramabhadran, B.⁴

308
- 84867593213
- Autoencoder bottleneck features using deep belief networks
- T. Sainath, B. Kingsbury, and B. Ramabhadran. Autoencoder bottleneck features using deep belief networks. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Sainath, T.¹ Kingsbury, B.² Ramabhadran, B.³

309
- 84858972572
- Making deep belief networks effective for large vocabulary continuous speech recognition
- T. Sainath, B. Kingsbury, B. Ramabhadran, P. Novak, and A. Mohamed. Making deep belief networks effective for large vocabulary continuous speech recognition. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2011.
- (2011) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Sainath, T.¹ Kingsbury, B.² Ramabhadran, B.³ Novak, P.⁴ Mohamed, A.⁵

310
- 84890454527
- Low-rank matrix factorization for deep neural network training with high-dimensional output targets
- T. Sainath, B. Kingsbury, V. Sindhwani, E. Arisoy, and B. Ramabhadran. Low-rank matrix factorization for deep neural network training with high-dimensional output targets. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Sainath, T.¹ Kingsbury, B.² Sindhwani, V.³ Arisoy, E.⁴ Ramabhadran, B.⁵

311
- 84886829539
- Optimization techniques to improve training speed of deep neural networks for large speech tasks
- November
- T. Sainath, B. Kingsbury, H. Soltau, and B. Ramabhadran. Optimization techniques to improve training speed of deep neural networks for large speech tasks. IEEE Transactions on Audio, Speech, and Language Processing, 21(11):2267-2276, November 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.11 , pp. 2267-2276
- Sainath, T.¹ Kingsbury, B.² Soltau, H.³ Ramabhadran, B.⁴

312
- 84890525984
- Convolutional neural networks for LVCSR
- T. Sainath, A. Mohamed, B. Kingsbury, and B. Ramabhadran. Convolutional neural networks for LVCSR. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Sainath, T.¹ Mohamed, A.² Kingsbury, B.³ Ramabhadran, B.⁴

313
- 80053610626
- Exemplar-based sparse representation features: From TIMIT to LVCSR
- November
- T. Sainath, B. Ramabhadran, M. Picheny, D. Nahamoo, and D. Kanevsky. Exemplar-based sparse representation features: From TIMIT to LVCSR. IEEE Transactions on Speech and Audio Processing, November 2011.
- (2011) IEEE Transactions on Speech and Audio Processing
- Sainath, T.¹ Ramabhadran, B.² Picheny, M.³ Nahamoo, D.⁴ Kanevsky, D.⁵

314
- 70350693506
- Semantic hashing
- R. Salakhutdinov and G. Hinton. Semantic hashing. In Proceedings of Special Interest Group on Information Retrieval (SIGIR) Workshop on Information Retrieval and Applications of Graphical Models. 2007.
- (2007) Proceedings of Special Interest Group on Information Retrieval (SIGIR) Workshop on Information Retrieval and Applications of Graphical Models
- Salakhutdinov, R.¹ Hinton, G.²

315
- 73249147662
- Deep boltzmann machines
- R. Salakhutdinov and G. Hinton. Deep boltzmann machines. In Proceedings of Artificial Intelligence and Statistics (AISTATS). 2009.
- (2009) Proceedings of Artificial Intelligence and Statistics (AISTATS)
- Salakhutdinov, R.¹ Hinton, G.²

316
- 84877755914
- A better way to pretrain deep boltzmann machines
- R. Salakhutdinov and G. Hinton. A better way to pretrain deep boltzmann machines. In Proceedings of Neural Information Processing Systems (NIPS). 2012.
- (2012) Proceedings of Neural Information Processing Systems (NIPS)
- Salakhutdinov, R.¹ Hinton, G.²

317
- 84893691530
- Speaker adaptation of neural network acoustic models using i-vectors
- G. Saon, H. Soltau, D. Nahamoo, and M. Picheny. Speaker adaptation of neural network acoustic models using i-vectors. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Saon, G.¹ Soltau, H.² Nahamoo, D.³ Picheny, M.⁴

318
- 80051625051
- Deep belief nets for natural language call-routing
- R. Sarikaya, G. Hinton, and B. Ramabhadran. Deep belief nets for natural language call-routing. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP), pages 5680-5683. 2011.
- (2011) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP) , pp. 5680-5683
- Sarikaya, R.¹ Hinton, G.² Ramabhadran, B.³

319
- 83455238990
- Learning emotion-based acoustic features with deep belief networks
- E. Schmidt and Y. Kim. Learning emotion-based acoustic features with deep belief networks. In Proceedings IEEE of Signal Processing to Audio and Acoustics. 2011.
- (2011) Proceedings IEEE of Signal Processing to Audio and Acoustics
- Schmidt, E.¹ Kim, Y.²

320
- 84905273821
- Continuous space translation models for phrase-based statistical machine translation
- H. Schwenk. Continuous space translation models for phrase-based statistical machine translation. In Proceedings of Computional Linguistics. 2012.
- (2012) Proceedings of Computional Linguistics
- Schwenk, H.¹

321
- 85045980083
- Large, pruned or continuous space language models on a gpu for statistical machine translation
- H. Schwenk, A. Rousseau, and A. Mohammed. Large, pruned or continuous space language models on a gpu for statistical machine translation. In Proceedings of the Joint Human Language Technology Conference and the North American Chapter of the Association of Computational Linguistics (HLT-NAACL) 2012 Workshop on the future of language modeling for Human Language Technology (HLT), pages 11-19.
- Proceedings of the Joint Human Language Technology Conference and the North American Chapter of the Association of Computational Linguistics (HLT-NAACL) 2012 Workshop on the Future of Language Modeling for Human Language Technology (HLT) , pp. 11-19
- Schwenk, H.¹ Rousseau, A.² Mohammed, A.³

322
- 84905269646
- On parallelizability of stochastic gradient descent for speech DNNs
- F. Seide, H. Fu, J. Droppo, G. Li, and D. Yu. On parallelizability of stochastic gradient descent for speech DNNs. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2014.
- (2014) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Seide, F.¹ Fu, H.² Droppo, J.³ Li, G.⁴ Yu, D.⁵

323
- 84858976070
- Feature engineering in contextdependent deep neural networks for conversational speech transcription
- F. Seide, G. Li, X. Chen, and D. Yu. Feature engineering in contextdependent deep neural networks for conversational speech transcription. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU), pages 24-29. 2011.
- (2011) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU) , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

324
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu. Conversational speech transcription using context-dependent deep neural networks. In Proceedings of Interspeech, pages 437-440. 2011.
- (2011) Proceedings of Interspeech , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

325
- 84890492030
- An investigation of deep neural networks for noise robust speech recognition
- M. Seltzer, D. Yu, and E. Wang. An investigation of deep neural networks for noise robust speech recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Seltzer, M.¹ Yu, D.² Wang, E.³

326
- 84872190545
- Autoregressive models for statistical parametric speech synthesis
- M. Shannon, H. Zen, and W. Byrne. Autoregressive models for statistical parametric speech synthesis. IEEE Transactions on Audio, Speech, Language Processing, 21(3):587-597, 2013.
- (2013) IEEE Transactions on Audio, Speech, Language Processing , vol.21 , Issue.3 , pp. 587-597
- Shannon, M.¹ Zen, H.² Byrne, W.³

327
- 0028195651
- Waveform-based speech recognition using hidden filter models: Parameter selection and sensitivity to power normalization
- H. Sheikhzadeh and L. Deng. Waveform-based speech recognition using hidden filter models: Parameter selection and sensitivity to power normalization. IEEE Transactions on on Speech and Audio Processing (ICASSP), 2:80-91, 1994.
- (1994) IEEE Transactions on on Speech and Audio Processing (ICASSP) , vol.2 , pp. 80-91
- Sheikhzadeh, H.¹ Deng, L.²

328
- 84990946747
- Learning semantic representations using convolutional neural networks for web search
- Y. Shen, X. He, J. Gao, L. Deng, and G. Mesnil. Learning semantic representations using convolutional neural networks for web search. In Proceedings World Wide Web. 2014.
- (2014) Proceedings World Wide Web
- Shen, Y.¹ He, X.² Gao, J.³ Deng, L.⁴ Mesnil, G.⁵

329
- 84897374881
- Deep fisher networks for large-scale image classification
- K. Simonyan, A. Vedaldi, and A. Zisserman. Deep fisher networks for large-scale image classification. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Simonyan, K.¹ Vedaldi, A.² Zisserman, A.³

330
- 84881054791
- Hermitian polynomial for speaker adaptation of connectionist speech recognition systems
- M. Siniscalchi, J. Li, and C. Lee. Hermitian polynomial for speaker adaptation of connectionist speech recognition systems. IEEE Transactions on Audio, Speech, and Language Processing, 21(10):2152-2161, 2013a.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.10 , pp. 2152-2161
- Siniscalchi, M.¹ Li, J.² Lee, C.³

331
- 84872967500
- A bottom-up modular search approach to large vocabulary continuous speech recognition
- M. Siniscalchi, T. Svendsen, and C.-H. Lee. A bottom-up modular search approach to large vocabulary continuous speech recognition. IEEE Transactions on Audio, Speech, Language Processing, 21, 2013.
- (2013) IEEE Transactions on Audio, Speech, Language Processing , vol.21
- Siniscalchi, M.¹ Svendsen, T.² Lee, C.-H.³

332
- 84875405186
- Exploiting deep neural networks for detection-based speech recognition
- M. Siniscalchi, D. Yu, L. Deng, and C.-H. Lee. Exploiting deep neural networks for detection-based speech recognition. Neurocomputing, 106:148-157, 2013.
- (2013) Neurocomputing , vol.106 , pp. 148-157
- Siniscalchi, M.¹ Yu, D.² Deng, L.³ Lee, C.-H.⁴

333
- 84873303660
- Speech recognition using long-span temporal patterns in a deep network model
- March
- M. Siniscalchi, D. Yu, L. Deng, and C.-H. Lee. Speech recognition using long-span temporal patterns in a deep network model. IEEE Signal Processing Letters, 20(3):201-204, March 2013.
- (2013) IEEE Signal Processing Letters , vol.20 , Issue.3 , pp. 201-204
- Siniscalchi, M.¹ Yu, D.² Deng, L.³ Lee, C.-H.⁴

334
- 84055212007
- Sparse multilayer perceptrons for phoneme recognition
- January
- G. Sivaram and H. Hermansky. Sparse multilayer perceptrons for phoneme recognition. IEEE Transactions on Audio, Speech, & Language Processing, 20(1), January 2012.
- (2012) IEEE Transactions on Audio, Speech, & Language Processing , vol.20 , Issue.1
- Sivaram, G.¹ Hermansky, H.²

335
- 0025516779
- Tensor product variable binding and the representation of symbolic structures in connectionist systems
- P. Smolensky. Tensor product variable binding and the representation of symbolic structures in connectionist systems. Artificial Intelligence, 46:159-216, 1990.
- (1990) Artificial Intelligence , vol.46 , pp. 159-216
- Smolensky, P.¹

336
- 33748699965
- The MIT Press, Cambridge, MA
- P. Smolensky and G. Legendre. The Harmonic Mind - From Neural Computation to Optimality-Theoretic Grammar. The MIT Press, Cambridge, MA, 2006.
- (2006) The Harmonic Mind - From Neural Computation to Optimality-Theoretic Grammar
- Smolensky, P.¹ Legendre, G.²

337
- 84869201485
- Practical bayesian optimization of machine learning algorithms
- J. Snoek, H. Larochelle, and R. Adams. Practical bayesian optimization of machine learning algorithms. In Proceedings of Neural Information Processing Systems (NIPS). 2012.
- (2012) Proceedings of Neural Information Processing Systems (NIPS)
- Snoek, J.¹ Larochelle, H.² Adams, R.³

338
- 84903722893
- New directions in deep learning: Structured models, tasks, and datasets
- R. Socher. New directions in deep learning: Structured models, tasks, and datasets. Neural Information Processing Systems (NIPS) Workshop on Deep Learning and Unsupervised Feature Learning, 2012.
- (2012) Neural Information Processing Systems (NIPS) Workshop on Deep Learning and Unsupervised Feature Learning
- Socher, R.¹

339
- 84905233165
- Tutorial at Association of Computational Logistics (ACL), 2012, and North American Chapter of the Association of Computational Linguistics (NAACL)
- R. Socher, Y. Bengio, and C. Manning. Deep learning for NLP. Tutorial at Association of Computational Logistics (ACL), 2012, and North American Chapter of the Association of Computational Linguistics (NAACL), 2013. http://www.socher.org/index.php/DeepLearning Tutorial.
- (2013) Deep Learning for NLP
- Socher, R.¹ Bengio, Y.² Manning, C.³

340
- 84898956227
- Reasoning with neural tensor networks for knowledge base completion
- R. Socher, D. Chen, C. Manning, and A. Ng. Reasoning with neural tensor networks for knowledge base completion. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Socher, R.¹ Chen, D.² Manning, C.³ Ng, A.⁴

341
- 77955998009
- Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora
- R. Socher and L. Fei-Fei. Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora. In Proceedings of Computer Vision and Pattern Recognition (CVPR). 2010.
- (2010) Proceedings of Computer Vision and Pattern Recognition (CVPR)
- Socher, R.¹ Fei-Fei, L.²

342
- 84898938559
- Zero-shot learning through cross-modal transfer
- R. Socher, M. Ganjoo, H. Sridhar, O. Bastani, C. Manning, and A. Ng. Zero-shot learning through cross-modal transfer. In Proceedings of Neural Information Processing Systems (NIPS). 2013b.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Socher, R.¹ Ganjoo, M.² Sridhar, H.³ Bastani, O.⁴ Manning, C.⁵ Ng, A.⁶

343
- 84928030723
- Grounded compositional semantics for finding and describing images with sentences
- R. Socher, Q. Le, C. Manning, and A. Ng. Grounded compositional semantics for finding and describing images with sentences. Neural Information Processing Systems (NIPS) Deep Learning Workshop, 2013c.
- (2013) Neural Information Processing Systems (NIPS) Deep Learning Workshop
- Socher, R.¹ Le, Q.² Manning, C.³ Ng, A.⁴

344
- 80053438267
- Parsing natural scenes and natural language with recursive neural networks
- R. Socher, C. Lin, A. Ng, and C. Manning. Parsing natural scenes and natural language with recursive neural networks. In Proceedings of International Conference on Machine Learning (ICML). 2011.
- (2011) Proceedings of International Conference on Machine Learning (ICML)
- Socher, R.¹ Lin, C.² Ng, A.³ Manning, C.⁴

345
- 85162476102
- Dynamic pooling and unfolding recursive autoencoders for paraphrase detection
- R. Socher, J. Pennington, E. Huang, A. Ng, and C. Manning. Dynamic pooling and unfolding recursive autoencoders for paraphrase detection. In Proceedings of Neural Information Processing Systems (NIPS). 2011.
- (2011) Proceedings of Neural Information Processing Systems (NIPS)
- Socher, R.¹ Pennington, J.² Huang, E.³ Ng, A.⁴ Manning, C.⁵

346
- 80053261327
- Semisupervised recursive autoencoders for predicting sentiment distributions
- R. Socher, J. Pennington, E. Huang, A. Ng, and C. Manning. Semisupervised recursive autoencoders for predicting sentiment distributions. In Proceedings of Empirical Methods in Natural Language Processing (EMNLP). 2011.
- (2011) Proceedings of Empirical Methods in Natural Language Processing (EMNLP)
- Socher, R.¹ Pennington, J.² Huang, E.³ Ng, A.⁴ Manning, C.⁵

347
- 84926358845
- Recursive deep models for semantic compositionality over a sentiment treebank
- R. Socher, A. Perelygin, J. Wu, J. Chuang, C. Manning, A. Ng, and C. Potts. Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of Empirical Methods in Natural Language Processing (EMNLP). 2013.
- (2013) Proceedings of Empirical Methods in Natural Language Processing (EMNLP)
- Socher, R.¹ Perelygin, A.² Wu, J.³ Chuang, J.⁴ Manning, C.⁵ Ng, A.⁶ Potts, C.⁷

348
- 84877724347
- Multimodal learning with deep boltzmann machines
- N. Srivastava and R. Salakhutdinov. Multimodal learning with deep boltzmann machines. In Proceedings of Neural Information Processing Systems (NIPS). 2012.
- (2012) Proceedings of Neural Information Processing Systems (NIPS)
- Srivastava, N.¹ Salakhutdinov, R.²

349
- 84898957541
- Discriminative transfer learning with tree-based priors
- N. Srivastava and R. Salakhutdinov. Discriminative transfer learning with tree-based priors. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Srivastava, N.¹ Salakhutdinov, R.²

350
- 84898978862
- Compete to compute
- R. Srivastava, J. Masci, S. Kazerounian, F. Gomez, and J. Schmidhuber. Compete to compute. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Srivastava, R.¹ Masci, J.² Kazerounian, S.³ Gomez, F.⁴ Schmidhuber, J.⁵

351
- 85073226083
- Preliminary investigation of boltzmann machine classifiers for speaker recognition
- T. Stafylakis, P. Kenny, M. Senoussaoui, and P. Dumouchel. Preliminary investigation of boltzmann machine classifiers for speaker recognition. In Proceedings of Odyssey, pages 109-116. 2012.
- (2012) Proceedings of Odyssey , pp. 109-116
- Stafylakis, T.¹ Kenny, P.² Senoussaoui, M.³ Dumouchel, P.⁴

352
- 84883148756
- Empirical risk minimization of graphical model parameters given approximate inference, decoding, and model structure
- V. Stoyanov, A. Ropson, and J. Eisner. Empirical risk minimization of graphical model parameters given approximate inference, decoding, and model structure. In Proceedings of Artificial Intelligence and Statistics (AISTATS). 2011.
- (2011) Proceedings of Artificial Intelligence and Statistics (AISTATS)
- Stoyanov, V.¹ Ropson, A.² Eisner, J.³

353
- 84890543852
- Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription
- H. Su, G. Li, D. Yu, and F. Seide. Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Su, H.¹ Li, G.² Yu, D.³ Seide, F.⁴

354
- 33750560608
- Multi-sensory speech processing: Incorporating automatically extracted hidden dynamic information
- Amsterdam, July
- A. Subramanya, L. Deng, Z. Liu, and Z. Zhang. Multi-sensory speech processing: Incorporating automatically extracted hidden dynamic information. In Proceedings of IEEE International Conference on Multimedia & Expo (ICME). Amsterdam, July 2005.
- (2005) Proceedings of IEEE International Conference on Multimedia & Expo (ICME)
- Subramanya, A.¹ Deng, L.² Liu, Z.³ Zhang, Z.⁴

355
- 0036165806
- An overlapping-feature based phonological model incorporating linguistic constraints: Applications to speech recognition
- J. Sun and L. Deng. An overlapping-feature based phonological model incorporating linguistic constraints: Applications to speech recognition. Journal on Acoustical Society of America, 111(2):1086-1101, 2002.
- (2002) Journal on Acoustical Society of America , vol.111 , Issue.2 , pp. 1086-1101
- Sun, J.¹ Deng, L.²

356
- 84884966819
- Ph.D. Thesis, University of Toronto
- I. Sutskever. Training recurrent neural networks. Ph.D. Thesis, University of Toronto, 2013.
- (2013) Training Recurrent Neural Networks
- Sutskever, I.¹

357
- 80053459857
- Generating text with recurrent neural networks
- I. Sutskever, J. Martens, and G. Hinton. Generating text with recurrent neural networks. In Proceedings of International Conference on Machine Learning (ICML). 2011.
- (2011) Proceedings of International Conference on Machine Learning (ICML)
- Sutskever, I.¹ Martens, J.² Hinton, G.³

358
- 77956542104
- Deep networks for robust visual recognition
- Y. Tang and C. Eliasmith. Deep networks for robust visual recognition. In Proceedings of International Conference on Machine Learning (ICML). 2010.
- (2010) Proceedings of International Conference on Machine Learning (ICML)
- Tang, Y.¹ Eliasmith, C.²

359
- 84898947294
- NIPS
- Y. Tang and R. Salakhutdinov. Learning Stochastic Feedforward Neural Networks. NIPS, 2013.
- (2013) Learning Stochastic Feedforward Neural Networks
- Tang, Y.¹ Salakhutdinov, R.²

360
- 51949119257
- Small codes and large image databases for recognition
- A. Tarralba, R. Fergus, and Y. Weiss. Small codes and large image databases for recognition. In Proceedings of Computer Vision and Pattern Recognition (CVPR). 2008.
- (2008) Proceedings of Computer Vision and Pattern Recognition (CVPR)
- Tarralba, A.¹ Fergus, R.² Weiss, Y.³

361
- 84864026688
- Modeling human motion using binary latent variables
- G. Taylor, G. E. Hinton, and S. Roweis. Modeling human motion using binary latent variables. In Proceedings of Neural Information Processing Systems (NIPS). 2007.
- (2007) Proceedings of Neural Information Processing Systems (NIPS)
- Taylor, G.¹ Hinton, G.E.² Roweis, S.³

362
- 84890474716
- Deep neural network features and semi-supervised training for low resource speech recognition
- S. Thomas, M. Seltzer, K. Church, and H. Hermansky. Deep neural network features and semi-supervised training for low resource speech recognition. In Proceedings of Interspeech. 2013.
- (2013) Proceedings of Interspeech
- Thomas, S.¹ Seltzer, M.² Church, K.³ Hermansky, H.⁴

363
- 56449086223
- Training restricted boltzmann machines using approximations to the likelihood gradient
- T. Tieleman. Training restricted boltzmann machines using approximations to the likelihood gradient. In Proceedings of International Conference on Machine Learning (ICML). 2008.
- (2008) Proceedings of International Conference on Machine Learning (ICML)
- Tieleman, T.¹

364
- 84876687945
- Speech synthesis based on hidden markov models
- K. Tokuda, Y. Nankaku, T. Toda, H. Zen, H. Yamagishi, and K. Oura. Speech synthesis based on hidden markov models. Proceedings of the IEEE, 101(5):1234-1252, 2013.
- (2013) Proceedings of the IEEE , vol.101 , Issue.5 , pp. 1234-1252
- Tokuda, K.¹ Nankaku, Y.² Toda, T.³ Zen, H.⁴ Yamagishi, H.⁵ Oura, K.⁶

365
- 84886714036
- Acoustic modeling with hierarchical reservoirs
- November
- F. Triefenbach, A. Jalalvand, K. Demuynck, and J.-P.Martens. Acoustic modeling with hierarchical reservoirs. IEEE Transactions on Audio, Speech, and Language Processing, 21(11):2439-2450, November 2013.
- (2013) IEEE Transactions on Audio Speech, and Language Processing , vol.21 , Issue.11 , pp. 2439-2450
- Triefenbach, F.¹ Jalalvand, A.² Demuynck, K.³ Martens, J.-P.⁴

366
- 84867605416
- Towards deep understanding: Deep convex networks for semantic utterance classification
- G. Tur, L. Deng, D. Hakkani-Tur, and X. He. Towards deep understanding: Deep convex networks for semantic utterance classification. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Tur, G.¹ Deng, L.² Hakkani-Tur, D.³ He, X.⁴

367
- 80053495924
- Word representations: A simple and general method for semi-supervised learning
- J. Turian, L. Ratinov, and Y. Bengio. Word representations: A simple and general method for semi-supervised learning. In Proceedings of Association for Computational Linguistics (ACL). 2010.
- (2010) Proceedings of Association for Computational Linguistics (ACL)
- Turian, J.¹ Ratinov, L.² Bengio, Y.³

368
- 84878403164
- Context-dependent MLPs for LVCSR: TANDEM, hybrid or both?
- Z. Tuske, M. Sundermeyer, R. Schluter, and H. Ney. Context-dependent MLPs for LVCSR: TANDEM, hybrid or both? In Proceedings of Interspeech. 2012.
- (2012) Proceedings of Interspeech
- Tuske, Z.¹ Sundermeyer, M.² Schluter, R.³ Ney, H.⁴

369
- 84874282835
- A deep neural network for acoustic-articulatory speech inversion
- B. Uria, S. Renals, and K. Richmond. A deep neural network for acoustic-articulatory speech inversion. Neural Information Processing Systems (NIPS) Workshop on Deep Learning and Unsupervised Feature Learning, 2011.
- (2011) Neural Information Processing Systems (NIPS) Workshop on Deep Learning and Unsupervised Feature Learning
- Uria, B.¹ Renals, S.² Richmond, K.³

370
- 79951668781
- Extended VTS for noise-robust speech recognition
- R. van Dalen and M. Gales. Extended VTS for noise-robust speech recognition. IEEE Transactions on Audio, Speech, and Language Processing, 19(4):733-743, 2011.
- (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.4 , pp. 733-743
- Van Dalen, R.¹ Gales, M.²

371
- 84898973716
- Deep content-based music recommendation
- A. van den Oord, S. Dieleman, and B. Schrauwen. Deep content-based music recommendation. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Oord Den A.Van¹ Dieleman, S.² Schrauwen, B.³

372
- 84957551591
- Speaker recognition by means of deep belief networks
- V. Vasilakakis, S. Cumani, and P. Laface. Speaker recognition by means of deep belief networks. In Proceedings of Biometric Technologies in Forensic Science. 2013.
- (2013) Proceedings of Biometric Technologies in Forensic Science
- Vasilakakis, V.¹ Cumani, S.² Laface, P.³

373
- 84906274730
- Sequence-discriminative training of deep neural networks
- K. Vesely, A. Ghoshal, L. Burget, and D. Povey. Sequence-discriminative training of deep neural networks. In Proceedings of Interspeech. 2013.
- (2013) Proceedings of Interspeech
- Vesely, K.¹ Ghoshal, A.² Burget, L.³ Povey, D.⁴

374
- 84893650076
- Semi-supervised training of deep neural networks
- K. Vesely, M. Hannemann, and L. Burget. Semi-supervised training of deep neural networks. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Vesely, K.¹ Hannemann, M.² Burget, L.³

375
- 79959575293
- A connection between score matching and denoising autoencoder
- P. Vincent. A connection between score matching and denoising autoencoder. Neural Computation, 23(7):1661-1674, 2011.
- (2011) Neural Computation , vol.23 , Issue.7 , pp. 1661-1674
- Vincent, P.¹

376
- 79551480483
- Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
- P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P. Manzagol. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. Journal of Machine Learning Research, 11:3371-3408, 2010.
- (2010) Journal of Machine Learning Research , vol.11 , pp. 3371-3408
- Vincent, P.¹ Larochelle, H.² Lajoie, I.³ Bengio, Y.⁴ Manzagol, P.⁵

377
- 84877777313
- Learning with recursive perceptual representations
- O. Vinyals, Y. Jia, L. Deng, and T. Darrell. Learning with recursive perceptual representations. In Proceedings of Neural Information Processing Systems (NIPS). 2012.
- (2012) Proceedings of Neural Information Processing Systems (NIPS)
- Vinyals, O.¹ Jia, Y.² Deng, L.³ Darrell, T.⁴

378
- 84867614640
- Krylov subspace descent for deep learning
- O. Vinyals and D. Povey. Krylov subspace descent for deep learning. In Proceedings of Artificial Intelligence and Statistics (AISTATS). 2012.
- (2012) Proceedings of Artificial Intelligence and Statistics (AISTATS)
- Vinyals, O.¹ Povey, D.²

379
- 80051644173
- Comparing multilayer perceptron to deep belief network tandem features for robust ASR
- O. Vinyals and S. Ravuri. Comparing multilayer perceptron to deep belief network tandem features for robust ASR. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2011.
- (2011) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Vinyals, O.¹ Ravuri, S.²

380
- 84867626068
- Revisiting recurrent neural networks for robust ASR
- O. Vinyals, S. Ravuri, and D. Povey. Revisiting recurrent neural networks for robust ASR. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Vinyals, O.¹ Ravuri, S.² Povey, D.³

381
- 84896538964
- Dropout training as adaptive regularization
- S. Wager, S. Wang, and P. Liang. Dropout training as adaptive regularization. In Proceedings of Neural Information Processing Systems (NIPS). 2013.
- (2013) Proceedings of Neural Information Processing Systems (NIPS)
- Wager, S.¹ Wang, S.² Liang, P.³

382
- 0024634603
- Phoneme recognition using time-delay neural networks
- A.Waibel, T. Hanazawa, G. Hinton, K. Shikano, and K. Lang. Phoneme recognition using time-delay neural networks. IEEE Transactions on Acoustical Speech, and Signal Processing, 37:328-339, 1989.
- (1989) IEEE Transactions on Acoustical Speech, and Signal Processing , vol.37 , pp. 328-339
- Waibel, A.¹ Hanazawa, T.² Hinton, G.³ Shikano, K.⁴ Lang, K.⁵

383
- 84893699565
- Context-dependent modelling of deep neural network using logistic regression
- G. Wang and K. Sim. Context-dependent modelling of deep neural network using logistic regression. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2013.
- (2013) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Wang, G.¹ Sim, K.²

384
- 84916199887
- Regression-based context-dependent modeling of deep neural networks for speech recognition
- G. Wang and K. Sim. Regression-based context-dependent modeling of deep neural networks for speech recognition. IEEE/Association for Computing Machinery (ACM) Transactions on Audio, Speech, and Language Processing, 2014.
- (2014) IEEE/Association for Computing Machinery (ACM) Transactions on Audio, Speech, and Language Processing
- Wang, G.¹ Sim, K.²

385
- 85083951533
- An empirical analysis of dropout in piecewise linear networks
- D. Warde-Farley, I. Goodfellow, A. Courville, and Y. Bengi. An empirical analysis of dropout in piecewise linear networks. In Proceedings of International Conference on Learning Representations (ICLR). 2014.
- (2014) Proceedings of International Conference on Learning Representations (ICLR)
- Warde-Farley, D.¹ Goodfellow, I.² Courville, A.³ Bengi, Y.⁴

386
- 84899000641
- Exponential family harmoniums with an application to information retrieval
- M. Welling, M. Rosen-Zvi, and G. Hinton. Exponential family harmoniums with an application to information retrieval. In Proceedings of Neural Information Processing Systems (NIPS). 2005.
- (2005) Proceedings of Neural Information Processing Systems (NIPS)
- Welling, M.¹ Rosen-Zvi, M.² Hinton, G.³

387
- 84905269210
- Single-channel mixed speech recognition using deep neural networks
- C.Weng, D. Yu, M. Seltzer, and J. Droppo. Single-channel mixed speech recognition using deep neural networks. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2014.
- (2014) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Weng, C.¹ Yu, D.² Seltzer, M.³ Droppo, J.⁴

388
- 77955654853
- Large scale image annotation: Learning to rank with joint word-image embeddings
- J. Weston, S. Bengio, and N. Usunier. Large scale image annotation: Learning to rank with joint word-image embeddings. Machine Learning, 81(1):21-35, 2010.
- (2010) Machine Learning , vol.81 , Issue.1 , pp. 21-35
- Weston, J.¹ Bengio, S.² Usunier, N.³

389
- 84867117593
- Wsabie: Scaling up to large vocabulary image annotation
- J. Weston, S. Bengio, and N. Usunier. Wsabie: Scaling up to large vocabulary image annotation. In Proceedings of International Joint Conference on Artificial Intelligence (IJCAI). 2011.
- (2011) Proceedings of International Joint Conference on Artificial Intelligence (IJCAI)
- Weston, J.¹ Bengio, S.² Usunier, N.³

390
- 84906237512
- Investigations on hessian-free optimization for cross-entropy training of deep neural networks
- S.Wiesler, J. Li, and J. Xue. Investigations on hessian-free optimization for cross-entropy training of deep neural networks. In Proceedings of Interspeech. 2013.
- (2013) Proceedings of Interspeech
- Wiesler, S.¹ Li, J.² Xue, J.³

391
- 79951599228
- A probabilistic interaction model for multi-pitch tracking with factorial hidden markov model
- May
- M. Wohlmayr, M. Stark, and F. Pernkopf. A probabilistic interaction model for multi-pitch tracking with factorial hidden markov model. IEEE Transactions on Audio, Speech, and Language Processing, 19(4), May 2011.
- (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.4
- Wohlmayr, M.¹ Stark, M.² Pernkopf, F.³

392
- 0026692226
- Stacked generalization
- D. Wolpert. Stacked generalization. Neural Networks, 5(2):241-259, 1992.
- (1992) Neural Networks , vol.5 , Issue.2 , pp. 241-259
- Wolpert, D.¹

393
- 84887037596
- Optimization algorithms and applications for speech and language processing
- November
- S. J. Wright, D. Kanevsky, L. Deng, X. He, G. Heigold, and H. Li. Optimization algorithms and applications for speech and language processing. IEEE Transactions on Audio, Speech, and Language Processing, 21(11):2231-2243, November 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.11 , pp. 2231-2243
- Wright, S.J.¹ Kanevsky, D.² Deng, L.³ He, X.⁴ Heigold, G.⁵ Li, H.⁶

394
- 85032751865
- A geometric perspective of large-margin training of gaussian models
- November
- L. Xiao and L. Deng. A geometric perspective of large-margin training of gaussian models. IEEE Signal Processing Magazine, 27(6):118-123, November 2010.
- (2010) IEEE Signal Processing Magazine , vol.27 , Issue.6 , pp. 118-123
- Xiao, L.¹ Deng, L.²

395
- 0037313081
- Equivalence of backpropagation and contrastive hebbian learning in a layered network
- X. Xie and S. Seung. Equivalence of backpropagation and contrastive hebbian learning in a layered network. Neural computation, 15:441-454, 2003.
- (2003) Neural Computation , vol.15 , pp. 441-454
- Xie, X.¹ Seung, S.²

396
- 84889257121
- An experimental study on speech enhancement based on deep neural networks
- Y. Xu, J. Du, L. Dai, and C. Lee. An experimental study on speech enhancement based on deep neural networks. IEEE Signal Processing Letters, 21(1):65-68, 2014.
- (2014) IEEE Signal Processing Letters , vol.21 , Issue.1 , pp. 65-68
- Xu, Y.¹ Du, J.² Dai, L.³ Lee, C.⁴

397
- 84906227589
- Restructuring of deep neural network acoustic models with singular value decomposition
- J. Xue, J. Li, and Y. Gong. Restructuring of deep neural network acoustic models with singular value decomposition. In Proceedings of Interspeech. 2013.
- (2013) Proceedings of Interspeech
- Xue, J.¹ Li, J.² Gong, Y.³

398
- 66149085249
- An integrative and discriminative technique for spoken utterance classification
- S. Yamin, L. Deng, Y.Wang, and A. Acero. An integrative and discriminative technique for spoken utterance classification. IEEE Transactions on Audio, Speech, and Language Processing, 16:1207-1214, 2008.
- (2008) IEEE Transactions on Audio, Speech, and Language Processing , vol.16 , pp. 1207-1214
- Yamin, S.¹ Deng, L.² Wang, Y.³ Acero, A.⁴

399
- 84906225757
- A scalable approach to using DNN-derived features in GMM-HMM based acoustic modeling for LVCSR
- Z. Yan, Q. Huo, and J. Xu. A scalable approach to using DNN-derived features in GMM-HMM based acoustic modeling for LVCSR. In Proceedings of Interspeech. 2013.
- (2013) Proceedings of Interspeech
- Yan, Z.¹ Huo, Q.² Xu, J.³

400
- 84866881711
- Combining a two-step CRF model and a joint source-channel model for machine transliteration
- D. Yang and S. Furui. Combining a two-step CRF model and a joint source-channel model for machine transliteration. In Proceedings of Association for Computational Linguistics (ACL), pages 275-280. 2010.
- (2010) Proceedings of Association for Computational Linguistics (ACL) , pp. 275-280
- Yang, D.¹ Furui, S.²

401
- 84903733224
- A fast maximum likelihood nonlinear feature transformation method for GMM-HMM speaker adaptation
- K. Yao, D. Yu, L. Deng, and Y. Gong. A fast maximum likelihood nonlinear feature transformation method for GMM-HMM speaker adaptation. Neurocomputing, 2013a.
- (2013) Neurocomputing
- Yao, K.¹ Yu, D.² Deng, L.³ Gong, Y.⁴

402
- 84874226579
- Adaptation of context-dependent deep neural networks for automatic speech recognition
- K. Yao, D. Yu, F. Seide, H. Su, L. Deng, and Y. Gong. Adaptation of context-dependent deep neural networks for automatic speech recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Yao, K.¹ Yu, D.² Seide, F.³ Su, H.⁴ Deng, L.⁵ Gong, Y.⁶

403
- 84904483474
- Recurrent neural networks for language understanding
- K. Yao, G. Zweig, M. Hwang, Y. Shi, and D. Yu. Recurrent neural networks for language understanding. In Proceedings of Interspeech. 2013.
- (2013) Proceedings of Interspeech
- Yao, K.¹ Zweig, G.² Hwang, M.³ Shi, Y.⁴ Yu, D.⁵

404
- 84881043147
- Noise model transfer: Novel approach to robustness against nonstationary noise
- T. Yoshioka and T. Nakatani. Noise model transfer: Novel approach to robustness against nonstationary noise. IEEE Transactions on Audio, Speech, and Language Processing, 21(10):2182-2192, 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.10 , pp. 2182-2192
- Yoshioka, T.¹ Nakatani, T.²

405
- 84903743887
- Investigation of unsupervised adaptation of DNN acoustic models with filter bank input
- T. Yoshioka, A. Ragni, and M. Gales. Investigation of unsupervised adaptation of DNN acoustic models with filter bank input. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Yoshioka, T.¹ Ragni, A.² Gales, M.³

406
- 33644756784
- On the convergence of markovian stochastic algorithms with rapidly decreasing ergodicity rates
- L. Younes. On the convergence of markovian stochastic algorithms with rapidly decreasing ergodicity rates. Stochastics and Stochastic Reports, 65(3):177-228, 1999.
- (1999) Stochastics and Stochastic Reports , vol.65 , Issue.3 , pp. 177-228
- Younes, L.¹

407
- 84890447334
- Factorized deep neural networks for adaptive speech recognition
- March
- D. Yu, X. Chen, and L. Deng. Factorized deep neural networks for adaptive speech recognition. International Workshop on Statistical Machine Learning for Speech Processing, March 2012b.
- (2012) International Workshop on Statistical Machine Learning for Speech Processing
- Yu, D.¹ Chen, X.² Deng, L.³

408
- 79959858900
- Learning in the deep-structured conditional random fields
- D. Yu, D. Deng, and S. Wang. Learning in the deep-structured conditional random fields. Neural Information Processing Systems (NIPS) 2009 Workshop on Deep Learning for Speech Recognition and Related Applications, 2009.
- (2009) Neural Information Processing Systems (NIPS) 2009 Workshop on Deep Learning for Speech Recognition and Related Applications
- Yu, D.¹ Deng, D.² Wang, S.³

409
- 85032752267
- Solving nonlinear estimation problems using splines
- July
- D. Yu and L. Deng. Solving nonlinear estimation problems using splines. IEEE Signal Processing Magazine, 26(4):86-90, July 2009.
- (2009) IEEE Signal Processing Magazine , vol.26 , Issue.4 , pp. 86-90
- Yu, D.¹ Deng, L.²

410
- 79959828814
- Deep-structured hidden conditional random fields for phonetic recognition
- September
- D. Yu and L. Deng. Deep-structured hidden conditional random fields for phonetic recognition. In Proceedings of Interspeech. September 2010.
- (2010) Proceedings of Interspeech
- Yu, D.¹ Deng, L.²

411
- 84865770736
- Accelerated parallelizable neural networks learning algorithms for speech recognition
- D. Yu and L. Deng. Accelerated parallelizable neural networks learning algorithms for speech recognition. In Proceedings of Interspeech. 2011.
- (2011) Proceedings of Interspeech
- Yu, D.¹ Deng, L.²

412
- 85032782045
- Deep learning and its applications to signal and information processing
- January
- D. Yu and L. Deng. Deep learning and its applications to signal and information processing. IEEE Signal Processing Magazine, pages 145-154, January 2011.
- (2011) IEEE Signal Processing Magazine , pp. 145-154
- Yu, D.¹ Deng, L.²

413
- 84862822032
- Efficient and effective algorithms for training singlehidden- layer neural networks
- D. Yu and L. Deng. Efficient and effective algorithms for training singlehidden- layer neural networks. Pattern Recognition Letters, 33:554-558, 2012.
- (2012) Pattern Recognition Letters , vol.33 , pp. 554-558
- Yu, D.¹ Deng, L.²

414
- 84055163920
- Roles of pre-training and fine-tuning in context-dependent DBN-HMMs for real-world speech recognition
- December
- D. Yu, L. Deng, and G. E. Dahl. Roles of pre-training and fine-tuning in context-dependent DBN-HMMs for real-world speech recognition. Neural Information Processing Systems (NIPS) 2010 Workshop on Deep Learning and Unsupervised Feature Learning, December 2010.
- (2010) Neural Information Processing Systems (NIPS) 2010 Workshop on Deep Learning and Unsupervised Feature Learning
- Yu, D.¹ Deng, L.² Dahl, G.E.³

415
- 66149101303
- Robust speech recognition using cepstral minimum-mean-square-error noise suppressor
- July
- D. Yu, L. Deng, J. Droppo, J. Wu, Y. Gong, and A. Acero. Robust speech recognition using cepstral minimum-mean-square-error noise suppressor. IEEE Transactions on Audio, Speech, and Language Processing, 16(5), July 2008.
- (2008) IEEE Transactions on Audio, Speech, and Language Processing , vol.16 , Issue.5
- Yu, D.¹ Deng, L.² Droppo, J.³ Wu, J.⁴ Gong, Y.⁵ Acero, A.⁶

416
- 68549140008
- A novel framework and training algorithm for variable-parameter hidden markov models
- D. Yu, L. Deng, Y. Gong, and A. Acero. A novel framework and training algorithm for variable-parameter hidden markov models. IEEE Transactions on Audio, Speech and Language Processing, 17(7):1348-1360, 2009.
- (2009) IEEE Transactions on Audio, Speech and Language Processing , vol.17 , Issue.7 , pp. 1348-1360
- Yu, D.¹ Deng, L.² Gong, Y.³ Acero, A.⁴

417
- 42949105203
- Large-margin minimum classification error training: A theoretical risk minimization perspective
- October
- D. Yu, L. Deng, X. He, and A. Acero. Large-margin minimum classification error training: A theoretical risk minimization perspective. Computer Speech and Language, 22(4):415-429, October 2008.
- (2008) Computer Speech and Language , vol.22 , Issue.4 , pp. 415-429
- Yu, D.¹ Deng, L.² He, X.³ Acero, A.⁴

418
- 34547526577
- Large-margin minimum classification error training for large-scale speech recognition tasks
- D. Yu, L. Deng, X. He, and X. Acero. Large-margin minimum classification error training for large-scale speech recognition tasks. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2007.
- (2007) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Yu, D.¹ Deng, L.² He, X.³ Acero, X.⁴

419
- 84867789985
- U.S. Patent Filing, November
- D. Yu, L. Deng, G. Li, and F. Seide. Discriminative pretraining of deep neural networks. U.S. Patent Filing, November 2011.
- (2011) Discriminative Pretraining of Deep Neural Networks
- Yu, D.¹ Deng, L.² Li, G.³ Seide, F.⁴

420
- 70349197671
- Cross-lingual speech recognition under runtime resource constraints
- D. Yu, L. Deng, P. Liu, J. Wu, Y. Gong, and A. Acero. Cross-lingual speech recognition under runtime resource constraints. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2009b.
- (2009) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Yu, D.¹ Deng, L.² Liu, P.³ Wu, J.⁴ Gong, Y.⁵ Acero, A.⁶

421
- 84878405171
- Large vocabulary speech recognition using deep tensor neural networks
- D. Yu, L. Deng, and F. Seide. Large vocabulary speech recognition using deep tensor neural networks. In Proceedings of Interspeech. 2012c.
- (2012) Proceedings of Interspeech
- Yu, D.¹ Deng, L.² Seide, F.³

422
- 84871387302
- The deep tensor neural network with applications to large vocabulary speech recognition
- D. Yu, L. Deng, and F. Seide. The deep tensor neural network with applications to large vocabulary speech recognition. IEEE Transactions on Audio, Speech, and Language Processing, 21(2):388-396, 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.2 , pp. 388-396
- Yu, D.¹ Deng, L.² Seide, F.³

423
- 85008521116
- Calibration of confidence measures in speech recognition
- D. Yu, J.-Y. Li, and L. Deng. Calibration of confidence measures in speech recognition. IEEE Transactions on Audio, Speech and Language, 19:2461-2473, 2010.
- (2010) IEEE Transactions on Audio, Speech and Language , vol.19 , pp. 2461-2473
- Yu, D.¹ Li, J.-Y.² Deng, L.³

424
- 84867606668
- Exploiting sparseness in deep neural networks for large vocabulary speech recognition
- D. Yu, F. Seide, G. Li, and L. Deng. Exploiting sparseness in deep neural networks for large vocabulary speech recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Yu, D.¹ Seide, F.² Li, G.³ Deng, L.⁴

425
- 84865785753
- Improved bottleneck features using pre-trained deep neural networks
- D. Yu and M. Seltzer. Improved bottleneck features using pre-trained deep neural networks. In Proceedings of Interspeech. 2011.
- (2011) Proceedings of Interspeech
- Yu, D.¹ Seltzer, M.²

426
- 85083953021
- Feature learning in deep neural networks - Studies on speech recognition
- D. Yu, M. Seltzer, J. Li, J.-T. Huang, and F. Seide. Feature learning in deep neural networks - studies on speech recognition. In Proceedings of International Conference on Learning Representations (ICLR). 2013.
- (2013) Proceedings of International Conference on Learning Representations (ICLR)
- Yu, D.¹ Seltzer, M.² Li, J.³ Huang, J.-T.⁴ Seide, F.⁵

427
- 84867329143
- Boosting attribute and phone estimation accuracies with deep neural networks for detectionbased speech recognition
- D. Yu, S. Siniscalchi, L. Deng, and C. Lee. Boosting attribute and phone estimation accuracies with deep neural networks for detectionbased speech recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2012.
- (2012) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Yu, D.¹ Siniscalchi, S.² Deng, L.³ Lee, C.⁴

428
- 78649308591
- Sequential labeling using deep-structured conditional random fields
- D. Yu, S. Wang, and L. Deng. Sequential labeling using deep-structured conditional random fields. Journal of Selected Topics in Signal Processing, 4:965-973, 2010.
- (2010) Journal of Selected Topics in Signal Processing , vol.4 , pp. 965-973
- Yu, D.¹ Wang, S.² Deng, L.³

429
- 78049409409
- Language recognition using deep-structured conditional random fields
- D. Yu, S. Wang, Z. Karam, and L. Deng. Language recognition using deep-structured conditional random fields. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP), pages 5030-5033. 2010.
- (2010) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP) , pp. 5030-5033
- Yu, D.¹ Wang, S.² Karam, Z.³ Deng, L.⁴

430
- 84890542079
- KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition
- D. Yu, K. Yao, H. Su, G. Li, and F. Seide. KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Yu, D.¹ Yao, K.² Su, H.³ Li, G.⁴ Seide, F.⁵

431
- 65249094352
- Unsupervised adaptation with discriminative mapping transforms
- K. Yu, M. Gales, and P. Woodland. Unsupervised adaptation with discriminative mapping transforms. IEEE Transactions on Audio, Speech, and Language Processing, 17(4):714-723, 2009.
- (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , Issue.4 , pp. 714-723
- Yu, K.¹ Gales, M.² Woodland, P.³

432
- 80052889296
- Learning image representations from the pixel level via hierarchical sparse coding
- K. Yu, Y. Lin, and H. Lafferty. Learning image representations from the pixel level via hierarchical sparse coding. In Proceedings Computer Vision and Pattern Recognition (CVPR). 2011.
- (2011) Proceedings Computer Vision and Pattern Recognition (CVPR)
- Yu, K.¹ Lin, Y.² Lafferty, H.³

433
- 84866855842
- Fast evaluation of connectionist language models
- F. Zamora-Martinez, M. Castro-Bleda, and S. Espana-Boquera. Fast evaluation of connectionist language models. International Conference on Artificial Neural Networks, pages 144-151, 2009.
- (2009) International Conference on Artificial Neural Networks , pp. 144-151
- Zamora-Martinez, F.¹ Castro-Bleda, M.² Espana-Boquera, S.³

434
- 84903745553
- Ph.D. Thesis, New York University, January
- M. Zeiler. Hierarchical convolutional deep learning in computer vision. Ph.D. Thesis, New York University, January 2014.
- (2014) Hierarchical Convolutional Deep Learning in Computer Vision
- Zeiler, M.¹

435
- 85083954484
- Stochastic pooling for regularization of deep convolutional neural networks
- M. Zeiler and R. Fergus. Stochastic pooling for regularization of deep convolutional neural networks. In Proceedings of International Conference on Learning Representations (ICLR). 2013.
- (2013) Proceedings of International Conference on Learning Representations (ICLR)
- Zeiler, M.¹ Fergus, R.²

436
- 84903712031
- arXiv:1311.2901
- M. Zeiler and R. Fergus. Visualizing and understanding convolutional networks. arXiv:1311.2901, pages 1-11, 2013.
- (2013) Visualizing and Understanding Convolutional Networks , pp. 1-11
- Zeiler, M.¹ Fergus, R.²

437
- 84856686379
- Adaptive deconvolutional networks for mid and high level feature learning
- M. Zeiler, G. Taylor, and R. Fergus. Adaptive deconvolutional networks for mid and high level feature learning. In Proceedings of International Conference on Computer vision (ICCV). 2011.
- (2011) Proceedings of International Conference on Computer Vision (ICCV)
- Zeiler, M.¹ Taylor, G.² Fergus, R.³

438
- 85008525798
- Product of experts for statistical parametric speech synthesis
- March
- H. Zen, M. Gales, J. F. Nankaku, and Y. K. Tokuda. Product of experts for statistical parametric speech synthesis. IEEE Transactions on Audio, Speech, and Language Processing, 20(3):794-805,March 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.3 , pp. 794-805
- Zen, H.¹ Gales, M.² Nankaku, J.F.³ Tokuda, Y.K.⁴

439
- 78149260085
- Continuous stochastic feature mapping based on trajectory HMMs
- February
- H. Zen, Y. Nankaku, and K. Tokuda. Continuous stochastic feature mapping based on trajectory HMMs. IEEE Transactions on Audio, Speech, and Language Processings, 19(2):417-430, February 2011.
- (2011) IEEE Transactions on Audio, Speech, and Language Processings , vol.19 , Issue.2 , pp. 417-430
- Zen, H.¹ Nankaku, Y.² Tokuda, K.³

440
- 84890490547
- Statistical parametric speech synthesis using deep neural networks
- H. Zen, A. Senior, and M. Schuster. Statistical parametric speech synthesis using deep neural networks. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP), pages 7962-7966. 2013.
- (2013) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP) , pp. 7962-7966
- Zen, H.¹ Senior, A.² Schuster, M.³

441
- 84905239342
- Improving deep neural network acoustic models using generalized maxout networks
- X. Zhang, J. Trmal, D. Povey, and S. Khudanpur. Improving deep neural network acoustic models using generalized maxout networks. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2014.
- (2014) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Zhang, X.¹ Trmal, J.² Povey, D.³ Khudanpur, S.⁴

442
- 84872300403
- Deep belief networks based voice activity detection
- X. Zhang and J. Wu. Deep belief networks based voice activity detection. IEEE Transactions on Audio, Speech, and Language Processing, 21(4):697-710, 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.4 , pp. 697-710
- Zhang, X.¹ Wu, J.²

443
- 4544290173
- Multi-sensory microphones for robust speech detection, enhancement and recognition
- Z. Zhang, Z. Liu, M. Sinclair, A. Acero, L. Deng, J. Droppo, X. Huang, and Y. Zheng. Multi-sensory microphones for robust speech detection, enhancement and recognition. In Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP). 2004.
- (2004) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Zhang, Z.¹ Liu, Z.² Sinclair, M.³ Acero, A.⁴ Deng, L.⁵ Droppo, J.⁶ Huang, X.⁷ Zheng, Y.⁸

444
- 84865208051
- Nonlinear compensation using the gauss-newton method for noise-robust speech recognition
- Y. Zhao and B. Juang. Nonlinear compensation using the gauss-newton method for noise-robust speech recognition. IEEE Transactions on Audio, Speech, and Language Processing, 20(8):2191-2206, 2012.
- (2012) IEEE Transactions on Audio Speech, and Language Processing , vol.20 , Issue.8 , pp. 2191-2206
- Zhao, Y.¹ Juang, B.²

445
- 84926285904
- Bilingual word embeddings for phrase-based machine translation
- W. Zou, R. Socher, D. Cer, and C. Manning. Bilingual word embeddings for phrase-based machine translation. In Proceedings of Empirical Methods in Natural Language Processing (EMNLP). 2013.
- (2013) Proceedings of Empirical Methods in Natural Language Processing (EMNLP)
- Zou, W.¹ Socher, R.² Cer, D.³ Manning, C.⁴

446
- 77949370075
- A segmental CRF approach to large vocabulary continuous speech recognition
- G. Zweig and P. Nguyen. A segmental CRF approach to large vocabulary continuous speech recognition. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 2009.
- (2009) Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU)
- Zweig, G.¹ Nguyen, P.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.