-
1
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
Geoffrey Hinton, Li Deng, Dong Yu, George E Dahl, Abdelrahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara N Sainath, et al., "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
Signal Processing Magazine, IEEE
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
-
2
-
-
84879854889
-
Representation learning: A review and new perspectives
-
Yoshua Bengio, Aaron Courville, and Pascal Vincent, "Representation learning: A review and new perspectives," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 35, no. 8, pp. 1798-1828, 2013.
-
(2013)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.35
, Issue.8
, pp. 1798-1828
-
-
Bengio, Y.1
Courville, A.2
Vincent, P.3
-
3
-
-
84864073449
-
Greedy layer-wise training of deep networks
-
Yoshua Bengio, Pascal Lamblin, Dan Popovici, Hugo Larochelle, et al., "Greedy layer-wise training of deep networks," Advances in neural information processing systems, vol. 19, pp. 153, 2007.
-
(2007)
Advances in Neural Information Processing Systems
, vol.19
, pp. 153
-
-
Bengio, Y.1
Lamblin, P.2
Popovici, D.3
Larochelle, H.4
-
4
-
-
84943645147
-
-
arXiv preprint arXiv:1409.5185
-
Chen-Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, and Zhuowen Tu, "Deeply-supervised nets," arXiv preprint arXiv:1409.5185, 2014.
-
(2014)
Deeply-supervised Nets
-
-
Lee, C.1
Xie, S.2
Gallagher, P.3
Zhang, Z.4
Tu, Z.5
-
5
-
-
84892421248
-
-
arXiv preprint arXiv:1302.4389
-
Ian J Goodfellow, David Warde-Farley, Mehdi Mirza, Aaron Courville, and Yoshua Bengio, "Maxout networks," arXiv preprint arXiv:1302.4389, 2013.
-
(2013)
Maxout Networks
-
-
Goodfellow, I.J.1
Warde-Farley, D.2
Mirza, M.3
Courville, A.4
Bengio, Y.5
-
6
-
-
84964544562
-
-
arXiv preprint arXiv:1412.6550
-
Adriana Romero, Nicolas Ballas, Samira Ebrahimi Kahou, Antoine Chassang, Carlo Gatta, and Yoshua Bengio, "Fitnets: Hints for thin deep nets," arXiv preprint arXiv:1412.6550, 2014.
-
(2014)
Fitnets: Hints for Thin Deep Nets
-
-
Romero, A.1
Ballas, N.2
Ebrahimi Kahou, S.3
Chassang, A.4
Gatta, C.5
Bengio, Y.6
-
7
-
-
84867606668
-
Exploiting sparseness in deep neural networks for large vocabulary speech recognition
-
Dong Yu, Frank Seide, Gang Li, and Li Deng, "Exploiting sparseness in deep neural networks for large vocabulary speech recognition," in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on. IEEE, 2012, pp. 4409-4412.
-
(2012)
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference On. IEEE
, pp. 4409-4412
-
-
Yu, D.1
Seide, F.2
Li, G.3
Deng, L.4
-
8
-
-
84906227589
-
Restructuring of deep neural network acoustic models with singular value decomposition
-
Jian Xue, Jinyu Li, and Yifan Gong, "Restructuring of deep neural network acoustic models with singular value decomposition," in INTERSPEECH, 2013, pp. 2365-2369.
-
(2013)
INTERSPEECH
, pp. 2365-2369
-
-
Xue, J.1
Li, J.2
Gong, Y.3
-
9
-
-
6344222337
-
DARPA TIMIT acoustic-phonetic continuous speech corpus CD-ROM. NIST speech disc 1-1.1
-
John S Garofolo, Lori F Lamel, William M Fisher, Jonathan G Fiscus, and David S Pallett, "DARPA TIMIT acoustic-phonetic continuous speech corpus CD-ROM. NIST speech disc 1-1.1," NASA STI/Recon Technical Report N, vol. 93, pp. 27403, 1993.
-
(1993)
NASA STI/Recon Technical Report N
, vol.93
, pp. 27403
-
-
Garofolo, J.S.1
Lamel, L.F.2
Fisher, W.M.3
Fiscus, J.G.4
Pallett, D.S.5
-
11
-
-
84890494546
-
Deep stacking networks for information retrieval
-
Li Deng, Xiaodong He, and Jianfeng Gao, "Deep stacking networks for information retrieval," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 3153-3157.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. IEEE
, pp. 3153-3157
-
-
Deng, L.1
He, X.2
Gao, J.3
-
12
-
-
84928146953
-
-
Tech. Rep. MSR, Microsoft Research, 2014
-
Dong Yu, Adam Eversole, Mike Seltzer, Kaisheng Yao, Zhiheng Huang, Brian Guenter, Oleksii Kuchaiev, Yu Zhang, Frank Seide, Huaming Wang, et al., "An introduction to computational networks and the computational network toolkit," Tech. Rep. MSR, Microsoft Research, 2014, http://codebox/cntk.
-
(2014)
An Introduction to Computational Networks and the Computational Network Toolkit
-
-
Yu, D.1
Eversole, A.2
Seltzer, M.3
Yao, K.4
Huang, Z.5
Guenter, B.6
Kuchaiev, O.7
Zhang, Y.8
Seide, F.9
Wang, H.10
-
14
-
-
47749152568
-
The rich transcription 2007 meeting recognition evaluation
-
Springer
-
Jonathan G Fiscus, Jerome Ajot, and John S Garofolo, "The rich transcription 2007 meeting recognition evaluation," in Multimodal Technologies for Perception of Humans, pp. 373-389. Springer, 2008.
-
(2008)
Multimodal Technologies for Perception of Humans
, pp. 373-389
-
-
Fiscus, J.G.1
Ajot, J.2
Garofolo, J.S.3
-
15
-
-
84893704659
-
Hybrid acoustic models for distant and multichannel large vocabulary speech recognition
-
Pawel Swietojanski, Arnab Ghoshal, and Steve Renals, "Hybrid acoustic models for distant and multichannel large vocabulary speech recognition," in Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on. IEEE, 2013, pp. 285-290.
-
(2013)
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop On. IEEE
, pp. 285-290
-
-
Swietojanski, P.1
Ghoshal, A.2
Renals, S.3
-
16
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
Geoffrey E Hinton, Simon Osindero, and Yee-Whye Teh, "A fast learning algorithm for deep belief nets," Neural computation, vol. 18, no. 7, pp. 1527-1554, 2006.
-
(2006)
Neural Computation
, vol.18
, Issue.7
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.3
-
17
-
-
85008520364
-
Transcribing meetings with the amida systems
-
Thomas Hain, Lukáš Burget, John Dines, Philip N Garner, Frantisek Grézl, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiat, Mike Lincoln, and Vincent Wan, "Transcribing meetings with the amida systems," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 2, pp. 486-498, 2012.
-
(2012)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.20
, Issue.2
, pp. 486-498
-
-
Hain, T.1
Burget, L.2
Dines, J.3
Garner, P.N.4
Grézl, F.5
El Hannani, A.6
Huijbregts, M.7
Karafiat, M.8
Lincoln, M.9
Wan, V.10