-
1
-
-
84876231242
-
ImageNet classification with deep convolutional neural networks
-
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton, "ImageNet classification with deep convolutional neural networks," in NIPS, 2012, pp. 1097-1105
-
(2012)
NIPS
, pp. 1097-1105
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
2
-
-
84994264999
-
Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks
-
Dimitri Palaz, Ronan Collobert, and Mathew Magimai Doss, "Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks," Interspeech, 2014
-
(2014)
Interspeech
-
-
Palaz, D.1
Collobert, R.2
Magimai Doss, M.3
-
3
-
-
84910065702
-
Acoustic modeling with deep neural networks using raw time signal for LVCSR
-
Singapore, Sept
-
Zoltán Tüske, Pavel Golik, Ralf Schlüter, and Hermann Ney, "Acoustic modeling with deep neural networks using raw time signal for LVCSR," in Interspeech, Singapore, Sept. 2014
-
(2014)
Interspeech
-
-
Tüske, Z.1
Golik, P.2
Schlüter, R.3
Ney, H.4
-
4
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
Geoffrey Hinton, Li Deng, Dong Yu, George E Dahl, Abdelrahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara N Sainath, et al., "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, 2012
-
(2012)
Signal Processing Magazine, IEEE
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
et al.
-
5
-
-
80051609011
-
Learning a better representation of speech soundwaves using restricted Boltzmann machines
-
Navdeep Jaitly and Geoffrey Hinton, "Learning a better representation of speech soundwaves using restricted Boltzmann machines," in ICASSP. IEEE, 2011, pp. 5884-5887
-
(2011)
ICASSP. IEEE
, pp. 5884-5887
-
-
Jaitly, N.1
Hinton, G.2
-
6
-
-
84893622444
-
The REVERB challenge: A common evaluation framework for dereverberation and recognition of reverberant speech
-
Keisuke Kinoshita, Marc Delcroix, Takuya Yoshioka, Tomohiro Nakatani, Armin Sehr, Walter Kellermann, and Roland Maas, "The REVERB challenge: A common evaluation framework for dereverberation and recognition of reverberant speech," in WASPAA. IEEE, 2013, pp. 1-4
-
(2013)
WASPAA. IEEE
, pp. 1-4
-
-
Kinoshita, K.1
Delcroix, M.2
Yoshioka, T.3
Nakatani, T.4
Sehr, A.5
Kellermann, W.6
Maas, R.7
-
7
-
-
84890541701
-
The second CHiME speech separation and recognition challenge: Datasets, tasks and baselines
-
Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, and Marco Matassoni, "The second CHiME speech separation and recognition challenge: Datasets, tasks and baselines," in ICASSP. IEEE, 2013, pp. 126-130
-
(2013)
ICASSP. IEEE
, pp. 126-130
-
-
Vincent, E.1
Barker, J.2
Watanabe, S.3
Le Roux, J.4
Nesta, F.5
Matassoni, M.6
-
8
-
-
84933559263
-
Linear prediction-based dereverberation with advanced speech enhancement and recognition technologies for the reverb challenge
-
Marc Delcroix, Takuya Yoshioka, Atsunori Ogawa, Yotaro Kubo, Masakiyo Fujimoto, Nobutaka Ito, Keisuke Kinoshita, Miquel Espi, Takaaki Hori, Tomohiro Nakatani, and Atsushi Nakamura, "Linear prediction-based dereverberation with advanced speech enhancement and recognition technologies for the reverb challenge," in REVERB Workshop, 2014
-
(2014)
REVERB Workshop
-
-
Delcroix, M.1
Yoshioka, T.2
Ogawa, A.3
Kubo, Y.4
Fujimoto, M.5
Ito, N.6
Kinoshita, K.7
Espi, M.8
Hori, T.9
Nakatani, T.10
Nakamura, A.11
-
9
-
-
80052067786
-
Reverberant speech segregation based on multipitch tracking and classification
-
Zhaozhang Jin and DeLiang Wang, "Reverberant speech segregation based on multipitch tracking and classification," IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 8, pp. 2328-2337, 2011
-
(2011)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.19
, Issue.8
, pp. 2328-2337
-
-
Jin, Z.1
Wang, D.2
-
11
-
-
84893688455
-
Learning filter banks within a deep neural network framework
-
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, and Bhuvana Ramabhadran, "Learning filter banks within a deep neural network framework," in ASRU. IEEE, 2013, pp. 297-302
-
(2013)
ASRU. IEEE
, pp. 297-302
-
-
Sainath, T.N.1
Kingsbury, B.2
Mohamed, A.-R.3
Ramabhadran, B.4
-
12
-
-
0020596154
-
Cepstral analysis synthesis on the mel frequency scale
-
Satoshi Imai, "Cepstral analysis synthesis on the mel frequency scale," in ICASSP. IEEE, 1983, vol. 8, pp. 93-96
-
(1983)
ICASSP. IEEE
, vol.8
, pp. 93-96
-
-
Imai, S.1
-
13
-
-
84893704659
-
Hybrid acoustic models for distant and multichannel large vocabulary speech recognition
-
Pawel Swietojanski, Arnab Ghoshal, and Steve Renals, "Hybrid acoustic models for distant and multichannel large vocabulary speech recognition," in ASRU. IEEE, 2013, pp. 285-290
-
(2013)
ASRU. IEEE
, pp. 285-290
-
-
Swietojanski, P.1
Ghoshal, A.2
Renals, S.3
-
16
-
-
34547539413
-
Gammatone features and feature combination for large vocabulary speech recognition
-
Ralf Schlüter, Ilja Bezrukov, Hermann Wagner, and Hermann Ney, "Gammatone features and feature combination for large vocabulary speech recognition," in ICASSP. IEEE, 2007, vol. 4, pp. IV-649
-
(2007)
ICASSP. IEEE
, vol.4
, pp. IV-649
-
-
Schlüter, R.1
Bezrukov, I.2
Wagner, H.3
Ney, H.4
-
17
-
-
84885728886
-
Your word is my command: Google search by voice: A case study
-
Springer
-
Johan Schalkwyk, Doug Beeferman, Françoise Beaufays, Bill Byrne, Ciprian Chelba, Mike Cohen, Maryam Kamvar, and Brian Strope, "Your Word is my Command: Google search by voice: A case study," in Advances in Speech Recognition, pp. 61-90. Springer, 2010
-
(2010)
Advances in Speech Recognition
, pp. 61-90
-
-
Schalkwyk, J.1
Beeferman, D.2
Beaufays, F.3
Byrne, B.4
Chelba, C.5
Cohen, M.6
Kamvar, M.7
Strope, B.8
-
19
-
-
84877760312
-
Large scale distributed deep networks
-
Jeffrey Dean, Greg Corrado, Rajat Monga, Kai Chen, Matthieu Devin, Mark Mao, Andrew Senior, Paul Tucker, Ke Yang, Quoc V Le, et al., "Large scale distributed deep networks," in NIPS, 2012, pp. 1223-1231
-
(2012)
NIPS
, pp. 1223-1231
-
-
Dean, J.1
Corrado, G.2
Monga, R.3
Chen, K.4
Devin, M.5
Mao, M.6
Senior, A.7
Tucker, P.8
Yang, K.9
Le, Q.V.10
et al.
-
20
-
-
80052250414
-
Adaptive subgradient methods for online learning and stochastic optimization
-
John Duchi, Elad Hazan, and Yoram Singer, "Adaptive subgradient methods for online learning and stochastic optimization," The Journal of Machine Learning Research, vol. 12, pp. 2121-2159, 2011
-
(2011)
The Journal of Machine Learning Research
, vol.12
, pp. 2121-2159
-
-
Duchi, J.1
Hazan, E.2
Singer, Y.3