-
1
-
-
0024634603
-
Phoneme recognition using time-delay neural networks
-
DOI 10.1109/29.21701
-
A. Waibel, T. Hanazawa, G.E. Hinton, K. Shikano, and K.J. Lang, "Phoneme recognition using time-delay neural networks, " Acoustics, Speech and Signal Processing, IEEE Transactions on, vol. 37, no. 3, pp. 328-339, 1989. (Pubitemid 19065785)
-
(1989)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.37
, Issue.3
, pp. 328-339
-
-
Waibel, A.1
Hanazawa, T.2
Hinton, G.3
Shikano, K.4
Lang, K.J.5
-
2
-
-
84951490428
-
Review of neural networks for speech recognition
-
R.P. Lippmann, "Review of neural networks for speech recognition, " Neural computation, vol. 1, no. 1, pp. 1-38, 1989.
-
(1989)
Neural Computation
, vol.1
, Issue.1
, pp. 1-38
-
-
Lippmann, R.P.1
-
5
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
G.E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
6
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks, " in Proc. Interspeech, 2011, pp. 437-440.
-
(2011)
Proc. Interspeech
, pp. 437-440
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
7
-
-
84874226274
-
The language-independent bottleneck features
-
IEEE
-
K. Vesely, M. Karafiát, F. Grezl, M. Janda, and E. Egorova, "The language-independent bottleneck features, " in Spoken Language Technology Workshop (SLT), 2012 IEEE. IEEE, 2012, pp. 336-341.
-
(2012)
Spoken Language Technology Workshop (SLT), 2012 IEEE
, pp. 336-341
-
-
Vesely, K.1
Karafiát, M.2
Grezl, F.3
Janda, M.4
Egorova, E.5
-
8
-
-
84890539009
-
Multilingual acoustic models using distributed deep neural networks
-
G Heigold, V Vanhoucke, A Senior, P Nguyen, M Ranzato, M Devin, and J Dean, "Multilingual acoustic models using distributed deep neural networks, " in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, 2013.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
-
-
Heigold, G.1
Vanhoucke, V.2
Senior, A.3
Nguyen, P.4
Ranzato, M.5
Devin, M.6
Dean, J.7
-
9
-
-
0024939480
-
Modularity and scaling in large phonemic neural networks
-
DOI 10.1109/29.45535
-
A Waibel, H Sawai, and K Shikano, "Modularity and scaling in large phonemic neural networks, " Acoustics, Speech and Signal Processing, IEEE Transactions on, vol. 37, no. 12, pp. 1888-1898, 1989. (Pubitemid 20642700)
-
(1989)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.37
, Issue.12
, pp. 1888-1898
-
-
Waibel Alexander1
Sawai Hidefumi2
Shikano Kiyohiro3
-
10
-
-
27144439262
-
Dataderived nonlinear mapping for feature extraction in hmm
-
Citeseer
-
Hynek Hermansky, Sangita Sharma, and Pratibha Jain, "Dataderived nonlinear mapping for feature extraction in hmm, " in Proc. ASRU. Citeseer, 1999, vol. 99.
-
(1999)
Proc. ASRU
, vol.99
-
-
Hermansky, H.1
Sharma, S.2
Jain, P.3
-
11
-
-
70450217311
-
Hierarchical processing of the modulation spectrum for gale mandarin lvcsr system
-
F. Valente, M. Magimai-Doss, C. Plahl, and S.V. Ravuri, "Hierarchical processing of the modulation spectrum for GALE Mandarin LVCSR system., " in Proc. Interspeech, 2009, pp. 2963-2966.
-
(2009)
Proc. Interspeech
, pp. 2963-2966
-
-
Valente, F.1
Magimai-Doss, M.2
Plahl, C.3
Ravuri, S.V.4
-
12
-
-
84906273176
-
Modular combination of deep neural networks for acoustic modeling
-
to appear
-
J. Gehring, W. Lee, K. Kilgour, I. Lane, Y. Miao, and A. Waibel, "Modular combination of deep neural networks for acoustic modeling, " in Proc. Interspeech, 2013, to appear.
-
(2013)
Proc. Interspeech
-
-
Gehring, J.1
Lee, W.2
Kilgour, K.3
Lane, I.4
Miao, Y.5
Waibel, A.6
-
13
-
-
84874278045
-
Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR
-
IEEE
-
P. Swietojanski, A. Ghoshal, and S. Renals, "Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR, " in Spoken Language Technology Workshop (SLT), 2012 IEEE. IEEE, 2012, pp. 246-251.
-
(2012)
Spoken Language Technology Workshop (SLT), 2012 IEEE
, pp. 246-251
-
-
Swietojanski, P.1
Ghoshal, A.2
Renals, S.3
-
14
-
-
34547548235
-
Probabilistic and bottle-neck features for LVCSR of meetings
-
IEEE
-
F. Grézl, M. Karafiát, S. Kontár, and J. Cernocky, "Probabilistic and bottle-neck features for LVCSR of meetings, " in Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on. IEEE, 2007, vol. 4, pp. IV-757.
-
(2007)
Acoustics, Speech and Signal Processing, 2007, ICASSP 2007, IEEE International Conference on
, vol.4
-
-
Grézl, F.1
Karafiát, M.2
Kontár, S.3
Cernocky, J.4
-
15
-
-
84890482429
-
Extracting deep bottleneck features using stacked auto-encoders
-
IEEE
-
J Gehring, Y Miao, F Metze, and A Waibel, "Extracting deep bottleneck features using stacked auto-encoders, " in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
-
-
Gehring, J.1
Miao, Y.2
Metze, F.3
Waibel, A.4
-
16
-
-
84864073449
-
Greedy layer-wise training of deep networks
-
Y Bengio, P Lamblin, D Popovici, and H Larochelle, "Greedy layer-wise training of deep networks, " Advances in neural information processing systems, vol. 19, pp. 153, 2007.
-
(2007)
Advances in Neural Information Processing Systems
, vol.19
, pp. 153
-
-
Bengio, Y.1
Lamblin, P.2
Popovici, D.3
Larochelle, H.4
-
17
-
-
79551480483
-
Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
-
P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.A. Manzagol, "Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion, " The Journal of Machine Learning Research, vol. 11, pp. 3371- 3408, 2010.
-
(2010)
The Journal of Machine Learning Research
, vol.11
, pp. 3371-3408
-
-
Vincent, P.1
Larochelle, H.2
Lajoie, I.3
Bengio, Y.4
Manzagol, P.A.5
-
18
-
-
84878559540
-
An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on asr performance
-
N.T. Vu, W. Breiter, F. Metze, and T. Schultz, "An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance, " in Proc. Interspeech, 2012.
-
(2012)
Proc. Interspeech
-
-
Vu, N.T.1
Breiter, W.2
Metze, F.3
Schultz, T.4
-
19
-
-
84867224965
-
On the use of a multilingual neural network front-end
-
S. Scanzio, P. Laface, L. Fissore, R. Gemello, and F. Mana, "On the use of a multilingual neural network front-end., " in Proc. Interspeech, 2008, pp. 2711-2714.
-
(2008)
Proc. Interspeech
, pp. 2711-2714
-
-
Scanzio, S.1
Laface, P.2
Fissore, L.3
Gemello, R.4
Mana, F.5
-
21
-
-
84890498592
-
Warped minimum variance distortionless response based bottle neck features for LVCSR
-
K. Kilgour, T. Seytzer, Q.B. Nguyen, and A. Waibel, "Warped minimum variance distortionless response based bottle neck features for LVCSR, " in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, 2013.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
-
-
Kilgour, K.1
Seytzer, T.2
Nguyen, Q.B.3
Waibel, A.4
-
22
-
-
84890461500
-
Multilingual training of deep-neural netowrks
-
A. Ghoshal, P. Swietojanski, and S. Renals, "Multilingual training of deep-neural netowrks, " in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, 2013.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
-
-
Ghoshal, A.1
Swietojanski, P.2
Renals, S.3
-
23
-
-
84893672075
-
The fundamental frequency variation spectrum
-
K Laskowski, MHeldner, and J Edlund, "The fundamental frequency variation spectrum, " Proceedings of FONETIK 2008, pp. 29-32, 2008.
-
(2008)
Proceedings of FONETIK 2008
, pp. 29-32
-
-
Laskowski, K.1
Heldner, M.2
Edlund, J.3
-
24
-
-
84893656667
-
Models of tone for tonal and non-tonal languages
-
IEEE, submitted for review
-
F. Metze, Z.A. Sheik, A. Waibel, J. Gehring, K. Kilgour, Q.B. Nguyen, and V.H. Nguyen, "Models of tone for tonal and non-tonal languages, " in Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on. IEEE, 2013, submitted for review.
-
(2013)
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
-
-
Metze, F.1
Sheik, Z.A.2
Waibel, A.3
Gehring, J.4
Kilgour, K.5
Nguyen, Q.B.6
Nguyen, V.H.7
-
26
-
-
0030643785
-
The karlsruhe-verbmobil speech recognition engine
-
IEEE
-
M. Finke, P. Geutner, H. Hild, T. Kemp, K. Ries, and M.Westphal, "The Karlsruhe-Verbmobil speech recognition engine, " in Acoustics, Speech, and Signal Processing, 1997. ICASSP- 97., 1997 IEEE International Conference on. IEEE, 1997, vol. 1, pp. 83-86.
-
(1997)
Acoustics, Speech, and Signal Processing, 1997, ICASSP- 97, 1997 IEEE International Conference on
, vol.1
, pp. 83-86
-
-
Finke, M.1
Geutner, P.2
Hild, H.3
Kemp, T.4
Ries, K.5
Westphal, M.6
-
27
-
-
84857819132
-
Theano: A CPU and GPU math expression compiler
-
June, Oral Presentation
-
J. Bergstra, O. Breuleux, F. Bastien, P. Lamblin, R. Pascanu, G. Desjardins, J. Turian, D. Warde-Farley, and Y. Bengio, "Theano: A CPU and GPU Math Expression Compiler, " in Proceedings of the Python for Scientific Computing Conference (SciPy), June 2010, Oral Presentation.
-
(2010)
Proceedings of the Python for Scientific Computing Conference (SciPy)
-
-
Bergstra, J.1
Breuleux, O.2
Bastien, F.3
Lamblin, P.4
Pascanu, R.5
Desjardins, G.6
Turian, J.7
Warde-Farley, D.8
Bengio, Y.9
-
28
-
-
84874282188
-
Improving wideband speech recognition using mixed-bandwidth training data in cd-dnn-hmm
-
2012 IEEE. IEEE
-
Jinyu Li, Dong Yu, Jui-Ting Huang, and Yifan Gong, "Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM, " in Spoken Language Technology Workshop (SLT), 2012 IEEE. IEEE, 2012, pp. 131-136.
-
(2012)
Spoken Language Technology Workshop (SLT)
, pp. 131-136
-
-
Li, J.1
Yu, D.2
Huang, J.-T.3
Gong, Y.4
|