-
1
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Trans. Audio, Speech, Language Process., vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
IEEE Trans. Audio, Speech, Language Process
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
2
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. E. Dahl, A.-R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
Kingsbury, B.11
-
3
-
-
84901502980
-
Feature learning in deep neural networks-studies on speech recognition
-
D. Yu, M. Seltzer, J. Li, J.-T. Huang, and F. Seide, "Feature learning in deep neural networks-studies on speech recognition," in Proc. ICLR, 2013.
-
(2013)
Proc. ICLR
-
-
Yu, D.1
Seltzer, M.2
Li, J.3
Huang, J.-T.4
Seide, F.5
-
5
-
-
84867809023
-
A nonparametric Bayesian approach to acoustic model discovery
-
C. Lee and J. R. Glass, "A nonparametric Bayesian approach to acoustic model discovery," in Proc. ACL, 2012.
-
(2012)
Proc. ACL
-
-
Lee, C.1
Glass, J.R.2
-
6
-
-
70450158585
-
Unsupervised training of an HMM-based speech recognizer for topic classification
-
H. Gish, M.-H. Siu, A. Chan, and B. Belfield, "Unsupervised training of an HMM-based speech recognizer for topic classification," in Proc. Interspeech, 2009.
-
(2009)
Proc. Interspeech
-
-
Gish, H.1
Siu, M.-H.2
Chan, A.3
Belfield, B.4
-
7
-
-
77949473673
-
Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams
-
Y. Zhang and J. R. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams," in Proc. ASRU, 2009.
-
(2009)
Proc. ASRU
-
-
Zhang, Y.1
Glass, J.R.2
-
8
-
-
84890478910
-
The spoken web search task at MediaEval 2012
-
F. Metze, X. Anguera, E. Barnard, M. Davel, and G. Gravier, "The spoken web search task at MediaEval 2012," in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Metze, F.1
Anguera, X.2
Barnard, E.3
Davel, M.4
Gravier, G.5
-
9
-
-
84890471125
-
On rectified linear units for speech processing
-
M. D. Zeiler, M. Ranzato, R. Monga, M. Mao, K. Yang, Q. V. Le, P. Nguyen, A. Senior, V. Vanhoucke, J. Dean, and G. E. Hinton, "On rectified linear units for speech processing," in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Zeiler, M.D.1
Ranzato, M.2
Monga, R.3
Mao, M.4
Yang, K.5
Le, Q.V.6
Nguyen, P.7
Senior, A.8
Vanhoucke, V.9
Dean, J.10
Hinton, G.E.11
-
10
-
-
84905227103
-
An autoencoder based approach to unsupervised learning of subword units
-
L. Badino, C. Canevari, L. Fadiga, and G. Metta, "An autoencoder based approach to unsupervised learning of subword units," in Proc. ICASSP, 2014.
-
(2014)
Proc. ICASSP
-
-
Badino, L.1
Canevari, C.2
Fadiga, L.3
Metta, G.4
-
11
-
-
33947643715
-
Unsupervised word acquisition from speech using pattern discovery
-
A. Park and J. R. Glass, "Unsupervised word acquisition from speech using pattern discovery," in Proc. ICASSP, 2006.
-
(2006)
Proc. ICASSP
-
-
Park, A.1
Glass, J.R.2
-
12
-
-
84858987768
-
Efficient spoken term discovery using randomized algorithms
-
A. Jansen and B. Van Durme, "Efficient spoken term discovery using randomized algorithms," in Proc. ASRU, 2011.
-
(2011)
Proc. ASRU
-
-
Jansen, A.1
Van Durme, B.2
-
13
-
-
84893673786
-
A hierarchical system for word discovery exploiting DTW-based initialization
-
O. Walter, T. Korthals, R. Haeb-Umbach, and B. Raj, "A hierarchical system for word discovery exploiting DTW-based initialization," in Proc. ASRU, 2013.
-
(2013)
Proc. ASRU
-
-
Walter, O.1
Korthals, T.2
Haeb-Umbach, R.3
Raj, B.4
-
14
-
-
84865770260
-
Towards unsupervised training of speaker independent acoustic models
-
A. Jansen and K. Church, "Towards unsupervised training of speaker independent acoustic models," in Proc. Interspeech, 2011.
-
(2011)
Proc. Interspeech
-
-
Jansen, A.1
Church, K.2
-
15
-
-
84890467020
-
Weak top-down constraints for unsupervised acoustic model training
-
A. Jansen, S. Thomas, and H. Hermansky, "Weak top-down constraints for unsupervised acoustic model training," in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Jansen, A.1
Thomas, S.2
Hermansky, H.3
-
16
-
-
0026400245
-
An investigation of PLP and IMELDA acoustic representations and of their potential for combination
-
M. Hunt, S. M. Richardson, D. C. Bateman, and A. Piau, "An investigation of PLP and IMELDA acoustic representations and of their potential for combination," in Proc. ICASSP, 1991.
-
(1991)
Proc. ICASSP
-
-
Hunt, M.1
Richardson, S.M.2
Bateman, D.C.3
Piau, A.4
-
17
-
-
84946685733
-
Phonetics embedding learning with side information
-
G. Synnaeve1, T. Schatz, and E. Dupoux, "Phonetics embedding learning with side information," in Proc. SLT, 2014.
-
(2014)
Proc. SLT
-
-
Synnaevel, G.1
Schatz, T.2
Dupoux, E.3
-
18
-
-
69349090197
-
Learning deep architectures for AI
-
Y. Bengio, "Learning deep architectures for AI," Found. Trends Mach. Learning, vol. 2, no. 1, pp. 1-127, 2009.
-
(2009)
Found. Trends Mach. Learning
, vol.2
, Issue.1
, pp. 1-127
-
-
Bengio, Y.1
-
19
-
-
33746600649
-
Reducing the dimensionality of data with neural networks
-
G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
-
(2006)
Science
, vol.313
, Issue.5786
, pp. 504-507
-
-
Hinton, G.E.1
Salakhutdinov, R.R.2
-
20
-
-
84864073449
-
Greedy layer-wise training of deep networks
-
Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, "Greedy layer-wise training of deep networks," in Proc. NIPS, 2007.
-
(2007)
Proc. NIPS
-
-
Bengio, Y.1
Lamblin, P.2
Popovici, D.3
Larochelle, H.4
-
21
-
-
0017930815
-
Dynamic programming algorithm optimization for spoken word recognition
-
H. Sakoe and S. Chiba, "Dynamic programming algorithm optimization for spoken word recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. 26, no. 1, pp. 43-49, 1978.
-
(1978)
IEEE Trans. Acoust., Speech, Signal Process
, vol.26
, Issue.1
, pp. 43-49
-
-
Sakoe, H.1
Chiba, S.2
-
22
-
-
56449089103
-
Extracting and composing robust features with denoising autoencoders
-
P. Vincent, H. Larochelle, Y. Bengio, and P.-A. Manzagol, "Extracting and composing robust features with denoising autoencoders," in Proc. ICML, 2008.
-
(2008)
Proc. ICML
-
-
Vincent, P.1
Larochelle, H.2
Bengio, Y.3
Manzagol, P.-A.4
-
23
-
-
0003571976
-
-
Cambridge University Engineering Department
-
S. J. Young, G. Evermann, M. J. F. Gales, T. Hain, D. Kershaw, X. Liu, G. L. Moore, J. J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. C. Woodland, The HTK Book (for HTK Version 3. 4), Cambridge University Engineering Department, 2009.
-
(2009)
The HTK Book (For HTK Version 3. 4)
-
-
Young, S.J.1
Evermann, G.2
Gales, M.J.F.3
Hain, T.4
Kershaw, D.5
Liu, X.6
Moore, G.L.7
Odell, J.J.8
Ollason, D.9
Povey, D.10
Valtchev, V.11
Woodland, P.C.12
-
24
-
-
84893401626
-
-
arXiv:1308. 4214
-
I. J. Goodfellow, D. Warde-Farley, P. Lamblin, V. Dumoulin, M. Mirza, R. Pascanu, J. Bergstra, F. Bastien, and Y. Bengio, "Pylearn2: a machine learning research library," arXiv:1308. 4214, 2013.
-
(2013)
Pylearn2: A Machine Learning Research Library
-
-
Goodfellow, I.J.1
Warde-Farley, D.2
Lamblin, P.3
Dumoulin, V.4
Mirza, M.5
Pascanu, R.6
Bergstra, J.7
Bastien, F.8
Bengio, Y.9
-
25
-
-
84905240834
-
Recurrent deep neural networks for robust speech recognition
-
C. Weng, D. Yu, S. Watanabe, and B.-H. Juang, "Recurrent deep neural networks for robust speech recognition," in Proc. ICASSP, 2014.
-
(2014)
Proc. ICASSP
-
-
Weng, C.1
Yu, D.2
Watanabe, S.3
Juang, B.-H.4
-
26
-
-
84865767134
-
Rapid evaluation of speech representations for spoken term discovery
-
M. A. Carlin, S. Thomas, A. Jansen, and H. Hermansky, "Rapid evaluation of speech representations for spoken term discovery," in Proc. Interspeech, 2011
-
(2011)
Proc. Interspeech
-
-
Carlin, M.A.1
Thomas, S.2
Jansen, A.3
Hermansky, H.4
|