-
1
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G.E. Dahl, A-R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T.N. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " Signal Processing Magazine, vol. 29, no. 6, pp. 82- 97, 2012.
-
(2012)
Signal Processing Magazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
Kingsbury, B.11
-
2
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A.-R. Mohamed, G.E. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks, " IEEE Trans. on ASLP, vol. 20, no. 1, pp. 14-22, 2012.
-
(2012)
IEEE Trans. on ASLP
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.-R.1
Dahl, G.E.2
Hinton, G.3
-
3
-
-
84890466217
-
Improving neural networks by preventing co-adaptation of feature detectors
-
p. abs/1207.0580
-
G.E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors, " CoRR, p. abs/1207.0580, 2012.
-
(2012)
CoRR
-
-
Hinton, G.E.1
Srivastava, N.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
4
-
-
84886714036
-
Acoustic modeling with hierarchical reservoirs
-
(accepted
-
F. Triefenbach, A. Jalalvand, K. Demuynck, and J-P. Martens, "Acoustic modeling with hierarchical reservoirs, " IEEE Trans. on ASLP, 2013, (accepted).
-
(2013)
IEEE Trans. on ASLP
-
-
Triefenbach, F.1
Jalalvand, A.2
Demuynck, K.3
Martens, J.-P.4
-
5
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition, " Comp. Speech and Lang., vol. 12, pp. 75-98, 1998. (Pubitemid 128383747)
-
(1998)
Computer Speech and Language
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.J.F.1
-
6
-
-
0034320005
-
Rapid speaker adaptation in eigenvoice space
-
DOI 10.1109/89.876308
-
R. Kuhn, J.-C. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in eigenvoice space, " IEEE Trans. on SAP, vol. 8, no. 6, pp. 695-707, 2000. (Pubitemid 32025317)
-
(2000)
IEEE Transactions on Speech and Audio Processing
, vol.8
, Issue.6
, pp. 695-707
-
-
Kuhn, R.1
Junqua, J.-C.2
Nguyen, P.3
Niedzielski, N.4
-
7
-
-
85009113852
-
Hmm adaptation using vector taylor series for noisy speech recognition
-
A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition, " in Proc. ICSLP, 2000, pp. 229-232.
-
(2000)
Proc. ICSLP
, pp. 229-232
-
-
Acero, A.1
Deng, L.2
Kristjansson, T.3
Zhang, J.4
-
8
-
-
85006734596
-
Evaluation of the SPLICE algorithm on the aurora2 database
-
J. Droppo, L. Deng, and A. Acero, "Evaluation of the SPLICE algorithm on the Aurora2 database, " in Proc. EUROSPEECH, 2001, pp. 217-220.
-
(2001)
Proc. EUROSPEECH
, pp. 217-220
-
-
Droppo, J.1
Deng, L.2
Acero, A.3
-
9
-
-
34547528168
-
Adaptive training with joint uncertainty decoding for robust recognition of noisy data
-
H. Liao and M.J.F Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data, " in Proc. ICASSP, 2007, pp. 389-392.
-
(2007)
Proc. ICASSP
, pp. 389-392
-
-
Liao, H.1
Gales, M.J.F.2
-
10
-
-
0030364003
-
Reduced semi-continuous models for large vocabulary continuous speech recognition in dutch
-
K. Demuynck, J. Duchateau, and D. Van Compernolle, "Reduced semi-continuous models for large vocabulary continuous speech recognition in Dutch, " in Proc. ICSLP, 1996, vol. IV, pp. 2289-2292.
-
(1996)
Proc. ICSLP
, vol.4
, pp. 2289-2292
-
-
Demuynck, K.1
Duchateau, J.2
Van Compernolle, D.3
-
11
-
-
0013344078
-
Training products of experts by minimizing contrastive divergence
-
G.E. Hinton, "Training products of experts by minimizing contrastive divergence, " Neural Computation, vol. 14, pp. 1771- 1800, 2002.
-
(2002)
Neural Computation
, vol.14
, pp. 1771-1800
-
-
Hinton, G.E.1
-
13
-
-
0029228458
-
Optimal linear feature transformations for semi-continuous hidden markov models
-
G. Schukat-Talamazzini, J. Hornegger, and H. Niemann, "Optimal linear feature transformations for semi-continuous hidden Markov models, " in Proc. ICASSP, 1995, vol. I, pp. 369- 372.
-
(1995)
Proc. ICASSP
, vol.1
, pp. 369-372
-
-
Schukat-Talamazzini, G.1
Hornegger, J.2
Niemann, H.3
-
15
-
-
84871612455
-
Optimal feature sub-space selection based on discriminant analysis
-
K. Demuynck, J. Duchateau, and D. Van Compernolle, "Optimal feature sub-space selection based on discriminant analysis, " in Proc. EUROSPEECH, 1999, vol. III, pp. 1311-1314.
-
(1999)
Proc. EUROSPEECH
, vol.3
, pp. 1311-1314
-
-
Demuynck, K.1
Duchateau, J.2
Van Compernolle, D.3
-
16
-
-
84893656625
-
Class definition in discriminant feature analysis
-
J. Duchateau, K. Demuynck, D. Van Compernolle, and P. Wambacq, "Class definition in discriminant feature analysis, " in Proc. EUROSPEECH, 2001, vol. III, pp. 1621-1624.
-
(2001)
Proc. EUROSPEECH
, vol.3
, pp. 1621-1624
-
-
Duchateau, J.1
Demuynck, K.2
Van Compernolle, D.3
Wambacq, P.4
-
17
-
-
0002235014
-
Improved feature decorrelation for hmm-based speech recognition
-
K. Demuynck, J. Duchateau, D. Van Compernolle, and P.Wambacq, "Improved feature decorrelation for HMM-based speech recognition, " in Proc. ICSLP, 1998, vol. VII, pp. 2907- 2910.
-
(1998)
Proc. ICSLP
, vol.7
, pp. 2907-2910
-
-
Demuynck, K.1
Duchateau, J.2
Van Compernolle, D.3
Wambacq, P.4
-
18
-
-
4243460174
-
Semi-tied covariance matrices
-
M.J.F. Gales, "Semi-tied covariance matrices, " in Proc. ICASSP, 1998, vol. II, pp. 657-660.
-
(1998)
Proc. ICASSP
, vol.2
, pp. 657-660
-
-
Gales, M.J.F.1
-
19
-
-
70450191324
-
A study of bootstrapping with multiple acoustic features for improved automatic speech recognition
-
X. Cui, J. Xue, B. Xiang, and B. Zhou, "A study of bootstrapping with multiple acoustic features for improved automatic speech recognition, " in Proc. INTERSPEECH, 2009, pp. 240- 243.
-
(2009)
Proc. INTERSPEECH
, pp. 240-243
-
-
Cui, X.1
Xue, J.2
Xiang, B.3
Zhou, B.4
-
20
-
-
85083953021
-
Feature learning in deep neural networks - studies on speech recognition tasks
-
vol. abs/1301.3605
-
D. Yu, M.L. Seltzer, J. Li, J.-T. Huang, and F. Seide, "Feature learning in deep neural networks - studies on speech recognition tasks, " CoRR, vol. abs/1301.3605, 2013, http://arxiv.org/abs/1301.3605.
-
(2013)
CoRR
-
-
Yu, D.1
Seltzer, M.L.2
Li, J.3
Huang, J.-T.4
Seide, F.5
-
21
-
-
77949426518
-
Hidden conditional random fields for phone recognition
-
Y-h. Sung and D. Jurafsky, "Hidden conditional random fields for phone recognition, " in Proc. ASRU, 2009, pp. 107-112.
-
(2009)
Proc. ASRU
, pp. 107-112
-
-
Sung, Y.-H.1
Jurafsky, D.2
-
22
-
-
70349208656
-
A flat direct model for speech recognition
-
G. Heigold, G. Zweig, X. Li, and P. Nguyen, "A flat direct model for speech recognition, " in Proc. ICASSP, 2009, pp. 3861-3864.
-
(2009)
Proc. ICASSP
, pp. 3861-3864
-
-
Heigold, G.1
Zweig, G.2
Li, X.3
Nguyen, P.4
-
23
-
-
0004158153
-
Speech recognition with dynamic bayesian networks
-
G. Zweig and S. Russell, "Speech recognition with dynamic Bayesian networks, " Tech. Rep., Microsoft, 1998.
-
(1998)
Tech. Rep., Microsoft
-
-
Zweig, G.1
Russell, S.2
-
24
-
-
78649308591
-
Sequential labeling using deepstructured conditional random fields
-
D. Yu, S.Wang, and L. Deng, "Sequential labeling using deepstructured conditional random fields, " IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 6, pp. 965-973, 2010.
-
(2010)
IEEE Journal of Selected Topics in Signal Processing
, vol.4
, Issue.6
, pp. 965-973
-
-
Yu, D.1
Wang, S.2
Deng, L.3
-
25
-
-
0003548585
-
The darpa timit acoustic-phonetic continuous speech corpus cd-rom
-
J. Garofolo, L. Lamel, W. Fisher, J. Fiscus, D. Pallett, and N. Dahlgren, "The DARPA TIMIT acoustic-phonetic continuous speech corpus CD-rom, " Tech. report, NIST, 1993.
-
(1993)
Tech. Report, NIST
-
-
Garofolo, J.1
Lamel, L.2
Fisher, W.3
Fiscus, J.4
Pallett, D.5
Dahlgren, N.6
-
26
-
-
0024768209
-
Speaker-independent phone recognition using hidden Markov models
-
DOI 10.1109/29.46546
-
K. Lee and H.-W. Hon, "Speaker-independent phone recognition using hidden Markov models, " IEEE Trans. on ASSP, vol. 37, pp. 1641-1648, 1989. (Pubitemid 20652953)
-
(1989)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.37
, Issue.11
, pp. 1641-1648
-
-
Lee, K.-F.1
Hon, H.-W.2
-
27
-
-
80051654263
-
Deep belief networks using discriminative features for phone recognition
-
A.R. Mohamed, T.N. Sainath, G. Dahl, B. Ramabhadran, G.E. Hinton, and M.A. Picheny, "Deep belief networks using discriminative features for phone recognition, " in Proc. ICASSP, 2011, pp. 5060-5063.
-
(2011)
Proc. ICASSP
, pp. 5060-5063
-
-
Mohamed, A.R.1
Sainath, T.N.2
Dahl, G.3
Ramabhadran, B.4
Hinton, G.E.5
Picheny, M.A.6
-
28
-
-
84867215798
-
Spraak: An open source speech recognition and automatic annotation kit
-
Sept.
-
K. Demuynck, J. Roelens, D. Van Compernolle, and P. Wambacq, "SPRAAK: An open source speech recognition and automatic annotation kit, " in Proc. INTERSPEECH, Sept. 2008, p. 495.
-
(2008)
Proc. INTERSPEECH
, pp. 495
-
-
Demuynck, K.1
Roelens, J.2
Van Compernolle, D.3
Wambacq, P.4
-
29
-
-
44949142593
-
A flexible recogniser architecture in a reading tutor for children
-
J. Duchateau, M. Wigham, K. Demuynck, and H. Van hamme, "A flexible recogniser architecture in a reading tutor for children, " in Proc. ITRWon Speech Recognition and Intrinsic Variation, 2006, pp. 59-64.
-
(2006)
Proc. ITRWon Speech Recognition and Intrinsic Variation
, pp. 59-64
-
-
Duchateau, J.1
Wigham, M.2
Demuynck, K.3
Van Hamme, H.4
-
30
-
-
0031211090
-
A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting
-
Y. Freund and R.E. Schapire, "A decision-theoretic generalization of on-line learning and an application to boosting, " Journal of computer and system sciences, vol. 55, no. 1, pp. 119- 139, 1997. (Pubitemid 127433398)
-
(1997)
Journal of Computer and System Sciences
, vol.55
, Issue.1
, pp. 119-139
-
-
Freund, Y.1
Schapire, R.E.2
-
31
-
-
0344666609
-
Boosting linear discriminant analysis for face recognition
-
J. Lu, K.N. Plataniotis, and A.N. Venetsanopoulos, "Boosting linear discriminant analysis for face recognition, " in Proc. ICIP, 2003, vol. 1, pp. 657-660.
-
(2003)
Proc. ICIP
, vol.1
, pp. 657-660
-
-
Lu, J.1
Plataniotis, K.N.2
Venetsanopoulos, A.N.3
-
32
-
-
0030677475
-
Speaker adaptive training: A maximum likelihood approach to speaker normalization
-
T. Anastasakos, J. McDonough, and J. Makhoul, "Speaker adaptive training: A maximum likelihood approach to speaker normalization, " in Proc. ICASSP, 1997, vol. 2, pp. 1043-1046.
-
(1997)
Proc. ICASSP
, vol.2
, pp. 1043-1046
-
-
Anastasakos, T.1
McDonough, J.2
Makhoul, J.3
-
33
-
-
84867600898
-
Latent variable speaker adaptation of gaussian mixture weights and means
-
X. Zhang, K. Demuynck, and H. Van hamme, "latent variable speaker adaptation of gaussian mixture weights and means, " in Proc. ICASSP, 2012, pp. 4349-4352.
-
(2012)
Proc. ICASSP
, pp. 4349-4352
-
-
Zhang, X.1
Demuynck, K.2
Van Hamme, H.3
-
35
-
-
0034849080
-
Improved discriminative training techniques for large vocabulary continuous speech recognition
-
D. Povey and P.C. Woodland, "Improved discriminative training techniques for large vocabulary continuous speech recognition, " in Proc. ICASSP, 2001, vol. 1, pp. 45-48. (Pubitemid 32839184)
-
(2001)
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
, vol.1
, pp. 45-48
-
-
Povey, D.1
Woodland, P.C.2
-
36
-
-
0036296863
-
Minimum phone error and i-smoothing for improved discriminative training
-
D. Povey and P.C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training, " in Proc. ICASSP, 2002, vol. 1, pp. 105-108.
-
(2002)
Proc. ICASSP
, vol.1
, pp. 105-108
-
-
Povey, D.1
Woodland, P.C.2
-
37
-
-
70349213445
-
Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
-
B. Kingsbury, "lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, " in Proc. ICASSP, 2009, pp. 3761-3764.
-
(2009)
Proc. ICASSP
, pp. 3761-3764
-
-
Kingsbury, B.1
-
38
-
-
80051622448
-
A-functions: A generalization of extended baum-welch transformations to convex optimization
-
D. Kanevsky, D. Nahamoo, T.N. Sainath, B. Ramabhadran, and P.A. Olsen, "A-functions: A generalization of extended Baum-Welch transformations to convex optimization, " in Proc. ICASSP, 2011, pp. 5164-5167.
-
(2011)
Proc. ICASSP
, pp. 5164-5167
-
-
Kanevsky, D.1
Nahamoo, D.2
Sainath, T.N.3
Ramabhadran, B.4
Olsen, P.A.5
-
39
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks, " in Proc. INTERSPEECH, 2011, pp. 437-440.
-
(2011)
Proc. INTERSPEECH
, pp. 437-440
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
40
-
-
84976736715
-
The hearsay-ii speech understanding system: Integrating knowledge to resolve uncertainty
-
L.D. Erman, F. Hayes-Roth, V.R. Lesser, and D.R. Reddy, "The HEARSAY-II speech understanding system: Integrating knowledge to resolve uncertainty, " Computing Surveys, vol. 12, pp. 213-253, 1980.
-
(1980)
Computing Surveys
, vol.12
, pp. 213-253
-
-
Erman, L.D.1
Hayes-Roth, F.2
Lesser, V.R.3
Reddy, D.R.4
-
41
-
-
84893686887
-
Attention shift decoding for conversational speech recognition
-
R. Kumaran, J. Bilmes, and K. Kirchhoff, "Attention shift decoding for conversational speech recognition, " in Proc. INTERSPEECH, 2007, pp. 1493-1496.
-
(2007)
Proc. INTERSPEECH
, pp. 1493-1496
-
-
Kumaran, R.1
Bilmes, J.2
Kirchhoff, K.3
|