-
1
-
-
84867605836
-
Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition
-
O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition, " ICASSP, 2012
-
(2012)
ICASSP
-
-
Abdel-Hamid, O.1
Mohamed, A.2
Jiang, H.3
Penn, G.4
-
2
-
-
84878397276
-
Pipelined back-propagation for context-dependent deep neural networks
-
X. Chen, A. Eversole, G. Li, D. Yu, and F. Seide, "Pipelined back-propagation for context-dependent deep neural networks, " Interspeech, 2012
-
(2012)
Interspeech
-
-
Chen, X.1
Eversole, A.2
Li, G.3
Yu, D.4
Seide, F.5
-
3
-
-
80051616844
-
Large vocabulary continuous speech recognition with context-dependent DBN-HMMs
-
G. Dahl, D. Yu, L. Deng. "Large vocabulary continuous speech recognition with context-dependent DBN-HMMs, " ICASSP, 2011
-
(2011)
ICASSP
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
-
4
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
-
G. Dahl, D. Yu, L. Deng, and A. Acero. "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition, " IEEE Trans. Speech and Audio Proc., vol. 20, no. 1, pp. 30-42, 2012
-
(2012)
IEEE Trans. Speech and Audio Proc.
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
Acero, A.4
-
5
-
-
84877760312
-
Large scale distributed deep networks
-
J. Dean, G. Corrado, R. Monga, K. Chen, M. Devin, Q. Le, M. Mao, M. Ranzato, A. Senior, P. Tucker, K. Yang, and A. Ng. "Large scale distributed deep networks, " NIPS, 2012
-
(2012)
NIPS
-
-
Dean, J.1
Corrado, G.2
Monga, R.3
Chen, K.4
Devin, M.5
Le, Q.6
Mao, M.7
Ranzato, M.8
Senior, A.9
Tucker, P.10
Yang, K.11
Ng, A.12
-
6
-
-
84890491198
-
Recent advances of deep learning for speech research at Microsoft
-
L. Deng, J. Li, J. Huang, K. Yao, D. Yu, F. Seide, M. Seltzer, G. Zweig, X. He, J. Williams, Y. Gong, and A. Acero. "Recent advances of deep learning for speech research at Microsoft, " ICASSP, 2013
-
(2013)
ICASSP
-
-
Deng, L.1
Li, J.2
Huang, J.3
Yao, K.4
Yu, D.5
Seide, F.6
Seltzer, M.7
Zweig, G.8
He, X.9
Williams, J.10
Gong, Y.11
Acero, A.12
-
7
-
-
84867614591
-
Scalable stacking and learning for building deep architectures
-
L. Deng, D. Yu, and J. Platt. "Scalable stacking and learning for building deep architectures, " ICASSP, 2012
-
(2012)
ICASSP
-
-
Deng, L.1
Yu, D.2
Platt, J.3
-
8
-
-
79959842828
-
Binary coding of speech spectrograms using a deep auto-encoder
-
L. Deng, M. Seltzer, D. Yu, A. Acero, A. Mohamed, and G. Hinton, "Binary coding of speech spectrograms using a deep auto-encoder, " Interspeech, 2010
-
(2010)
Interspeech
-
-
Deng, L.1
Seltzer, M.2
Yu, D.3
Acero, A.4
Mohamed, A.5
Hinton, G.6
-
9
-
-
84890534540
-
Use of kernel deep convex networks and end-to-end learning for spoken language understanding
-
L. Deng, G. Tur, X. He, and D. Hakkani-Tur, "Use of kernel deep convex networks and end-to-end learning for spoken language understanding, " IEEE SLT, 2012.
-
(2012)
IEEE SLT
-
-
Deng, L.1
Tur, G.2
He, X.3
Hakkani-Tur, D.4
-
10
-
-
0033623527
-
Spontaneous speech recognition using a statistical coarticulatory model for the vocal tract resonance dynamics
-
L. Deng and J. Ma, "Spontaneous speech recognition using a statistical coarticulatory model for the vocal tract resonance dynamics, " J. Acoust. Soc. Am., vol. 108, pp. 3036-3048, 2000
-
(2000)
J. Acoust. Soc. Am.
, vol.108
, pp. 3036-3048
-
-
Deng, L.1
Ma, J.2
-
11
-
-
33744966561
-
A bidirectional target filtering model of speech coarticulation: Two-stage implementation for phonetic recognition
-
L. Deng, D. Yu, and A. Acero. "A bidirectional target filtering model of speech coarticulation: Two-stage implementation for phonetic recognition, " IEEE Transactions on Audio and Speech Processing, vol. 14, pp. 256-265, 2006
-
(2006)
IEEE Transactions on Audio and Speech Processing
, vol.14
, pp. 256-265
-
-
Deng, L.1
Yu, D.2
Acero, A.3
-
12
-
-
34047266395
-
Structured speech modeling
-
L. Deng, D. Yu, and A. Acero. "Structured speech modeling, " IEEE Trans. on Audio, Speech and Language Processing, vol. 14, no. 5, pp. 1492-1504, 2006.
-
(2006)
IEEE Trans. on Audio, Speech and Language Processing
, vol.14
, Issue.5
, pp. 1492-1504
-
-
Deng, L.1
Yu, D.2
Acero, A.3
-
13
-
-
34547551709
-
Use of differential cepstra as acoustic features in hidden trajectory modeling for phonetic recognition
-
L. Deng and D. Yu. "Use of differential cepstra as acoustic features in hidden trajectory modeling for phonetic recognition, " ICASSP, 2007
-
(2007)
ICASSP
-
-
Deng, L.1
Yu, D.2
-
15
-
-
84890468916
-
Deep learning for speech recognition and related applications
-
L. Deng, D. Yu, and G. Hinton. "Deep Learning for Speech Recognition and Related Applications, " NIPS Workshop, 2009. http://nips.cc/Conferences/2009/Program/event.php?ID=1512
-
(2009)
NIPS Workshop
-
-
Deng, L.1
Yu, D.2
Hinton, G.3
-
16
-
-
84890526837
-
New types of deep neural network learning for speech recognition and related applications: An overview
-
L. Deng, G. Hinton, and B. Kingsbury. "New types of deep neural network learning for speech recognition and related applications: An overview, " ICASSP, 2013
-
(2013)
ICASSP
-
-
Deng, L.1
Hinton, G.2
Kingsbury, B.3
-
17
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
Nov
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A.-R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury. "Deep neural networks for acoustic modeling in speech recognition, " IEEE Signal Processing Magazine, Vol. 29, No. 6, pp. 82-97, Nov., 2012
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
18
-
-
84867720412
-
Improving neural networks by preventing coadaptation of feature detectors
-
G. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. "Improving neural networks by preventing coadaptation of feature detectors, " arXiv:1207.0580v1, 2012
-
(2012)
arXiv:1207.0580
-
-
Hinton, G.1
Srivastava, N.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
20
-
-
84878539964
-
Application of pretrained deep neural networks to large vocabulary speech recognition
-
N. Jaitly, P. Nguyen, and V. Vanhoucke, "Application of pretrained deep neural networks to large vocabulary speech recognition, " Interspeech, 2012
-
(2012)
Interspeech
-
-
Jaitly, N.1
Nguyen, P.2
Vanhoucke, V.3
-
21
-
-
84878379108
-
Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
-
B. Kingsbury, T. N. Sainath, and H. Soltau. "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization, " Interspeech, 2012
-
(2012)
Interspeech
-
-
Kingsbury, B.1
Sainath, T.N.2
Soltau, H.3
-
22
-
-
84876231242
-
ImageNet classification with deep convolutional neural networks
-
A. Krizhevsky, I. Sutskever, and G. Hinton. "ImageNet classification with deep convolutional neural networks, " NIPS, 2012
-
(2012)
NIPS
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.3
-
23
-
-
85161972005
-
Tiled convolutional neural networks
-
Q. Le, J. Ngiam, Z. Chen, D. Chia, W. Pang, and A. Ng. "Tiled convolutional neural networks, " NIPS, 2010
-
(2010)
NIPS
-
-
Le, Q.1
Ngiam, J.2
Chen, Z.3
Chia, D.4
Pang, W.5
Ng, A.6
-
24
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. "Gradient-based learning applied to document recognition, " Proceedings of the IEEE, pp. 2278-2324, 1998
-
(1998)
Proceedings of the IEEE
, pp. 2278-2324
-
-
Lecun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
25
-
-
5044231640
-
Learning methods for generic object recognition with invariance to pose and lighting
-
Y. LeCun, F. Huang, and L. Bottou, "Learning methods for generic object recognition with invariance to pose and lighting, " Proc. IEEE CVPR, 2004.
-
(2004)
Proc. IEEE CVPR
-
-
Lecun, Y.1
Huang, F.2
Bottou, L.3
-
26
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks, " IEEE Trans. on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 14-22, 2012
-
(2012)
IEEE Trans. on Audio, Speech, and Language Processing
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.2
Hinton, G.3
-
27
-
-
79959840616
-
Investigation of full-sequence training of deep belief networks for speech recognition
-
A. Mohamed, D. Yu, and L. Deng. "Investigation of full-sequence training of deep belief networks for speech recognition, " Interspeech, 2010
-
(2010)
Interspeech
-
-
Mohamed, A.1
Yu, D.2
Deng, L.3
-
28
-
-
80051654263
-
Deep belief nets using discriminative features for phone recognition
-
A. Mohamed, T. Sainath, G. Dahl, B. Ramabhadran, G. Hinton, and M. Picheny. "Deep belief nets using discriminative features for phone recognition, " ICASSP, 2011
-
(2011)
ICASSP
-
-
Mohamed, A.1
Sainath, T.2
Dahl, G.3
Ramabhadran, B.4
Hinton, G.5
Picheny, M.6
-
29
-
-
84255177123
-
Deep and wide: Multiple layers in automatic speech recognition
-
N. Morgan. "Deep and wide: Multiple layers in automatic speech recognition, " IEEE Trans. on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 7-13, 2012
-
(2012)
IEEE Trans. on Audio, Speech, and Language Processing
, vol.20
, Issue.1
, pp. 7-13
-
-
Morgan, N.1
-
30
-
-
80053437179
-
Multimodal deep learning
-
J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee, and A. Ng, "Multimodal deep learning, " ICML, 2011
-
(2011)
ICML
-
-
Ngiam, J.1
Khosla, A.2
Kim, M.3
Nam, J.4
Lee, H.5
Ng, A.6
-
31
-
-
0034047363
-
Effect of speaking rate and contrastive stress on formant dynamics and vowel perception
-
M. Pitermann, "Effect of speaking rate and contrastive stress on formant dynamics and vowel perception, " J. Acoust. Soc. Am., vol. 107, pp. 3425-3437, 2000
-
(2000)
J. Acoust. Soc. Am.
, vol.107
, pp. 3425-3437
-
-
Pitermann, M.1
-
32
-
-
84858972572
-
Making deep belief networks effective for large vocabulary continuous speech recognition
-
T. Sainath, B. Kingsbury, B. Ramabhadran, P. Fousek, P. Novak, and A. Mohamed, "Making deep belief networks effective for large vocabulary continuous speech recognition, " Proc. ASRU, pp. 30-35, 2011
-
(2011)
Proc. ASRU
, pp. 30-35
-
-
Sainath, T.1
Kingsbury, B.2
Ramabhadran, B.3
Fousek, P.4
Novak, P.5
Mohamed, A.6
-
33
-
-
84878572738
-
Enhancing exemplar-based posteriors for speech recognition tasks
-
T. Sainath, D. Nahamoo, D. Kanevsky, and B. Ramabhadran, "Enhancing exemplar-based posteriors for speech recognition tasks, " Interspeech, 2012
-
(2012)
Interspeech
-
-
Sainath, T.1
Nahamoo, D.2
Kanevsky, D.3
Ramabhadran, B.4
-
34
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks, " Interspeech, 2011
-
(2011)
Interspeech
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
35
-
-
84867605416
-
Towards deeper understanding: Deep convex networks for semantic utterance classification
-
G. Tur, L. Deng, D. Hakkani-Tur, and X. He, "Towards deeper understanding: Deep convex networks for semantic utterance classification, " ICASSP, 2012
-
(2012)
ICASSP
-
-
Tur, G.1
Deng, L.2
Hakkani-Tur, D.3
He, X.4
-
36
-
-
84055163920
-
Roles of pretraining and finetuning in context-dependent DNN-HMMs for real-world speech recognition
-
D. Yu, L. Deng, and G. Dahl, "Roles of pretraining and finetuning in context-dependent DNN-HMMs for real-world speech recognition, " NIPS Workshop, 2010
-
(2010)
NIPS Workshop
-
-
Yu, D.1
Deng, L.2
Dahl, G.3
-
37
-
-
84871387302
-
The deep tensor neural network with applications to large vocabulary speech recognition
-
Feb
-
D. Yu, L. Deng, and F. Seide. "The deep tensor neural network with applications to large vocabulary speech recognition, " IEEE Trans. Audio, Speech, and Lang. Proc. vol. 21, no. 2, pp. 388-396, Feb, 2013.
-
(2013)
IEEE Trans. Audio, Speech, and Lang. Proc
, vol.21
, Issue.2
, pp. 388-396
-
-
Yu, D.1
Deng, L.2
Seide, F.3