-
2
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
-
G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. Speech and Audio Proc., vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
IEEE Trans. Speech and Audio Proc
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
Acero, A.4
-
3
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech, 2011.
-
(2011)
Proc. Interspeech
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
4
-
-
84874235393
-
Why deep neural networks are promising for large vocabulary speech recognition
-
submitted to
-
D. Yu, F. Seide, G. Li, J. Li, and M. Seltzer, "Why deep neural networks are promising for large vocabulary speech recognition," submitted to IEEE Trans. on Audio, Speech, and Language Processing, 2012.
-
(2012)
IEEE Trans. on Audio, Speech, and Language Processing
-
-
Yu, D.1
Seide, F.2
Li, G.3
Li, J.4
Seltzer, M.5
-
5
-
-
84999742323
-
An application of pretrained deep neural networks to large vocabulary conversational speech recognition
-
Department of Computer Science, University of Toronto
-
N. Jaitly, P. Nguyen, A. Senior, and V. Vanhoucke, "An application of pretrained deep neural networks to large vocabulary conversational speech recognition," Tech. Rep. 001, Department of Computer Science, University of Toronto, 2012.
-
(2012)
Tech. Rep. 001
-
-
Jaitly, N.1
Nguyen, P.2
Senior, A.3
Vanhoucke, V.4
-
6
-
-
84867754964
-
Improvements in using deep belief networks for large vocabulary continuous speech recognition
-
Speech and Language Algorithm Group, IBM, February 2011
-
T. N. Sainath, B. Kingsbury, and B. Ramabhadran, "Improvements in using deep belief networks for large vocabulary continuous speech recognition," Tech. Rep. UTML TR 2010-003, Speech and Language Algorithm Group, IBM, February 2011.
-
Tech. Rep. UTML TR 2010-003
-
-
Sainath, T.N.1
Kingsbury, B.2
Ramabhadran, B.3
-
7
-
-
84858972572
-
Making deep belief networks effective for large vocabulary continuous speech recognition
-
T. N. Sainath, B. Kingsbury, B. Ramabhadran, P. Fousek, P. Novak, A.-r. Mohamed, "Making deep belief networks effective for large vocabulary continuous speech recognition", in Proc. ASRU 2011, pp. 30-35.
-
(2011)
Proc. ASRU
, pp. 30-35
-
-
Sainath, T.N.1
Kingsbury, B.2
Ramabhadran, B.3
Fousek, P.4
Novak, P.5
Mohamed, A.-R.6
-
8
-
-
44049108531
-
Automated directory assistance system-from theory to practice
-
D. Yu, Y. C. Ju, Y. Y. Wang, G. Zweig, and A. Acero, "Automated directory assistance system-from theory to practice," in Proc. Interspeech, 2007, pp. 2709-2711.
-
(2007)
Proc. Interspeech
, pp. 2709-2711
-
-
Yu, D.1
Ju, Y.C.2
Wang, Y.Y.3
Zweig, G.4
Acero, A.5
-
9
-
-
85079086476
-
Sources of degradation of speech recognition in the telephone network
-
Adelaide, Australia Apr
-
P. Moreno and R. M. Stern, "Sources of degradation of speech recognition in the telephone network," in Proc. ICASSP, Adelaide, Australia, vol. I, pp.109-112, Apr. 1994.
-
(1994)
Proc. ICASSP
, vol.1
, pp. 109-112
-
-
Moreno, P.1
Stern, R.M.2
-
11
-
-
64149084747
-
Training wideband acoustic models using mixed-bandwidth training data for speech recognition
-
M. L. Seltzer and A. Acero, "Training wideband acoustic models using mixed-bandwidth training data for speech recognition", IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 1, pp. 235-245, 2007.
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, Issue.1
, pp. 235-245
-
-
Seltzer, M.L.1
Acero, A.2
-
12
-
-
33745199156
-
Robust bandwidth extension of noise-corrupted narrowband speech
-
M. L. Seltzer, A. Acero, and J. Droppo, "Robust bandwidth extension of noise-corrupted narrowband speech," in Proc. Interspeech, pp. 1509-1512, 2005.
-
(2005)
Proc. Interspeech
, pp. 1509-1512
-
-
Seltzer, M.L.1
Acero, A.2
Droppo, J.3
-
13
-
-
0028517647
-
Statistical recovery of wideband speech from narrowband speech
-
Oct
-
Y. M. Cheng, D. O'Shaughnessy, and P. Mermelstein, "Statistical recovery of wideband speech from narrowband speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 544-548, Oct. 1994.
-
(1994)
IEEE Trans. Speech Audio Process
, vol.2
, Issue.4
, pp. 544-548
-
-
Cheng, Y.M.1
O'Shaughnessy, D.2
Mermelstein, P.3
-
14
-
-
0033692729
-
Narrowband to wideband conversion of speech using GMM based transformation
-
Istanbul, Turkey Jun
-
K.-Y. Park and H. S. Kim, "Narrowband to wideband conversion of speech using GMM based transformation," in Proc. ICASSP, Istanbul, Turkey, Jun. 2000, vol. 3, pp. 1843-1846.
-
(2000)
Proc. ICASSP
, vol.3
, pp. 1843-1846
-
-
Park, K.-Y.1
Kim, H.S.2
-
15
-
-
84951992170
-
Wideband extension of telephone speech using a hidden Markov model
-
Delavan, WI, Sep
-
P. Jax and P. Vary, "Wideband extension of telephone speech using a hidden Markov model," in IEEE Workshop on Speech Coding, Delavan, WI, Sep. 2000, pp. 133-135.
-
(2000)
IEEE Workshop on Speech Coding
, pp. 133-135
-
-
Jax, P.1
Vary, P.2
-
17
-
-
84867585919
-
Understanding how deep belief networks perform acoustic modelling
-
A. Mohamed, G. Hinton, and G. Penn, "Understanding how deep belief networks perform acoustic modelling", in Proc. ICASSP, pp. 4273-4276, 2012.
-
(2012)
Proc. ICASSP
, pp. 4273-4276
-
-
Mohamed, A.1
Hinton, G.2
Penn, G.3
-
18
-
-
33646788786
-
fMPE: Discriminatively trained features for speech recognition
-
D. Povey, B. Kingsbury, L. Mangu, G. Saon, H. Soltau, and G. Zweig, "fMPE: discriminatively trained features for speech recognition," in Proc. ICASSP, 2005.
-
(2005)
Proc. ICASSP
-
-
Povey, D.1
Kingsbury, B.2
Mangu, L.3
Saon, G.4
Soltau, H.5
Zweig, G.6
-
19
-
-
51449120120
-
Boosted MMI for model and feature space discriminative training
-
D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "Boosted MMI for model and feature space discriminative training," in Proc. ICASSP, 2008.
-
(2008)
Proc. ICASSP
-
-
Povey, D.1
Kanevsky, D.2
Kingsbury, B.3
Ramabhadran, B.4
Saon, G.5
Visweswariah, K.6
-
20
-
-
79959831132
-
Investigation of full-sequence training of deep belief networks for speech recognition
-
A. Mohamed, D. Yu, and L. Deng, "Investigation of full-sequence training of deep belief networks for speech recognition," in Proc. Interspeech 2010, pp. 1692-1695.
-
(2010)
Proc. Interspeech
, pp. 1692-1695
-
-
Mohamed, A.1
Yu, D.2
Deng, L.3
-
21
-
-
70349213445
-
Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
-
B. Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling," in Proc. ICASSP 2009, pp. 3761-3764.
-
(2009)
Proc. ICASSP
, pp. 3761-3764
-
-
Kingsbury, B.1
-
22
-
-
80051623709
-
Joint encoding of the waveform and speech recognition features using a transform codec
-
May
-
X. Fan, M. Seltzer, J. Droppo, H. Malvar, and A. Acero, "Joint encoding of the waveform and speech recognition features using a transform codec," in Proc. ICASSP, pp.5148-5151, May 2011.
-
(2011)
Proc. ICASSP
, pp. 5148-5151
-
-
Fan, X.1
Seltzer, M.2
Droppo, J.3
Malvar, H.4
Acero, A.5