-
1
-
-
80051618754
-
A symmetrization of the subspace gaussian mixture model
-
May
-
D. Povey, M. Karafiat, A. Ghoshal, and P. Schwarz, "A symmetrization of the Subspace Gaussian Mixture Model, " in Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, May 2011, pp. 4504-4507.
-
(2011)
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
, pp. 4504-4507
-
-
Povey, D.1
Karafiat, M.2
Ghoshal, A.3
Schwarz, P.4
-
2
-
-
51449120120
-
Boosted MMI for model and feature-space discriminative training
-
March
-
D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "Boosted MMI for Model and Feature-space Discriminative Training, " in Acoustics, Speech and Signal Processing (ICASSP), 2008 IEEE International Conference on, March 2008, pp. 4057-4060.
-
(2008)
Acoustics, Speech and Signal Processing (ICASSP), 2008 IEEE International Conference on
, pp. 4057-4060
-
-
Povey, D.1
Kanevsky, D.2
Kingsbury, B.3
Ramabhadran, B.4
Saon, G.5
Visweswariah, K.6
-
3
-
-
84905239342
-
Improving deep neural network acoustic models using generalized maxout networks
-
Florence, Italy
-
X. Zhang, J. Trmal, D. Povey, and S. Khudanpur, "Improving Deep Neural Network Acoustic Models Using Generalized Maxout Networks, " in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, Florence, Italy, 2014.
-
(2014)
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
-
-
Zhang, X.1
Trmal, J.2
Povey, D.3
Khudanpur, S.4
-
4
-
-
80052042597
-
Lattice indexing for spoken term detection
-
Nov
-
D. Can and M. Saraclar, "Lattice Indexing for Spoken Term Detection, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 8, pp. 2338-2347, Nov 2011.
-
(2011)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.19
, Issue.8
, pp. 2338-2347
-
-
Can, D.1
Saraclar, M.2
-
5
-
-
84867616340
-
Generating exact lattices in the wfst framework
-
March
-
D. Povey, M. Hannemann, G. Boulianne, L. Burget, A. Ghoshal, M. Janda, M. Karafiat, S. Kombrink, P. Motlicek, Yanmin Qian, K. Riedhammer, K. Vesely, and Ngoc Thang Vu, "Generating Exact Lattices in the WFST framework, " in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, March 2012, pp. 4213-4216.
-
(2012)
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
, pp. 4213-4216
-
-
Povey, D.1
Hannemann, M.2
Boulianne, G.3
Burget, L.4
Ghoshal, A.5
Janda, M.6
Karafiat, M.7
Kombrink, S.8
Motlicek, P.9
Qian, Y.10
Riedhammer, K.11
Vesely, K.12
Thang Vu, N.13
-
6
-
-
84893698649
-
Using proxies for OOV keywords in the keyword search task
-
Dec
-
G. Chen, O. Yilmaz, J. Trmal, D. Povey, and S. Khudanpur, "Using Proxies for OOV Keywords in the Keyword Search Task, " in Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on, Dec 2013, pp. 416-421.
-
(2013)
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
, pp. 416-421
-
-
Chen, G.1
Yilmaz, O.2
Trmal, J.3
Povey, D.4
Khudanpur, S.5
-
7
-
-
69249145662
-
Point process models for spotting keywords in continuous speech
-
Nov
-
A. Jansen and P. Niyogi, "Point Process Models for Spotting Keywords in Continuous Speech, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 17, no. 8, pp. 1457-1470, Nov 2009.
-
(2009)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.17
, Issue.8
, pp. 1457-1470
-
-
Jansen, A.1
Niyogi, P.2
-
8
-
-
84905252790
-
A pitch extraction algorithm tuned for automatic speech recognition
-
P. Ghahremani, B. BabaAli, D. Povey, K. Riedhammer, J. Trmal, and S. Khudanpur, "A Pitch Extraction Algorithm Tuned for Automatic Speech Recognition, " Proceeding of Int. Conf. ICASSP 2014, 2014.
-
(2014)
Proceeding of Int. Conf. ICASSP 2014
-
-
Ghahremani, P.1
Babaali, B.2
Povey, D.3
Riedhammer, K.4
Trmal, J.5
Khudanpur, S.6
-
9
-
-
84893650076
-
Semisupervised training of deep neural networks
-
IEEE Signal Processing Society
-
K. Veselý, M. Hannemann, and L. Burget, "Semisupervised Training of Deep Neural Networks, " in Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on. 2013, pp. 267-272, IEEE Signal Processing Society.
-
(2013)
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
, pp. 267-272
-
-
Veselý, K.1
Hannemann, M.2
Burget, L.3
-
10
-
-
41049105254
-
Joint-sequence models for grapheme-to-phoneme conversion
-
M. Bisani and H. Ney, "Joint-sequence Models for Grapheme-to-phoneme Conversion, " Speech Communication, vol. 50, no. 5, pp. 434-451, 2008.
-
(2008)
Speech Communication
, vol.50
, Issue.5
, pp. 434-451
-
-
Bisani, M.1
Ney, H.2
-
11
-
-
85022919385
-
Class-Based N-Gram models of natural language
-
Dec
-
P. F. Brown, P. V. deSouza, R. L. Mercer, V. J. Della Pietra, and J. C. Lai, "Class-based N-gram Models of Natural Language, " Comput. Linguist., vol. 18, no. 4, pp. 467-479, Dec. 1992.
-
(1992)
Comput. Linguist.
, vol.18
, Issue.4
, pp. 467-479
-
-
Brown, P.F.1
Desouza, P.V.2
Mercer, R.L.3
Della Pietra, V.J.4
Lai, J.C.5
-
12
-
-
84905234179
-
Featherweight phonetic keyword search for conversational speech
-
K. Kintzley, A. Jansen, and H. Hermansky, "Featherweight Phonetic Keyword Search for Conversational Speech, " in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, 2014.
-
(2014)
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
-
-
Kintzley, K.1
Jansen, A.2
Hermansky, H.3
-
13
-
-
79953250475
-
Minimum bayes risk decoding and system combination based on a recursion for edit distance
-
H. Xu, D. Povey, L. Mangu, and J. Zhu, "Minimum Bayes Risk Decoding and System Combination Based on a Recursion for Edit Distance, " Computer Speech & Language, vol. 25, no. 4, pp. 802-828, 2011.
-
(2011)
Computer Speech & Language
, vol.25
, Issue.4
, pp. 802-828
-
-
Xu, H.1
Povey, D.2
Mangu, L.3
Zhu, J.4
-
14
-
-
4544253834
-
Posterior probability decoding, confidence estimation and system combination
-
Baltimore
-
G. Evermann and P. C. Woodland, "Posterior Probability Decoding, Confidence Estimation and System Combination, " in Proc. Speech Transcription Workshop. Baltimore, 2000, vol. 27.
-
(2000)
Proc. Speech Transcription Workshop
, vol.27
-
-
Evermann, G.1
Woodland, P.C.2