-
2
-
-
34047266376
-
Advances in speech transcription at IBM under the DARPA EARS program
-
S. Chen, B. Kingsbury, L. Mangu, D. Povey, G. Saon, H. Soltau, and G. Zweig, "Advances in speech transcription at IBM under the DARPA EARS program," IEEE Trans. on Audio, Speech and Language Processing, vol. 14, no. 5, pp. 1596-1608, 2006
-
(2006)
IEEE Trans. on Audio, Speech and Language Processing
, vol.14
, Issue.5
, pp. 1596-1608
-
-
Chen, S.1
Kingsbury, B.2
Mangu, L.3
Povey, D.4
Saon, G.5
Soltau, H.6
Zweig, G.7
-
3
-
-
77949347726
-
Dynamic network decoding revisited
-
H. Soltau and G. Saon, "Dynamic network decoding revisited," in Proc. ASRU, 2009
-
(2009)
Proc. ASRU
-
-
Soltau, H.1
Saon, G.2
-
4
-
-
84865265602
-
Hidden Markov acoustic modeling with bootstrap and restructuring for low-resourced languages
-
X. Cui, J. Xue, X. Chen, P. A. Olsen, P. L. Dognin, U. V. Chaudhari, J. R. Hershey, and B. Zhou, "Hidden Markov acoustic modeling with bootstrap and restructuring for low-resourced languages," IEEE Trans. on Audio, Speech and Language Processing, vol. 20, no. 8, pp. 2252-2264, 2012
-
(2012)
IEEE Trans. on Audio, Speech and Language Processing
, vol.20
, Issue.8
, pp. 2252-2264
-
-
Cui, X.1
Xue, J.2
Chen, X.3
Olsen, P.A.4
Dognin, P.L.5
Chaudhari, U.V.6
Hershey, J.R.7
Zhou, B.8
-
5
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU, 2011
-
(2011)
Proc. ASRU
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
6
-
-
0038338085
-
A continuous speech recognition system embedding MLP into HMM
-
H. Bourlard and N. Morgan, "A continuous speech recognition system embedding MLP into HMM," in Advances in Neural Information Processing Systems 2, D. S. Touretzky, Ed., 1990, pp. 186-193
-
(1990)
Advances in Neural Information Processing Systems 2
, pp. 186-193
-
-
Bourlard, H.1
Morgan, N.2
-
7
-
-
84878379108
-
Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
-
B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in Proc. Interspeech, 2012
-
(2012)
Proc. Interspeech
-
-
Kingsbury, B.1
Sainath, T.N.2
Soltau, H.3
-
9
-
-
84858972572
-
Making deep belief networks effective for large vocabulary continuous speech recognition
-
T. N. Sainath, B. Kingsbury, B. Ramabhadran, P. Fousek, P. Novák, and A. Mohamed, "Making deep belief networks effective for large vocabulary continuous speech recognition," in ASRU, 2011, pp. 30-35
-
(2011)
ASRU
, pp. 30-35
-
-
Sainath, T.N.1
Kingsbury, B.2
Ramabhadran, B.3
Fousek, P.4
Novák, P.5
Mohamed, A.6
-
11
-
-
34547548235
-
Probabilistic and bottleneck features for LVCSR of meetings
-
F. Grezl, M. Karafiat, S. Kontar, and J. Cernocky, "Probabilistic and bottleneck features for LVCSR of meetings," in Proc. ICASSP, 2007
-
(2007)
Proc. ICASSP
-
-
Grezl, F.1
Karafiat, M.2
Kontar, S.3
Cernocky, J.4
-
13
-
-
84878422162
-
Large scale hierarchical neural network language models
-
H. Kuo, E. Arisoy, A. Emami, and P. Vozila, "Large scale hierarchical neural network language models," in Proc. Interspeech, 2012
-
(2012)
Proc. Interspeech
-
-
Kuo, H.1
Arisoy, E.2
Emami, A.3
Vozila, P.4
-
14
-
-
77949349100
-
Scaling shrinkage-based language models
-
S. F. Chen, L. Mangu, B. Ramabhadran, R. Sarikaya, and A. Sethy, "Scaling shrinkage-based language models," in Proceedings of ASRU, 2009
-
(2009)
Proceedings of ASRU
-
-
Chen, S.F.1
Mangu, L.2
Ramabhadran, B.3
Sarikaya, R.4
Sethy, A.5
-
15
-
-
79951634009
-
Results of the 2006 spoken term detection evaluation
-
J. G. Fiscus, J. G. Ajot, J. Garofalo, and G. Doddington, "Results of the 2006 spoken term detection evaluation," in Proc. SIGIR Workshop on Searching Spontaneous Conversational Speech, 2007, pp. 51-57
-
(2007)
Proc. SIGIR Workshop on Searching Spontaneous Conversational Speech
, pp. 51-57
-
-
Fiscus, J.G.1
Ajot, J.G.2
Garofalo, J.3
Doddington, G.4
-
16
-
-
70349211775
-
Effect of pronunciations on OOV queries in spoken term detection
-
Dogan Can, Erica Cooper, Abhinav Sethy, Chris White, Bhuvana Ramabhadran, and Murat Saraclar, "Effect of pronunciations on OOV queries in spoken term detection," Proceedings of ICASSP, 2009
-
(2009)
Proceedings of ICASSP
-
-
Can, D.1
Cooper, E.2
Sethy, A.3
White, C.4
Ramabhadran, B.5
Saraclar, M.6
-
17
-
-
85050187568
-
Lattice-based search for spoken utterance retrieval
-
Murat Saraclar and Richard W. Sproat, "Lattice-based search for spoken utterance retrieval," in HLT-NAACL, 2004
-
(2004)
HLT-NAACL
-
-
Saraclar, M.1
Sproat, R.W.2
-
19
-
-
77949407432
-
Query-by-example spoken term detection for OOV terms
-
C. Parada, A. Sethy, and B. Ramabhadran, "Query-by-example spoken term detection for OOV terms," in ASRU, 2009
-
(2009)
ASRU
-
-
Parada, C.1
Sethy, A.2
Ramabhadran, B.3
-
20
-
-
84890542302
-
Exploiting diversity for spoken term detection
-
L. Mangu, H. Soltau, H.-K. Kuo, B. Kingsbury, and G. Saon, "Exploiting diversity for spoken term detection," in Proc. ICASSP, 2013. To appear
-
(2013)
Proc. ICASSP
-
-
Mangu, L.1
Soltau, H.2
Kuo, H.-K.3
Kingsbury, B.4
Saon, G.5
-
21
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Processing Magazine, 2012
-
(2012)
IEEE Signal Processing Magazine
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
22
-
-
84890489531
-
System combination and score normalization for spoken term detection
-
J. Mamou, J. Cui, X. Cui, M. J. F. Gales, B. Kingsbury, K. Knill, L. Mangu, D. Nolden, M. Picheny, B. Ramabhadran, R. Schluter, A. Sethy, and P. C. Woodland, "System combination and score normalization for spoken term detection," in Proc. ICASSP, 2013. To appear.
-
(2013)
Proc. ICASSP
-
-
Mamou, J.1
Cui, J.2
Cui, X.3
Gales, M.J.F.4
Kingsbury, B.5
Knill, K.6
Mangu, L.7
Nolden, D.8
Picheny, M.9
Ramabhadran, B.10
Schluter, R.11
Sethy, A.12
Woodland, P.C.13