-
1
-
-
0030245363
-
From HMM's to segment models: A unified view of stochastic modeling for speech recognition
-
M. Ostendorf, V. V. Digalakis, and O. A. Kimball, "From HMM's to segment models: A unified view of stochastic modeling for speech recognition, " Speech and Audio Processing, IEEE Transactions on, vol. 4, no. 5, pp. 360-378, 1996.
-
(1996)
Speech and Audio Processing, IEEE Transactions on
, vol.4
, Issue.5
, pp. 360-378
-
-
Ostendorf, M.1
Digalakis, V.V.2
Kimball, O.A.3
-
2
-
-
0038359548
-
A probabilistic framework for segment-based speech recognition
-
J. R. Glass, "A probabilistic framework for segment-based speech recognition, " Computer Speech & Language, vol. 17, no. 2, pp. 137-152, 2003.
-
(2003)
Computer Speech & Language
, vol.17
, Issue.2
, pp. 137-152
-
-
Glass, J.R.1
-
4
-
-
45549086638
-
Template-based continuous speech recognition
-
M. De Wachter, M. Matton, K. Demuynck, P. Wambacq, R. Cools, and D. Van Compernolle, "Template-based continuous speech recognition, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 15, no. 4, pp. 1377-1390, 2007.
-
(2007)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.15
, Issue.4
, pp. 1377-1390
-
-
De Wachter, M.1
Matton, M.2
Demuynck, K.3
Wambacq, P.4
Cools, R.5
Van Compernolle, D.6
-
5
-
-
33947702666
-
Augmented statistical models for speech recognition
-
IEEE
-
M. Layton and M. Gales, "Augmented statistical models for speech recognition, " in Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, vol. 1. IEEE, 2006, pp. I-I.
-
(2006)
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
, vol.1
, pp. I-I
-
-
Layton, M.1
Gales, M.2
-
6
-
-
77949370075
-
A segmental CRF approach to large vocabulary continuous speech recognition
-
Merano, Italy, Dec
-
G. Zweig and P. Nguyen, "A segmental CRF approach to large vocabulary continuous speech recognition, " in Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU'09), Merano, Italy, Dec. 2009, pp. 152-157.
-
(2009)
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU'09)
, pp. 152-157
-
-
Zweig, G.1
Nguyen, P.2
-
7
-
-
77957744761
-
Structured log linear models for noise robust speech recognition
-
S. Zhang, A. Ragni, and M. Gales, "Structured log linear models for noise robust speech recognition, " Signal Processing Letters, IEEE, vol. 17, no. 11, pp. 945-948, 2010.
-
(2010)
Signal Processing Letters, IEEE
, vol.17
, Issue.11
, pp. 945-948
-
-
Zhang, S.1
Ragni, A.2
Gales, M.3
-
8
-
-
80051659716
-
Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop
-
Prague, Czech Republic, May
-
G. Zweig, P. Nguyen, D. Van Compernolle, K. Demuynck, L. Atlas, P. Clark, G. Sell, M. Wang, F. Sha, H. Hermansky, D. Karakos, A. Jansen, S. Thomas, G. S. V. S. Sivaram, S. Bowman, and J. Kao, "Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop, " in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'11), Prague, Czech Republic, May 2011, pp. 5044-5047.
-
(2011)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'11)
, pp. 5044-5047
-
-
Zweig, G.1
Nguyen, P.2
Van Compernolle, D.3
Demuynck, K.4
Atlas, L.5
Clark, P.6
Sell, G.7
Wang, M.8
Sha, F.9
Hermansky, H.10
Karakos, D.11
Jansen, A.12
Thomas, S.13
Sivaram, G.S.V.S.14
Bowman, S.15
Kao, J.16
-
9
-
-
84867598637
-
Classification and recognition with direct segment models
-
Kyoto, Japan, Mar
-
G. Zweig, "Classification and recognition with direct segment models, " in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'12), Kyoto, Japan, Mar. 2012, pp. 4161-4164.
-
(2012)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'12)
, pp. 4161-4164
-
-
Zweig, G.1
-
10
-
-
84878565391
-
Efficient segmental conditional random fields for phone recognition
-
Portland, OR, USA, Sep
-
Y. He and E. Fosler-Lussier, "Efficient segmental conditional random fields for phone recognition, " in Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech'12), Portland, OR, USA, Sep. 2012.
-
(2012)
Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech'12)
-
-
He, Y.1
Fosler-Lussier, E.2
-
11
-
-
84906282118
-
Deep segmental neural networks for speech recognition
-
O. Abdel-Hamid, L. Deng, D. Yu, and H. Jiang, "Deep segmental neural networks for speech recognition, " in INTERSPEECH, 2013.
-
(2013)
INTERSPEECH
-
-
Abdel-Hamid, O.1
Deng, L.2
Yu, D.3
Jiang, H.4
-
12
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. E. Dahl, A.-r. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath et al., "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
Signal Processing Magazine, IEEE
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
-
13
-
-
84890491198
-
Recent advances in deep learning for speech research at Microsoft
-
L. Deng, J. Li, J.-T. Huang, K. Yao, D. Yu, F. Seide, M. Seltzer, G. Zweig, X. He, J. Williams et al., "Recent advances in deep learning for speech research at Microsoft, " ICASSP 2013, 2013.
-
(2013)
ICASSP 2013
-
-
Deng, L.1
Li, J.2
Huang, J.-T.3
Yao, K.4
Yu, D.5
Seide, F.6
Seltzer, M.7
Zweig, G.8
He, X.9
Williams, J.10
-
15
-
-
84876691724
-
Conditional random fields in speech, audio, and language processing
-
E. Fosler-Lussier, Y. He, P. Jyothi, and R. Prabhavalkar, "Conditional random fields in speech, audio, and language processing, " Proceedings of the IEEE, vol. 101, no. 5, pp. 1054-1075, 2013.
-
(2013)
Proceedings of the IEEE
, vol.101
, Issue.5
, pp. 1054-1075
-
-
Fosler-Lussier, E.1
He, Y.2
Jyothi, P.3
Prabhavalkar, R.4
-
16
-
-
34047192804
-
Semi-Markov conditional random fields for information extraction
-
Vancouver, British Columbia, Canada, Dec
-
S. Sarawagi and W. W. Cohen, "Semi-Markov conditional random fields for information extraction, " in Advances in Neural Information Processing Systems (NIPS'04), Vancouver, British Columbia, Canada, Dec. 2004, pp. 1185-1192.
-
(2004)
Advances in Neural Information Processing Systems (NIPS'04)
, pp. 1185-1192
-
-
Sarawagi, S.1
Cohen, W.W.2
-
18
-
-
85075929453
-
Speech recognition with weighted finite-state transducers
-
Springer
-
M. Mohri, F. Pereira, and M. Riley, "Speech recognition with weighted finite-state transducers, " in Springer Handbook of Speech Processing. Springer, 2008, pp. 559-584.
-
(2008)
Springer Handbook of Speech Processing
, pp. 559-584
-
-
Mohri, M.1
Pereira, F.2
Riley, M.3
-
20
-
-
84858953642
-
The Kaldi speech recognition toolkit
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlicek, Y. Qian, P. Schwarz et al., "The Kaldi speech recognition toolkit, " in Proc. of ASRU, 2011, pp. 1-4.
-
(2011)
Proc. of ASRU
, pp. 1-4
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlicek, P.8
Qian, Y.9
Schwarz, P.10
-
21
-
-
70349213445
-
Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
-
B. Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, " in Proc. of ICASSP, 2009, pp. 3761-3764.
-
(2009)
Proc. of ICASSP
, pp. 3761-3764
-
-
Kingsbury, B.1
-
22
-
-
84906274730
-
Sequence-discriminative training of deep neural networks
-
K. Vesely, A. Ghoshal, L. Burget, and D. Povey, "Sequence-discriminative training of deep neural networks, " in INTERSPEECH, 2013.
-
(2013)
INTERSPEECH
-
-
Vesely, K.1
Ghoshal, A.2
Burget, L.3
Povey, D.4
-
23
-
-
80052250414
-
Adaptive subgradient methods for online learning and stochastic optimization
-
J. Duchi, E. Hazan, and Y. Singer, "Adaptive subgradient methods for online learning and stochastic optimization, " J. Mach. Learn. Res., vol. 12, pp. 2121-2159, 2011.
-
(2011)
J. Mach. Learn. Res.
, vol.12
, pp. 2121-2159
-
-
Duchi, J.1
Hazan, E.2
Singer, Y.3
-
24
-
-
85032750905
-
Discriminative learning in sequential pattern recognition
-
Sep
-
X. He, L. Deng, and W. Chou, "Discriminative learning in sequential pattern recognition, " IEEE Signal Processing Magazine, vol. 25, no. 5, pp. 14-36, Sep. 2008.
-
(2008)
IEEE Signal Processing Magazine
, vol.25
, Issue.5
, pp. 14-36
-
-
He, X.1
Deng, L.2
Chou, W.3
-
27
-
-
84872193462
-
Structured SVMs for automatic speech recognition
-
S.-X. Zhang and M. J. Gales, "Structured SVMs for automatic speech recognition, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 3, pp. 544-555, 2013.
-
(2013)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.21
, Issue.3
, pp. 544-555
-
-
Zhang, S.-X.1
Gales, M.J.2
-
29
-
-
0028392167
-
An application of recurrent nets to phone probability estimation
-
A. J. Robinson, "An application of recurrent nets to phone probability estimation, " Neural Networks, IEEE Transactions on, vol. 5, no. 2, pp. 298-305, 1994.
-
(1994)
Neural Networks, IEEE Transactions on
, vol.5
, Issue.2
, pp. 298-305
-
-
Robinson, A.J.1
-
30
-
-
84890543083
-
Speech recognition with deep recurrent neural networks
-
A. Graves, A.-r. Mohamed, and G. Hinton, "Speech recognition with deep recurrent neural networks, " in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 6645-6649.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
, pp. 6645-6649
-
-
Graves, A.1
Mohamed, A.-R.2
Hinton, G.3