-
1
-
-
85032751593
-
Research developments and directions in speech recognition and understanding part 1
-
5
-
J. Baker, L. Deng, J. Glass, S. Khudanpur, C. Lee, N. Morgan, and D. O'Shaughnessy, "Research developments and directions in speech recognition and understanding part 1, " IEEE Signal Processing Magazine, vol. 26, pp. 75-80, 2009. 5.
-
(2009)
IEEE Signal Processing Magazine
, vol.26
, pp. 75-80
-
-
Baker, J.1
Deng, L.2
Glass, J.3
Khudanpur, S.4
Lee, C.5
Morgan, N.6
O'Shaughnessy, D.7
-
2
-
-
85032750905
-
Discriminative learning in sequential pattern recognition
-
9
-
X. He, L. Deng, and W. Chou, "Discriminative learning in sequential pattern recognition, " IEEE Signal Processing Magazine, vol. 25, pp. 14-36, 2008. 9.
-
(2008)
IEEE Signal Processing Magazine
, vol.25
, pp. 14-36
-
-
He, X.1
Deng, L.2
Chou, W.3
-
3
-
-
0022890536
-
Maximum mutual information estimation of hidden Markov model parameters for speech recognition
-
L. Bahl, P. Brown, P. de Souza, and R. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition, " in Proceedings ICASSP, vol. 11, pp. 49-52, 1986.
-
(1986)
Proceedings ICASSP
, vol.11
, pp. 49-52
-
-
Bahl, L.1
Brown, P.2
De Souza, P.3
Mercer, R.4
-
4
-
-
0036296863
-
Minimum phone error and ismoothing for improved discriminative training
-
D. Povey and P. Woodland, "Minimum phone error and Ismoothing for improved discriminative training, " in Proceedings ICASSP, vol. I, pp. 105-108, 2002.
-
(2002)
Proceedings ICASSP
, vol.1
, pp. 105-108
-
-
Povey, D.1
Woodland, P.2
-
5
-
-
34547522070
-
Discriminative training for large-vocabulary speech recognition using minimum classification error
-
1
-
E. McDermott, T. J. Hazen, J. Le Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large-vocabulary speech recognition using minimum classification error, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, pp. 203-223, 2007. 1.
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, pp. 203-223
-
-
McDermott, E.1
Hazen, T.J.2
Roux, J.L.3
Nakamura, A.4
Katagiri, S.5
-
6
-
-
78049409757
-
Discriminative training based on an integrated view of MPE and MMI in margin and error space
-
E. McDermott, S. Watanabe, and A. Nakamura, "Discriminative training based on an integrated view of MPE and MMI in margin and error space, " in Proceedings ICASSP, pp. 4894-4897, 2010.
-
(2010)
Proceedings ICASSP
, pp. 4894-4897
-
-
McDermott, E.1
Watanabe, S.2
Nakamura, A.3
-
7
-
-
85032751713
-
Discriminative training for automatic speech recognition: Modeling, criteria, optimization, implementation, and performance
-
11
-
G. Heigold, H. J. Ney, R. Schl̈uter, and S. Wiesler, "Discriminative training for automatic speech recognition: Modeling, criteria, optimization, implementation, and performance, " IEEE Signal Processing Magazine, vol. 29, pp. 58-69, 2012. 11.
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, pp. 58-69
-
-
Heigold, G.1
Ney, H.J.2
Schl̈uter, R.3
Wiesler, S.4
-
8
-
-
85032751545
-
Structured discriminative models for speech recognition: An overview
-
11
-
M. Gales, S. Watanabe, and E. Fosler-Lussier, "Structured discriminative models for speech recognition: An overview, " IEEE Signal Processing Magazine, vol. 29, pp. 70-81, 2012. 11.
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, pp. 70-81
-
-
Gales, M.1
Watanabe, S.2
Fosler-Lussier, E.3
-
9
-
-
0030638031
-
A post-processing system to yield reduced error word rates: Recognizer output voting error reduction (rover)
-
J. Fiscus, "A post-processing system to yield reduced error word rates: Recognizer output voting error reduction (ROVER), " in Proceedings ASRU, pp. 347-354, 1997.
-
(1997)
Proceedings ASRU
, pp. 347-354
-
-
Fiscus, J.1
-
11
-
-
44849087216
-
Frame based system combination and a comparison with weighted ROVER and CNC
-
B. Hoffmeister, T. Klein, R. Schlüter, and H. Ney, "Frame based system combination and a comparison with weighted ROVER and CNC, " in Proceedings ICSLP, pp. 537-540, 2006.
-
(2006)
Proceedings ICSLP
, pp. 537-540
-
-
Hoffmeister, B.1
Klein, T.2
Schlüter, R.3
Ney, H.4
-
12
-
-
84866874722
-
Strategies for model training and adaptation based on data dependency control
-
T. Shinozaki and S. Furui, "Strategies for model training and adaptation based on data dependency control, " APSIPA Overview, 2011.
-
(2011)
APSIPA Overview
-
-
Shinozaki, T.1
Furui, S.2
-
13
-
-
33646818291
-
Constructing ensembles of ASR systems using randomized decision trees
-
O. Siohan, B. Ramabhadran, and B. Kingsbury, "Constructing ensembles of ASR systems using randomized decision trees, " in Proceedings ICASSP, pp. 197-200, 2005.
-
(2005)
Proceedings ICASSP
, pp. 197-200
-
-
Siohan, O.1
Ramabhadran, B.2
Kingsbury, B.3
-
14
-
-
34547541480
-
Generating complementary systems for speech recognition
-
C. Breslin and M. Gales, "Generating complementary systems for speech recognition, " in Proceedings ICASSP, pp. 337-340, 2007.
-
(2007)
Proceedings ICASSP
, pp. 337-340
-
-
Breslin, C.1
Gales, M.2
-
15
-
-
78049386242
-
Toward robust learning of the Gaussian mixture state emission densities for hidden Markov models
-
H. Tang, M. Hasegawa-Johnson, and T. S. Huang, "Toward robust learning of the Gaussian mixture state emission densities for hidden Markov models, " in Proceedings ICASSP, pp. 5242-5245, 2010.
-
(2010)
Proceedings ICASSP
, pp. 5242-5245
-
-
Tang, H.1
Hasegawa-Johnson, M.2
Huang, T.S.3
-
17
-
-
0031211090
-
A dicision-theoretic generalisation of online learning and an application to boosting
-
8
-
Y. Freund and R. Schapire, "A dicision-theoretic generalisation of online learning and an application to boosting, " Journal of Computer and System Sciences, vol. 55, pp. 119-139, 1997. 8.
-
(1997)
Journal of Computer and System Sciences
, vol.55
, pp. 119-139
-
-
Freund, Y.1
Schapire, R.2
-
18
-
-
0346870000
-
Robust real-time object detection
-
7
-
P. Viola and M. Jones, "Robust real-time object detection, " in Proceedings Second International Workshop on Statistical and Computational Theories of Vision - Modeling, Learning, Computing, and Sampling, pp. 1-25, 2001. 7.
-
(2001)
Proceedings Second International Workshop on Statistical and Computational Theories of Vision - Modeling, Learning, Computing, and Sampling
, pp. 1-25
-
-
Viola, P.1
Jones, M.2
-
19
-
-
0034164230
-
Additive logistic regression: A statistical view of boosting
-
J. Friedman, T. Hestie, and R. Tibshirani, "Additive logistic regression: A statistical view of boosting, " Annals of Statistics, vol. 28, pp. 337-407, 2000.
-
(2000)
Annals of Statistics
, vol.28
, pp. 337-407
-
-
Friedman, J.1
Hestie, T.2
Tibshirani, R.3
-
20
-
-
51449120120
-
Boosted MMI for model and feature-space discriminative training
-
D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "Boosted MMI for model and feature-space discriminative training, " in Proceedings ICASSP, pp. 4057-4060, 2008.
-
(2008)
Proceedings ICASSP
, pp. 4057-4060
-
-
Povey, D.1
Kanevsky, D.2
Kingsbury, B.3
Ramabhadran, B.4
Saon, G.5
Visweswariah, K.6
-
21
-
-
0026372945
-
An improved mmie training algorithm for speaker-independent, small vocabulary, continuous speech recognition
-
Y. Normandin and S. D. Morgera, "An improved MMIE training algorithm for speaker-independent, small vocabulary, continuous speech recognition, " in Proceedings ICASSP, vol. 1, pp. 537-540, 1991.
-
(1991)
Proceedings ICASSP
, vol.1
, pp. 537-540
-
-
Normandin, Y.1
Morgera, S.D.2
-
22
-
-
84890541701
-
The second 'chime' speech separation and recognition challenge: Datasets, tasks and baselines
-
E. Vincent, J. Barker, S. Watanabe, J. Le Roux, F. Nesta, and M. Matassoni, "The second 'CHiME' speech separation and recognition challenge: Datasets, tasks and baselines, " in Proceedings ICASSP, 2013.
-
(2013)
Proceedings ICASSP
-
-
Vincent, E.1
Barker, J.2
Watanabe, S.3
Roux, J.L.4
Nesta, F.5
Matassoni, M.6
-
23
-
-
84858953642
-
The kaldi speech recognition toolkit
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, M. Petr, Y. Qian, P. Schwarz, J. Silovský, G. Stemmer, and K. Veselý, "The Kaldi speech recognition toolkit, " in Proceedings ASRU, pp. 1-4, 2011.
-
(2011)
Proceedings ASRU
, pp. 1-4
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Petr, M.8
Qian, Y.9
Schwarz, P.10
Silovský, J.11
Stemmer, G.12
Veselý, K.13
-
24
-
-
84893671946
-
Discriminative methods for noise robust speech recognition: A CHiME challenge benchmark
-
Y. Tachioka, S. Watanabe, J. Le Roux, and J. Hershey, "Discriminative methods for noise robust speech recognition: A CHiME challenge benchmark, " The 2nd International Workshop on Machine Listening in Multisource Environments, 2013.
-
(2013)
The 2nd International Workshop on Machine Listening in Multisource Environments
-
-
Tachioka, Y.1
Watanabe, S.2
Roux, J.L.3
Hershey, J.4
-
25
-
-
84890503970
-
Effectiveness of discriminative training and feature transformation for reverberated and noisy speech
-
Y. Tachioka, S. Watanabe, and J. Hershey, "Effectiveness of discriminative training and feature transformation for reverberated and noisy speech, " in Proceedings ICASSP, 2013.
-
(2013)
Proceedings ICASSP
-
-
Tachioka, Y.1
Watanabe, S.2
Hershey, J.3
-
26
-
-
85017287487
-
Linear discriminant analysis for improved large vocabulary continuous speech recognition
-
R. Haeb-Umbach and H. Ney, "Linear discriminant analysis for improved large vocabulary continuous speech recognition, " in Proceedings ICASSP, pp. 13-16, 1992.
-
(1992)
Proceedings ICASSP
, pp. 13-16
-
-
Haeb-Umbach, R.1
Ney, H.2
-
27
-
-
84892187452
-
Maximum likelihood modeling with gaussian distributions for classification
-
R. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification, " in Proceedings ICASSP, pp. 661- 664, 1998.
-
(1998)
Proceedings ICASSP
, pp. 661-664
-
-
Gopinath, R.1
-
28
-
-
0032638856
-
Semi-tied covariance matrices for hidden Markov models
-
3
-
M. Gales, "Semi-tied covariance matrices for hidden Markov models, " IEEE Transactions on Speech and Audio Processing, vol. 7, pp. 272-281, 1999. 3.
-
(1999)
IEEE Transactions on Speech and Audio Processing
, vol.7
, pp. 272-281
-
-
Gales, M.1
-
29
-
-
0030362995
-
A compact model for speaker-adaptive training
-
T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training, " in Proceedings ICSLP, pp. 1137-1140, 1996.
-
(1996)
Proceedings ICSLP
, pp. 1137-1140
-
-
Anastasakos, T.1
McDonough, J.2
Schwartz, R.3
Makhoul, J.4
-
30
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition, " Computer Speech and Language, vol. 12, pp. 75-98, 1998.
-
(1998)
Computer Speech and Language
, vol.12
, pp. 75-98
-
-
Gales, M.1
|