SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2013, Pages 2355-2359

Discriminative training of acoustic models for system combination

(2) Tachioka, Yuuki a Watanabe, Shinji b

a MITSUBISHI ELECTRIC CORPORATION (Japan)

b MITSUBISHI ELECTRIC RESEARCH LABORATORIES (United States)

Author keywords

Boosting; Discriminative training; Margin training; MMI; System combination

Indexed keywords

COMPUTER APPLICATIONS; COMPUTER SIMULATION;

AUTOMATIC SPEECH RECOGNITION; BOOSTING; COMPLEMENTARY SYSTEMS; CONVENTIONAL SYSTEMS; DISCRIMINATIVE TRAINING; MMI; NOISY SPEECH RECOGNITION; SYSTEM COMBINATION;

SPEECH RECOGNITION;

EID: 84893695671 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (4)

References (30)

1
- 85032751593
- Research developments and directions in speech recognition and understanding part 1
- 5
- J. Baker, L. Deng, J. Glass, S. Khudanpur, C. Lee, N. Morgan, and D. O'Shaughnessy, "Research developments and directions in speech recognition and understanding part 1, " IEEE Signal Processing Magazine, vol. 26, pp. 75-80, 2009. 5.
- (2009) IEEE Signal Processing Magazine , vol.26 , pp. 75-80
- Baker, J.¹ Deng, L.² Glass, J.³ Khudanpur, S.⁴ Lee, C.⁵ Morgan, N.⁶ O'Shaughnessy, D.⁷

2
- 85032750905
- Discriminative learning in sequential pattern recognition
- 9
- X. He, L. Deng, and W. Chou, "Discriminative learning in sequential pattern recognition, " IEEE Signal Processing Magazine, vol. 25, pp. 14-36, 2008. 9.
- (2008) IEEE Signal Processing Magazine , vol.25 , pp. 14-36
- He, X.¹ Deng, L.² Chou, W.³

3
- 0022890536
- Maximum mutual information estimation of hidden Markov model parameters for speech recognition
- L. Bahl, P. Brown, P. de Souza, and R. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition, " in Proceedings ICASSP, vol. 11, pp. 49-52, 1986.
- (1986) Proceedings ICASSP , vol.11 , pp. 49-52
- Bahl, L.¹ Brown, P.² De Souza, P.³ Mercer, R.⁴

4
- 0036296863
- Minimum phone error and ismoothing for improved discriminative training
- D. Povey and P. Woodland, "Minimum phone error and Ismoothing for improved discriminative training, " in Proceedings ICASSP, vol. I, pp. 105-108, 2002.
- (2002) Proceedings ICASSP , vol.1 , pp. 105-108
- Povey, D.¹ Woodland, P.²

5
- 34547522070
- Discriminative training for large-vocabulary speech recognition using minimum classification error
- 1
- E. McDermott, T. J. Hazen, J. Le Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large-vocabulary speech recognition using minimum classification error, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, pp. 203-223, 2007. 1.
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , pp. 203-223
- McDermott, E.¹ Hazen, T.J.² Roux, J.L.³ Nakamura, A.⁴ Katagiri, S.⁵

6
- 78049409757
- Discriminative training based on an integrated view of MPE and MMI in margin and error space
- E. McDermott, S. Watanabe, and A. Nakamura, "Discriminative training based on an integrated view of MPE and MMI in margin and error space, " in Proceedings ICASSP, pp. 4894-4897, 2010.
- (2010) Proceedings ICASSP , pp. 4894-4897
- McDermott, E.¹ Watanabe, S.² Nakamura, A.³

7
- 85032751713
- Discriminative training for automatic speech recognition: Modeling, criteria, optimization, implementation, and performance
- 11
- G. Heigold, H. J. Ney, R. Schl̈uter, and S. Wiesler, "Discriminative training for automatic speech recognition: Modeling, criteria, optimization, implementation, and performance, " IEEE Signal Processing Magazine, vol. 29, pp. 58-69, 2012. 11.
- (2012) IEEE Signal Processing Magazine , vol.29 , pp. 58-69
- Heigold, G.¹ Ney, H.J.² Schl̈uter, R.³ Wiesler, S.⁴

8
- 85032751545
- Structured discriminative models for speech recognition: An overview
- 11
- M. Gales, S. Watanabe, and E. Fosler-Lussier, "Structured discriminative models for speech recognition: An overview, " IEEE Signal Processing Magazine, vol. 29, pp. 70-81, 2012. 11.
- (2012) IEEE Signal Processing Magazine , vol.29 , pp. 70-81
- Gales, M.¹ Watanabe, S.² Fosler-Lussier, E.³

9
- 0030638031
- A post-processing system to yield reduced error word rates: Recognizer output voting error reduction (rover)
- J. Fiscus, "A post-processing system to yield reduced error word rates: Recognizer output voting error reduction (ROVER), " in Proceedings ASRU, pp. 347-354, 1997.
- (1997) Proceedings ASRU , pp. 347-354
- Fiscus, J.¹

10
- 0141477960
- Posterior probability decoding, confidence estimation and system combination
- G. Evermann and P. Woodland, "Posterior probability decoding, confidence estimation and system combination, " in Proceedings NIST Speech Transcription Workshop, 2000.
- (2000) Proceedings NIST Speech Transcription Workshop
- Evermann, G.¹ Woodland, P.²

11
- 44849087216
- Frame based system combination and a comparison with weighted ROVER and CNC
- B. Hoffmeister, T. Klein, R. Schlüter, and H. Ney, "Frame based system combination and a comparison with weighted ROVER and CNC, " in Proceedings ICSLP, pp. 537-540, 2006.
- (2006) Proceedings ICSLP , pp. 537-540
- Hoffmeister, B.¹ Klein, T.² Schlüter, R.³ Ney, H.⁴

12
- 84866874722
- Strategies for model training and adaptation based on data dependency control
- T. Shinozaki and S. Furui, "Strategies for model training and adaptation based on data dependency control, " APSIPA Overview, 2011.
- (2011) APSIPA Overview
- Shinozaki, T.¹ Furui, S.²

13
- 33646818291
- Constructing ensembles of ASR systems using randomized decision trees
- O. Siohan, B. Ramabhadran, and B. Kingsbury, "Constructing ensembles of ASR systems using randomized decision trees, " in Proceedings ICASSP, pp. 197-200, 2005.
- (2005) Proceedings ICASSP , pp. 197-200
- Siohan, O.¹ Ramabhadran, B.² Kingsbury, B.³

14
- 34547541480
- Generating complementary systems for speech recognition
- C. Breslin and M. Gales, "Generating complementary systems for speech recognition, " in Proceedings ICASSP, pp. 337-340, 2007.
- (2007) Proceedings ICASSP , pp. 337-340
- Breslin, C.¹ Gales, M.²

15
- 78049386242
- Toward robust learning of the Gaussian mixture state emission densities for hidden Markov models
- H. Tang, M. Hasegawa-Johnson, and T. S. Huang, "Toward robust learning of the Gaussian mixture state emission densities for hidden Markov models, " in Proceedings ICASSP, pp. 5242-5245, 2010.
- (2010) Proceedings ICASSP , pp. 5242-5245
- Tang, H.¹ Hasegawa-Johnson, M.² Huang, T.S.³

16
- 79959815831
- Boosting systems for LVCSR
- G. Saon and H. Soltau, "Boosting systems for LVCSR, " in Proceedings INTERSPEECH, pp. 1341-1344, 2010.
- (2010) Proceedings INTERSPEECH , pp. 1341-1344
- Saon, G.¹ Soltau, H.²

17
- 0031211090
- A dicision-theoretic generalisation of online learning and an application to boosting
- 8
- Y. Freund and R. Schapire, "A dicision-theoretic generalisation of online learning and an application to boosting, " Journal of Computer and System Sciences, vol. 55, pp. 119-139, 1997. 8.
- (1997) Journal of Computer and System Sciences , vol.55 , pp. 119-139
- Freund, Y.¹ Schapire, R.²

18
- 0346870000
- Robust real-time object detection
- 7
- P. Viola and M. Jones, "Robust real-time object detection, " in Proceedings Second International Workshop on Statistical and Computational Theories of Vision - Modeling, Learning, Computing, and Sampling, pp. 1-25, 2001. 7.
- (2001) Proceedings Second International Workshop on Statistical and Computational Theories of Vision - Modeling, Learning, Computing, and Sampling , pp. 1-25
- Viola, P.¹ Jones, M.²

19
- 0034164230
- Additive logistic regression: A statistical view of boosting
- J. Friedman, T. Hestie, and R. Tibshirani, "Additive logistic regression: A statistical view of boosting, " Annals of Statistics, vol. 28, pp. 337-407, 2000.
- (2000) Annals of Statistics , vol.28 , pp. 337-407
- Friedman, J.¹ Hestie, T.² Tibshirani, R.³

20
- 51449120120
- Boosted MMI for model and feature-space discriminative training
- D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "Boosted MMI for model and feature-space discriminative training, " in Proceedings ICASSP, pp. 4057-4060, 2008.
- (2008) Proceedings ICASSP , pp. 4057-4060
- Povey, D.¹ Kanevsky, D.² Kingsbury, B.³ Ramabhadran, B.⁴ Saon, G.⁵ Visweswariah, K.⁶

21
- 0026372945
- An improved mmie training algorithm for speaker-independent, small vocabulary, continuous speech recognition
- Y. Normandin and S. D. Morgera, "An improved MMIE training algorithm for speaker-independent, small vocabulary, continuous speech recognition, " in Proceedings ICASSP, vol. 1, pp. 537-540, 1991.
- (1991) Proceedings ICASSP , vol.1 , pp. 537-540
- Normandin, Y.¹ Morgera, S.D.²

22
- 84890541701
- The second 'chime' speech separation and recognition challenge: Datasets, tasks and baselines
- E. Vincent, J. Barker, S. Watanabe, J. Le Roux, F. Nesta, and M. Matassoni, "The second 'CHiME' speech separation and recognition challenge: Datasets, tasks and baselines, " in Proceedings ICASSP, 2013.
- (2013) Proceedings ICASSP
- Vincent, E.¹ Barker, J.² Watanabe, S.³ Roux, J.L.⁴ Nesta, F.⁵ Matassoni, M.⁶

23
- 84858953642
- The kaldi speech recognition toolkit
- D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, M. Petr, Y. Qian, P. Schwarz, J. Silovský, G. Stemmer, and K. Veselý, "The Kaldi speech recognition toolkit, " in Proceedings ASRU, pp. 1-4, 2011.
- (2011) Proceedings ASRU , pp. 1-4
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Petr, M.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovský, J.¹¹ Stemmer, G.¹² Veselý, K.¹³

24
- 84893671946
- Discriminative methods for noise robust speech recognition: A CHiME challenge benchmark
- Y. Tachioka, S. Watanabe, J. Le Roux, and J. Hershey, "Discriminative methods for noise robust speech recognition: A CHiME challenge benchmark, " The 2nd International Workshop on Machine Listening in Multisource Environments, 2013.
- (2013) The 2nd International Workshop on Machine Listening in Multisource Environments
- Tachioka, Y.¹ Watanabe, S.² Roux, J.L.³ Hershey, J.⁴

25
- 84890503970
- Effectiveness of discriminative training and feature transformation for reverberated and noisy speech
- Y. Tachioka, S. Watanabe, and J. Hershey, "Effectiveness of discriminative training and feature transformation for reverberated and noisy speech, " in Proceedings ICASSP, 2013.
- (2013) Proceedings ICASSP
- Tachioka, Y.¹ Watanabe, S.² Hershey, J.³

26
- 85017287487
- Linear discriminant analysis for improved large vocabulary continuous speech recognition
- R. Haeb-Umbach and H. Ney, "Linear discriminant analysis for improved large vocabulary continuous speech recognition, " in Proceedings ICASSP, pp. 13-16, 1992.
- (1992) Proceedings ICASSP , pp. 13-16
- Haeb-Umbach, R.¹ Ney, H.²

27
- 84892187452
- Maximum likelihood modeling with gaussian distributions for classification
- R. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification, " in Proceedings ICASSP, pp. 661- 664, 1998.
- (1998) Proceedings ICASSP , pp. 661-664
- Gopinath, R.¹

28
- 0032638856
- Semi-tied covariance matrices for hidden Markov models
- 3
- M. Gales, "Semi-tied covariance matrices for hidden Markov models, " IEEE Transactions on Speech and Audio Processing, vol. 7, pp. 272-281, 1999. 3.
- (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , pp. 272-281
- Gales, M.¹

29
- 0030362995
- A compact model for speaker-adaptive training
- T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training, " in Proceedings ICSLP, pp. 1137-1140, 1996.
- (1996) Proceedings ICSLP , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

30
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition, " Computer Speech and Language, vol. 12, pp. 75-98, 1998.
- (1998) Computer Speech and Language , vol.12 , pp. 75-98
- Gales, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.