Volume 21, Issue 3, 2013, Pages 498-507

Building acoustic model ensembles by data sampling with enhanced trainings and features

Author keywords

cross validation data sampling; discriminative training; Ensemble acoustic model; MLP feature; speaker clustering data sampling

Indexed keywords

ACOUSTIC MODEL; CROSS VALIDATION; DISCRIMINATIVE TRAINING; MLP FEATURE; SPEAKER CLUSTERING;

EID: 84872174281     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2227729     Document Type: Article
Times cited : (14)

References (32)
  • 1
    • 80053403826
    • Ensemble methods in machine learning
    • T. G. Dietterich, "Ensemble methods in machine learning," in Proc. MCS, 2000, pp. 1-15.
    • (2000) Proc. MCS , pp. 1-15
    • Dietterich, T.G.1
  • 2
    • 0035478854
    • Random forests
    • DOI 10.1023/A:1010933404324
    • L. Breiman, "Random forests," Mach. Learn., vol. 45, pp. 5-32, 2001. (Pubitemid 32933532)
    • (2001) Machine Learning , vol.45 , Issue.1 , pp. 5-32
    • Breiman, L.1
  • 3
    • 0030638031
    • A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
    • J. G. Fiscus, "A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)," in Proc. IEEE ASRU Workshop, 1997, pp. 347-352.
    • (1997) Proc. IEEE ASRU Workshop , pp. 347-352
    • Fiscus, J.G.1
  • 4
    • 85009080958
    • Spontaneous speech recognition using a massively parallel decoder
    • T. Shinozaki and S. Furui, "Spontaneous speech recognition using a massively parallel decoder," in Proc. ICSLP, 2004, pp. 1705-1708.
    • (2004) Proc. ICSLP , pp. 1705-1708
    • Shinozaki, T.1    Furui, S.2
  • 5
    • 33646818291
    • Constructing ensembles of ASR systems using randomized decision trees
    • O. Siohan, B. Ramabhadran, and B. Kingsbury, "Constructing ensembles of ASR systems using randomized decision trees," in Proc. ICASSP, 2005, pp. I-197-I-200.
    • (2005) Proc. ICASSP
    • Siohan, O.1    Ramabhadran, B.2    Kingsbury, B.3
  • 8
    • 79959833868
    • Building multiple complementary systems using directed decision tree
    • C. Breslin and M. J. F. Gales, "Building multiple complementary systems using directed decision tree," in Proc. Interspeech, 2007, pp. 1441-1444.
    • (2007) Proc. Interspeech , pp. 1441-1444
    • Breslin, C.1    Gales, M.J.F.2
  • 9
    • 33645989784
    • Boosting HMM acoustic models in large vocabulary speech recognition
    • C. Meyer and H. Schramm, "Boosting HMM acoustic models in large vocabulary speech recognition," Speech Commun., vol. 48, pp. 532-548, 2006.
    • (2006) Speech Commun , vol.48 , pp. 532-548
    • Meyer, C.1    Schramm, H.2
  • 10
    • 51549086717
    • Random forests of phonetic decision trees for acoustic modeling in conversational speech recognition
    • Mar
    • J. Xue and Y. Zhao, "Random forests of phonetic decision trees for acoustic modeling in conversational speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 3, pp. 519-528, Mar. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process , vol.16 , Issue.3 , pp. 519-528
    • Xue, J.1    Zhao, Y.2
  • 11
    • 70450191324
    • A study of bootstrapping with multiple acoustic features for improved automatic speech recognition
    • X. Cui, J. Xue, B. Xiang, and B. Zhou, "A study of bootstrapping with multiple acoustic features for improved automatic speech recognition," in Proc. Interspeech, 2009, pp. 240-243.
    • (2009) Proc. Interspeech , pp. 240-243
    • Cui, X.1    Xue, J.2    Xiang, B.3    Zhou, B.4
  • 12
    • 0034227757
    • Cluster adaptive training of hidden Markov models
    • Jul
    • M. J. F. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 417-428, Jul. 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.4 , pp. 417-428
    • Gales, M.J.F.1
  • 13
    • 0026982122
    • Discriminative learning for minimum error classification
    • DOI 10.1109/78.175747
    • B.-H. Juang and S. Katagiri, "Discriminative learning for minimum error classification," IEEE Trans. Signal Process., vol. 40, no. 12, pp. 3043-3054, Dec. 1992. (Pubitemid 23603018)
    • (1992) IEEE Transactions on Signal Processing , vol.40 , Issue.12 , pp. 3043-3054
    • Juang Biing-Hwang1    Katagiri Shigeru2
  • 14
    • 0022890536
    • Maximum mutual information estimation of hidden Markov model parameters for speech recognition
    • L. R. Bahl, P. F. Brown, P.V.D. Souza, and R. L. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition," in Proc. ICASSP, 1986, pp. 49-52.
    • (1986) Proc. ICASSP , pp. 49-52
    • Bahl, L.R.1    Brown, P.F.2    Souza, P.V.D.3    Mercer, R.L.4
  • 15
    • 0036296863
    • Minimum phone error and I-smoothing for improved discriminative training
    • D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP, 2002, vol. 1, pp. 105-108.
    • (2002) Proc. ICASSP , vol.1 , pp. 105-108
    • Povey, D.1    Woodland, P.C.2
  • 16
    • 0033709098
    • Tandem connectionist feature stream extraction for conventional HMM systems
    • H. Hermansky, D. P. W. Ellis, and S. Sharma, "Tandem connectionist feature stream extraction for conventional HMM systems," in Proc. ICASSP, 2000, vol. III, pp. 1635-1638.
    • (2000) Proc. ICASSP , vol.3 , pp. 1635-1638
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 17
    • 33745528628
    • Using MLP features in SRI's conversational speech recognition system
    • Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan, "Using MLP features in SRI's conversational speech recognition system," in Proc. ICSLP, 2005, vol. 2, pp. 921-924.
    • (2005) Proc. ICSLP , vol.2 , pp. 921-924
    • Zhu, Q.1    Stolcke, A.2    Chen, B.Y.3    Morgan, N.4
  • 18
    • 0141629799
    • Improved recognition by combining different features and different systems
    • D. P. W. Ellis, "Improved recognition by combining different features and different systems," in Proc. AVIOS-2000, 2000.
    • (2000) Proc. AVIOS-2000
    • Ellis, D.P.W.1
  • 19
    • 35549000218
    • Cross-validation and aggregated EM training for robust parameter estimation
    • DOI 10.1016/j.csl.2007.07.005, PII S0885230807000472
    • T. Shinozaki and M. Ostendorf, "Cross-validation and aggregated EM training for robust parameter estimation," Comput. Speech Lang., vol. 22, no. 2, pp. 185-195, 2008. (Pubitemid 350016715)
    • (2008) Computer Speech and Language , vol.22 , Issue.2 , pp. 185-195
    • Shinozaki, T.1    Ostendorf, M.2
  • 21
    • 78049378080
    • Cambridge, U.K. [Online]
    • "HTK Toolkit," Cambridge, U.K. [Online]. Available: http://htk.eng.cam.ac
    • HTK Toolkit
  • 23
    • 34547516258
    • Approximating the Kullback Leibler divergence between Gaussian mixture models
    • J. Hershey and P. Olsen, "Approximating the Kullback Leibler divergence between Gaussian mixture models," in Proc. ICASSP, 2007, pp. 317-320.
    • (2007) Proc. ICASSP , pp. 317-320
    • Hershey, J.1    Olsen, P.2
  • 25
    • 78049379606
    • Data sampling ensemble acoustic modeling in speaker independent speech recognition
    • X. Chen and Y. Zhao, "Data sampling ensemble acoustic modeling in speaker independent speech recognition," in Proc. ICASSP, 2010, pp. 5130-5133.
    • (2010) Proc. ICASSP , pp. 5130-5133
    • Chen, X.1    Zhao, Y.2
  • 26
    • 79959855899
    • Integrating MLP features and discriminative training in data sampling based ensemble acoustic modeling
    • X. Chen and Y. Zhao, "Integrating MLP features and discriminative training in data sampling based ensemble acoustic modeling," in Proc. Interspeech, 2010, pp. 1349-1352.
    • (2010) Proc. Interspeech , pp. 1349-1352
    • Chen, X.1    Zhao, Y.2
  • 27
    • 0024768209
    • Speaker-independent phone recognition using hidden Markov models
    • Nov
    • K.-F. Lee and H.-W. Hon, "Speaker-independent phone recognition using hidden Markov models," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 11, pp. 1641-1648, Nov. 1989.
    • (1989) IEEE Trans. Acoust., Speech, Signal Process , vol.37 , Issue.11 , pp. 1641-1648
    • Lee, K.-F.1    Hon, H.-W.2
  • 29
    • 0344509344
    • Phoneme probability estimation with dynamic sparsely connected artificial networks
    • N. Strom, "Phoneme probability estimation with dynamic sparsely connected artificial networks," Free Speech J., no. 5, 1997.
    • (1997) Free Speech J , Issue.5
    • Strom, N.1
  • 30
    • 70349218140
    • Data sampling based ensemble acoustic modeling
    • X. Chen and Y. Zhao, "Data sampling based ensemble acoustic modeling," in Proc. ICASSP, 2009, pp. 3805-3808.
    • (2009) Proc. ICASSP , pp. 3805-3808
    • Chen, X.1    Zhao, Y.2
  • 31
    • 77949358999
    • An exploration of large vocabulary tools for small vocabulary phonetic recognition
    • T. N. Sainath, B. Ramabhadran, and M. Picheny, "An exploration of large vocabulary tools for small vocabulary phonetic recognition," in Proc. IEEE ASRU Workshop, 2009.
    • (2009) Proc IEEE ASRU Workshop
    • Sainath, T.N.1    Ramabhadran, B.2    Picheny, M.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.