메뉴 건너뛰기




Volumn 19, Issue 3, 2011, Pages 468-481

Data balancing for efficient training of hybrid ANN/HMM automatic speech recognition systems

Author keywords

Active learning; additive noise; ANN HMM; artificial neural networks (ANNs); hidden Markov models (HMMs); hybrid automatic speech recognition (ASR); machine learning; MLP HMM; multilayer perceptrons (MLPs); robust ASR

Indexed keywords

ACTIVE LEARNING; ANN/HMM; ARTIFICIAL NEURAL NETWORKS; AUTOMATIC SPEECH RECOGNITION; MACHINE-LEARNING; MLP/HMM; MULTILAYER PERCEPTRONS (MLPS); ROBUST ASR;

EID: 78149247542     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2050513     Document Type: Article
Times cited : (23)

References (61)
  • 1
    • 0035101535 scopus 로고    scopus 로고
    • A survey of hybrid ANN/HMM models for automatic speech recognition
    • April
    • E. Trentin and M. Gori, "A survey of hybrid ANN/HMM models for automatic speech recognition", Neurocomputing, vol. 37, no. 1-4, pp. 91-126, April 2001.
    • (2001) Neurocomputing , vol.37 , Issue.1-4 , pp. 91-126
    • Trentin, E.1    Gori, M.2
  • 2
    • 84857498367 scopus 로고    scopus 로고
    • Scaling up: Learning large-scale recognition methods from small-scale recognition tasks
    • Maui, HI
    • N. Morgan, B. Chen, Q. Zhu, and A. Stolcke, "Scaling up: Learning large-scale recognition methods from small-scale recognition tasks", in Proc. Special Workshop in Maui (SWIM), Maui, HI, 2004.
    • (2004) Proc. Special Workshop in Maui (SWIM)
    • Morgan, N.1    Chen, B.2    Zhu, Q.3    Stolcke, A.4
  • 9
    • 50549099450 scopus 로고    scopus 로고
    • Guest editorial: Special issue on utility-based data mining
    • Oct
    • G. Weiss, B. Zadrozny, and M. Saar-Tsechansky, "Guest editorial: Special issue on utility-based data mining", Data Mining Knowl. Discov., vol. 17, no. 2, pp. 129-135, Oct. 2008.
    • (2008) Data Mining Knowl. Discov. , vol.17 , Issue.2 , pp. 129-135
    • Weiss, G.1    Zadrozny, B.2    Saar-Tsechansky, M.3
  • 13
    • 33745216087 scopus 로고    scopus 로고
    • Selective sampling of training data for speech recognition
    • San Diego, CA, March, Morgan Kaufmann Publisher
    • T. Kamm and G. Meyer, "Selective sampling of training data for speech recognition", in Proc. 2nd Int. Conf. Human Lang. Technol. Res., San Diego, CA, March 2002, pp. 20-24, Morgan Kaufmann Publisher.
    • (2002) Proc. 2nd Int. Conf. Human Lang. Technol. Res. , pp. 20-24
    • Kamm, T.1    Meyer, G.2
  • 14
    • 85055304870 scopus 로고    scopus 로고
    • Ph. D. dissertation, John Hopkins Univ., Baltimore, MD, Jan, Online. Available
    • T. Kamm, "Active learning for acoustic speech recognition modeling" Ph. D. dissertation, John Hopkins Univ., Baltimore, MD, Jan. 2004 [Online]. Available: http://www.clsp.jhu.edu/people/tkamm/papers/kamm-thesis. pdf
    • (2004) Active Learning for Acoustic Speech Recognition modeling
    • Kamm, T.1
  • 18
    • 11144222882 scopus 로고    scopus 로고
    • Comparison and combination of features in a hybrid HMM/MLP and a HMM/GMM speech recognition system
    • Jan
    • P. Pujol, S. Pol, C. Nadeu, A. Hagen, and H. Bourlard, "Comparison and combination of features in a hybrid HMM/MLP and a HMM/GMM speech recognition system", IEEE Trans. Speech Audio Process., vol. 13, no. 1, pp. 14-22, Jan. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.1 , pp. 14-22
    • Pujol, P.1    Pol, S.2    Nadeu, C.3    Hagen, A.4    Bourlard, H.5
  • 22
    • 0029306621 scopus 로고
    • Continuous speech recognition: An introduction to the hybrid HMM/connectionist approach
    • May
    • N. Morgan and H. Bourlard, "Continuous speech recognition: An introduction to the hybrid HMM/connectionist approach", IEEE Signal Process. Mag., vol. 12, no. 3, pp. 24-42, May 1995.
    • (1995) IEEE Signal Process. Mag. , vol.12 , Issue.3 , pp. 24-42
    • Morgan, N.1    Bourlard, H.2
  • 23
    • 33847686469 scopus 로고    scopus 로고
    • A segment-based interpretation of HMM/ANN hybrids
    • Jul
    • L. Tóth and A. Kocsor, "A segment-based interpretation of HMM/ANN hybrids", Comput. Speech Lang., vol. 21, no. 3, pp. 562-578, Jul. 2007.
    • (2007) Comput. Speech Lang. , vol.21 , Issue.3 , pp. 562-578
    • Tóth, L.1    Kocsor, A.2
  • 24
    • 34548012893 scopus 로고    scopus 로고
    • Linear hidden transformations for adaptation of hybrid ANN/HMM models
    • Oct
    • R. Gemello, F. Mana, S. Scanzio, P. Laface, and R. D. Mori, "Linear hidden transformations for adaptation of hybrid ANN/HMM models", Speech Commun., vol. 49, no. 10-11, pp. 827-835, Oct. 2007.
    • (2007) Speech Commun. , vol.49 , Issue.10-11 , pp. 827-835
    • Gemello, R.1    Mana, F.2    Scanzio, S.3    Laface, P.4    Mori, R.D.5
  • 25
    • 33745351889 scopus 로고    scopus 로고
    • Hybrid NN/HMM acoustic modeling techniques for distributed speech recognition
    • Aug
    • J. Stadermann and G. Rigoll, "Hybrid NN/HMM acoustic modeling techniques for distributed speech recognition", Speech Commun., vol. 48, no. 8, pp. 1037-1046, Aug. 2006.
    • (2006) Speech Commun. , vol.48 , Issue.8 , pp. 1037-1046
    • Stadermann, J.1    Rigoll, G.2
  • 27
    • 0030355964 scopus 로고    scopus 로고
    • An incremental speaker-adaptation technique for hybrid hmm-mlp recognizer
    • Philadelphia, PA
    • J. Neto, C. Martins, and L. Almeida, "An incremental speaker-adaptation technique for hybrid hmm-mlp recognizer", in Proc. 4th Int. Conf. Spoken Lang. Process. (ICSLP'96), Philadelphia, PA, 1996, pp. 1289-1292.
    • (1996) Proc. 4th Int. Conf. Spoken Lang. Process. (ICSLP'96) , pp. 1289-1292
    • Neto, J.1    Martins, C.2    Almeida, L.3
  • 30
    • 0028194709 scopus 로고
    • Connectionist probability estimators in HMM speech recognition
    • Jan
    • S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco, "Connectionist probability estimators in HMM speech recognition", Speech and Audio Processing, vol. 2, no. 1, pp. 161-174, Jan. 1994.
    • (1994) Speech and Audio Processing , vol.2 , Issue.1 , pp. 161-174
    • Renals, S.1    Morgan, N.2    Bourlard, H.3    Cohen, M.4    Franco, H.5
  • 31
    • 0141697346 scopus 로고    scopus 로고
    • Ph. D. dissertation, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, Dec, Online. Available
    • A. Hagen, "Robust speech recognition based on multi-stream processing" Ph. D. dissertation, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, Dec. 2001 [Online]. Available: http://infoscience.epfl.ch/search.py?recid=32973
    • (2001) Robust Speech Recognition Based on Multi-stream processing
    • Hagen, A.1
  • 33
    • 51449107216 scopus 로고    scopus 로고
    • Improving spoken language understanding with information retrieval and active learning methods
    • Las Vegas, NV, Mar.-Apr
    • I. Jars and F. Panaget, "Improving spoken language understanding with information retrieval and active learning methods", in IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'08), Las Vegas, NV, Mar.-Apr. 2008, pp. 5001-5004.
    • (2008) IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'08) , pp. 5001-5004
    • Jars, I.1    Panaget, F.2
  • 36
    • 78149254813 scopus 로고    scopus 로고
    • Speeding-up neural network training using sentence and frame selection
    • Jan, Online. Available
    • S. Scanzio, P. Laface, R. Gemello, and F. Mana, "Speeding-up neural network training using sentence and frame selection", in Proc. Interspeech, Jan. 2007, pp. 1725-1728 [Online]. Available: http://www.isca-speech.org/ archive/interspeech-2007/
    • (2007) Proc. Interspeech , pp. 1725-1728
    • Scanzio, S.1    Laface, P.2    Gemello, R.3    Mana, F.4
  • 37
    • 0033322116 scopus 로고    scopus 로고
    • Improving generalization ability through active learning
    • Feb
    • S. Vijayakumar and H. Ogawa, "Improving generalization ability through active learning", IEICE Trans. Inf. Syst., vol. E82-D, no. 2, pp. 480-487, Feb. 1999.
    • (1999) IEICE Trans. Inf. Syst. , vol.E82-D , Issue.2 , pp. 480-487
    • Vijayakumar, S.1    Ogawa, H.2
  • 38
    • 13544261390 scopus 로고    scopus 로고
    • Combining active and semi-supervised learning for spoken language understanding
    • DOI 10.1016/j.specom.2004.08.002, PII S0167639304000962
    • G. Tur, D. Hakkani-Tür, and R. Schapire, "Combining active and semi-supervised learning for spoken language understanding", Speech Commun., vol. 45, no. 2, pp. 171-186, Feb. 2005. (Pubitemid 40220192)
    • (2005) Speech Communication , vol.45 , Issue.2 , pp. 171-186
    • Tur, G.1    Hakkani-Tur, D.2    Schapire, R.E.3
  • 39
    • 22544470628 scopus 로고    scopus 로고
    • Active learning: Theory and applications to automatic speech recognition
    • Jul
    • G. Riccardi and D. Hakkani-Tür, "Active learning: Theory and applications to automatic speech recognition", IEEE Trans. Speech Audio Process., vol. 13, no. 4, pp. 504-511, Jul. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.4 , pp. 504-511
    • Riccardi, G.1    Hakkani-Tür, D.2
  • 40
    • 33947613132 scopus 로고    scopus 로고
    • A new data selection approach for semisupervised acoustic modeling
    • Toulouse, France, May
    • R. Zhang and A. Rudnicky, "A new data selection approach for semisupervised acoustic modeling", in IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'06), Toulouse, France, May 2006, vol. 1, pp. 421-424.
    • (2006) IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'06) , vol.1 , pp. 421-424
    • Zhang, R.1    Rudnicky, A.2
  • 42
    • 27644444018 scopus 로고    scopus 로고
    • A dynamic in-search data selection method with its applications to acoustic modeling and utterance verification
    • Sep
    • H. Jiang, F. Soong, and C.-H. Lee, "A dynamic in-search data selection method with its applications to acoustic modeling and utterance verification", IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 945-955, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 945-955
    • Jiang, H.1    Soong, F.2    Lee, C.-H.3
  • 44
    • 27144549260 scopus 로고    scopus 로고
    • Editorial: Special issue on learning from imbalanced data sets
    • Jun
    • N. Chawla, N. Japkowicz, and A. Kotcz, "Editorial: Special issue on learning from imbalanced data sets", ACM SIGKDD Explorations Newslett., vol. 6, no. 1, pp. 1-6, Jun. 2004.
    • (2004) ACM SIGKDD Explorations Newslett. , vol.6 , Issue.1 , pp. 1-6
    • Chawla, N.1    Japkowicz, N.2    Kotcz, A.3
  • 46
    • 27144531570 scopus 로고    scopus 로고
    • A study of the behavior of several methods for balancing machine learning training data
    • Jun
    • G. Batista, R. Prati, and M. Monard, "A study of the behavior of several methods for balancing machine learning training data", ACM SIGKDD Explorations Newslett., vol. 6, no. 1, pp. 20-29, Jun. 2004.
    • (2004) ACM SIGKDD Explorations Newslett. , vol.6 , Issue.1 , pp. 20-29
    • Batista, G.1    Prati, R.2    Monard, M.3
  • 47
    • 27144540575 scopus 로고    scopus 로고
    • Class imbalances versus small disjuncts
    • Jun
    • T. Jo and N. Japkowicz, "Class imbalances versus small disjuncts", ACM SIGKDD Explorations Newslett., vol. 6, no. 1, pp. 40-49, Jun. 2004.
    • (2004) ACM SIGKDD Explorations Newslett. , vol.6 , Issue.1 , pp. 40-49
    • Jo, T.1    Japkowicz, N.2
  • 51
    • 50549101751 scopus 로고    scopus 로고
    • Automatically countering imbalance and its empirical relationship to cost
    • Oct
    • N. Chawla, D. Cieslak, L. Hall, and A. Joshi, "Automatically countering imbalance and its empirical relationship to cost", Data Mining Knowl. Discov., vol. 17, no. 2, pp. 225-252, Oct. 2008.
    • (2008) Data Mining Knowl. Discov. , vol.17 , Issue.2 , pp. 225-252
    • Chawla, N.1    Cieslak, D.2    Hall, L.3    Joshi, A.4
  • 52
    • 0001972236 scopus 로고    scopus 로고
    • Addressing the curse of imbalanced training sets: One-sided selection
    • Morgan Kaufmann
    • M. Kubat and S. Matwin, "Addressing the curse of imbalanced training sets: One-sided selection", in Proc. 14th Int. Conf. Mach. Learn., 1997, pp. 179-186, Morgan Kaufmann.
    • (1997) Proc. 14th Int. Conf. Mach. Learn. , pp. 179-186
    • Kubat, M.1    Matwin, S.2
  • 53
    • 0346586663 scopus 로고    scopus 로고
    • Smote: Synthetic minority over-sampling technique
    • Feb
    • N. Chawla, K. Bowyer, L. Hall, and W. Kegelmeyer, "Smote: Synthetic minority over-sampling technique", J. Artif. Intell. Res., vol. 16, pp. 321-357, Feb. 2002.
    • (2002) J. Artif. Intell. Res. , vol.16 , pp. 321-357
    • Chawla, N.1    Bowyer, K.2    Hall, L.3    Kegelmeyer, W.4
  • 54
  • 56
    • 1442356040 scopus 로고    scopus 로고
    • A multiple resampling method for learning from imbalanced data sets
    • Feb
    • A. Estabrooks, T. Jo, and N. Japkowicz, "A multiple resampling method for learning from imbalanced data sets", Comput. Intell., vol. 20, no. 1, pp. 18-36, Feb. 2004.
    • (2004) Comput. Intell. , vol.20 , Issue.1 , pp. 18-36
    • Estabrooks, A.1    Jo, T.2    Japkowicz, N.3
  • 57
    • 31344442851 scopus 로고    scopus 로고
    • Training cost-sensitive neural networks with methods addressing the class imbalance problem
    • Jan
    • Z.-H. Zhou and X.-Y. Liu, "Training cost-sensitive neural networks with methods addressing the class imbalance problem", IEEE Trans. Knowl. Data Eng., vol. 18, no. 1, pp. 63-77, Jan. 2006.
    • (2006) IEEE Trans. Knowl. Data Eng. , vol.18 , Issue.1 , pp. 63-77
    • Zhou, Z.-H.1    Liu, X.-Y.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.