Volume 19, Issue 7, 2011, Pages 2067-2080

Exemplar-based sparse representations for noise robust automatic speech recognition

Author keywords

Exemplar based; noise robustness; non negative matrix factorization; sparse representations; speech recognition

Indexed keywords

CONNECTED DIGITS; EXEMPLAR-BASED; FEATURE ENHANCEMENT; LINEAR COMBINATIONS; MISSING DATA; NOISE ROBUSTNESS; NOISE-ROBUST AUTOMATIC SPEECH RECOGNITION; NOISY SPEECH; NONNEGATIVE MATRIX FACTORIZATION; PHONETIC INFORMATION; SIGNAL TO NOISE; SOURCE SEPARATION; SPARSE REPRESENTATION; TIME FRAME; TIME FREQUENCY;

EID: 79960657803     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2011.2112350     Document Type: Article
Times cited : (298)

References (47)
  • 1
    • 0030142722
    • Towards increasing speech recognition error rates
    • DOI 10.1016/0167-6393(96)00003-9, PII S0167639396000039
    • H. Bourlard, H. Hermansky, and N. Morgan, "Towards increasing speech recognition error rates," Speech Commun., vol. 18, pp. 205-231, 1996. (Pubitemid 126362800)
    • (1996) Speech Communication , vol.18 , Issue.3 , pp. 205-231
    • Bourlard, H.1    Hermansky, H.2    Morgan, N.3
  • 2
    • 0029725301
    • A vector Taylor series approach for environment-independent speech recognition
    • Atlanta, GA
    • P. Moreno, B. Raj, and R. Stern, "A vector Taylor series approach for environment-independent speech recognition," in Proc. Int. Conf. Audio, Speech, Signal Process., Atlanta, GA, 1996, pp. 733-736.
    • (1996) Proc. Int. Conf. Audio, Speech, Signal Process. , pp. 733-736
    • Moreno, P.1    Raj, B.2    Stern, R.3
  • 3
    • 0030245128
    • Robust continuous speech recognition using parallel model combination
    • PII S1063667696067120
    • M. J. F. Gales and S. J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 352-359, Sep. 1996. (Pubitemid 126753023)
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 352-359
    • Gales, M.J.F.1    Young, S.J.2
  • 4
    • 85032752225
    • Missing-feature approaches in speech recognition
    • DOI 10.1109/MSP.2005.1511828
    • B. Raj and R. M. Stern, "Missing-feature approaches in speech recognition," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 101-116, Sep. 2005. (Pubitemid 41488524)
    • (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 101-116
    • Raj, B.1    Stern, R.M.2
  • 6
    • 0030635327
    • Application of sequential estimation to time-varying environment compensation in speech recognition
    • N. S. Kim, D. K. Kim, and S. R. Kim, "Application of sequential estimation to time-varying environment compensation in speech recognition," in IEEE Workshop Autom. Speech Recognition Understanding, 1997, pp. 389-395.
    • (1997) IEEE Workshop Autom. Speech Recognition Understanding , pp. 389-395
    • Kim, N.S.1    Kim, D.K.2    Kim, S.R.3
  • 7
    • 85009074657
    • ALGONQUIN: Iterating Laplace's method to remove multiple types of acoustic distortion for robust speech recognition
    • B. J. Frey, L. Deng, A. Acero, and T. Kristjansson, "ALGONQUIN: Iterating Laplace's method to remove multiple types of acoustic distortion for robust speech recognition," in Proc. Eurospeech, 2001, pp. 901-904.
    • (2001) Proc. Eurospeech , pp. 901-904
    • Frey, B.J.1    Deng, L.2    Acero, A.3    Kristjansson, T.4
  • 8
    • 84898993440
    • Sequential noise compensation by sequential Monte Carlo method
    • K. Yao and S. Nakamura, "Sequential noise compensation by sequential Monte Carlo method," in Proc. Neural Inf. Process. Syst., 2002, pp. 1205-1212.
    • (2002) Proc. Neural Inf. Process. Syst. , pp. 1205-1212
    • Yao, K.1    Nakamura, S.2
  • 9
    • 84898964201
    • Algorithms for non-negative matrix factorization
    • Apr
    • D. D. Lee and H. S. Seung, "Algorithms for non-negative matrix factorization," in Proc. Neural Inf. Process. Syst., Apr. 2001, pp. 556-562.
    • (2001) Proc. Neural Inf. Process. Syst. , pp. 556-562
    • Lee, D.D.1    Seung, H.S.2
  • 10
    • 85032750937
    • An introduction to compressive sampling
    • Mar
    • E. J. Candès and M. B. Wakin, "An introduction to compressive sampling," IEEE Signal Process. Mag., vol. 25, no. 2, pp. 21-30, Mar. 2008.
    • (2008) IEEE Signal Process. Mag. , vol.25 , Issue.2 , pp. 21-30
    • Candès, E.J.1    Wakin, M.B.2
  • 12
    • 50249152311
    • Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria
    • Mar
    • T. Virtanen, "Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 1066-1074
    • Virtanen, T.1
  • 14
    • 67149096066
    • Mixtures of gamma priors for nonnegative matrix factorization based speech separation
    • T. Virtanen and A. T. Cemgil, "Mixtures of gamma priors for nonnegative matrix factorization based speech separation," in Proc. ICA, 2009, pp. 646-653.
    • (2009) Proc. ICA , pp. 646-653
    • Virtanen, T.1    Cemgil, A.T.2
  • 15
    • 79959818117
    • Non-negative matrix factorization based compensation of music for automatic speech recognition
    • B. Raj, T. Virtanen, S. Chaudhuri, and R. Singh, "Non-negative matrix factorization based compensation of music for automatic speech recognition," in Proc. Int. Conf. Speech, Lang. Process., 2010, pp. 717-720.
    • (2010) Proc. Int. Conf. Speech, Lang. Process. , pp. 717-720
    • Raj, B.1    Virtanen, T.2    Chaudhuri, S.3    Singh, R.4
  • 18
    • 84863733079
    • Using sparse representations for exemplar based continuous digit recognition
    • Glasgow, Scotland, Aug. 24-28
    • J. F. Gemmeke, L. ten Bosch, L. Boves, and B. Cranen, "Using sparse representations for exemplar based continuous digit recognition," in Proc. EUSIPCO, Glasgow, Scotland, Aug. 24-28, 2009, pp. 1755-1759.
    • (2009) Proc. EUSIPCO , pp. 1755-1759
    • Gemmeke, J.F.1    Ten Bosch, L.2    Boves, L.3    Cranen, B.4
  • 22
    • 77949695902
    • Compressive sensing for missing data imputation in noise robust speech recognition
    • Apr.
    • J. F. Gemmeke, H. Van Hamme, B. Cranen, and L. Boves, "Compressive sensing for missing data imputation in noise robust speech recognition," IEEE J. Sel. Topics Signal Process., vol. 4, no. 2, pp. 272-287, Apr. 2010.
    • (2010) IEEE J. Sel. Topics Signal Process. , vol.4 , Issue.2 , pp. 272-287
    • Gemmeke, J.F.1    Van Hamme, H.2    Cranen, B.3    Boves, L.4
  • 24
    • 70450188400
    • Applying non-negative matrix factorization on time-frequency reassignment spectra for missing data mask estimation
    • Brighton, U.K., Sep. 6-10
    • M. Van Segbroeck and H. Van Hamme, "Applying non-negative matrix factorization on time-frequency reassignment spectra for missing data mask estimation," in Proc. Interspeech, Brighton, U.K., Sep. 6-10, 2009, pp. 2511-2514.
    • (2009) Proc. Interspeech , pp. 2511-2514
    • Van Segbroeck, M.1    Van Hamme, H.2
  • 25
    • 4544315110
    • Robust speech recognition using cepstral domain missing data techniques and noisy masks
    • H. Van Hamme, "Robust speech recognition using cepstral domain missing data techniques and noisy masks," in Proc. Int. Conf. Audio, Speech, Signal Process., 2004, vol. 1, pp. 213-216.
    • (2004) Proc. Int. Conf. Audio, Speech, Signal Process. , vol.1 , pp. 213-216
    • Van Hamme, H.1
  • 27
    • 79959837544
    • State-based labeling for a sparse representation of speech and its application to robust speech recognition
    • T. Virtanen, J. F. Gemmeke, and A. Hurmalainen, "State-based labeling for a sparse representation of speech and its application to robust speech recognition," in Proc. Interspeech, 2010, pp. 893-896.
    • (2010) Proc. Interspeech , pp. 893-896
    • Virtanen, T.1    Gemmeke, J.F.2    Hurmalainen, A.3
  • 28
    • 84858719009
    • A sparse non-parametric approach for single channel separation of known sounds
    • P. Smaragdis, M. Shashanka, and B. Raj, "A sparse non-parametric approach for single channel separation of known sounds," in Proc. Neural Inf. Process. Syst., 2009, pp. 1705-1713.
    • (2009) Proc. Neural Inf. Process. Syst. , pp. 1705-1713
    • Smaragdis, P.1    Shashanka, M.2    Raj, B.3
  • 30
    • 85128375707
    • Inference of missing spectrographic features for robust automatic speech recognition
    • Sydney, Australia, Nov. 4
    • B. Raj, R. Singh, and R. Stern, "Inference of missing spectrographic features for robust automatic speech recognition," in Proc. Int. Conf. Speech Lang. Process., Sydney, Australia, Nov. 4, 1998, pp. 1491-1494.
    • (1998) Proc. Int. Conf. Speech Lang. Process. , pp. 1491-1494
    • Raj, B.1    Singh, R.2    Stern, R.3
  • 31
    • 85009128803
    • Prospect features and their application to missing data techniques for robust speech recognition
    • H. Van Hamme, "Prospect features and their application to missing data techniques for robust speech recognition," in Proc. Interspeech, 2004, pp. 101-104.
    • (2004) Proc. Interspeech , pp. 101-104
    • Van Hamme, H.1
  • 33
    • 0038669544
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Paris, France, Sep. 18-20
    • H. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ISCA Tutorial Research Workshop ASR2000, Paris, France, Sep. 18-20, 2000, pp. 181-188.
    • (2000) Proc. ISCA Tutorial Research Workshop ASR2000 , pp. 181-188
    • Hirsch, H.1    Pearce, D.2
  • 34
    • 33947622695
    • Handling time-derivative features in a missing data framework for robust automatic speech recognition
    • H. Van Hamme, "Handling time-derivative features in a missing data framework for robust automatic speech recognition," in Proc. Int. Conf. Audio, Speech, Signal Process., 2006, pp. 293-296.
    • (2006) Proc. Int. Conf. Audio, Speech, Signal Process. , pp. 293-296
    • Van Hamme, H.1
  • 35
    • 79959834868
    • Artificial and online acquired noise dictionaries for noise robust ASR
    • J. F. Gemmeke and T. Virtanen, "Artificial and online acquired noise dictionaries for noise robust ASR," in Proc. Interspeech, 2010, pp. 2082-2085.
    • (2010) Proc. Interspeech , pp. 2082-2085
    • Gemmeke, J.F.1    Virtanen, T.2
  • 36
    • 85009113852
    • HMM adaptation using vector Taylor series for noisy speech recognition
    • Beijing, China
    • A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition," in Proc. Int. Conf. Spoken Lang. Process., Beijing, China, 2000, pp. 869-872.
    • (2000) Proc. Int. Conf. Spoken Lang. Process. , pp. 869-872
    • Acero, A.1    Deng, L.2    Kristjansson, T.3    Zhang, J.4
  • 37
    • 70450179002
    • Transforming features to compensate speech recognizer models for noise
    • Brighton, U.K., Sep. 6-10
    • R. C. Van Dalen, F. Flego, and M. J. F. Gales, "Transforming features to compensate speech recognizer models for noise," in Proc. Interspeech, Brighton, U.K., Sep. 6-10, 2009, pp. 2499-2502.
    • (2009) Proc. Interspeech , pp. 2499-2502
    • Van Dalen, R.C.1    Flego, F.2    Gales, M.J.F.3
  • 38
    • 79959825120
    • Using a DBN to integrate sparse classification and GMM-based ASR
    • DOI:10.1016/j.csl.2010.06.004
    • Y. Sun, J. F. Gemmeke, B. Cranen, L. ten Bosch, and L. Boves, "Using a DBN to integrate sparse classification and GMM-based ASR," in Proc. Interspeech, 2010, pp. 2098-2101, DOI:10.1016/j.csl.2010.06.004.
    • (2010) Proc. Interspeech , pp. 2098-2101
    • Sun, Y.1    Gemmeke, J.F.2    Cranen, B.3    Ten Bosch, L.4    Boves, L.5
  • 39
    • 78049527664
    • Sparse imputation for large vocabulary noise robust ASR
    • J. F. Gemmeke, B. Cranen, and U. Remes, "Sparse imputation for large vocabulary noise robust ASR," Comput. Speech Lang., pp. 462-479, 2010.
    • (2010) Comput. Speech Lang. , pp. 462-479
    • Gemmeke, J.F.1    Cranen, B.2    Remes, U.3
  • 40
    • 78049409668
    • Fast GPU implementation of large scale dictionary and sparse representation based vision problems
    • P. Nagesh, R. Gowda, and B. Li, "Fast GPU implementation of large scale dictionary and sparse representation based vision problems," in Proc. Int. Conf. Audio, Speech, Signal Process., 2010, pp. 1570-1573.
    • (2010) Proc. Int. Conf. Audio, Speech, Signal Process. , pp. 1570-1573
    • Nagesh, P.1    Gowda, R.2    Li, B.3
  • 41
    • 67651030071
    • Unsupervised learning of time-frequency patches as a noise-robust representation of speech
    • M. Van Segbroeck and H. Van Hamme, "Unsupervised learning of time-frequency patches as a noise-robust representation of speech," Speech Commun., vol. 51, no. 11, 2009.
    • (2009) Speech Commun. , vol.51 , Issue.11
    • Van Segbroeck, M.1    Van Hamme, H.2
  • 44
    • 33750383209
    • K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation
    • DOI 10.1109/TSP.2006.881199
    • M. Aharon, M. Elad, and A. M. Bruckstein, "K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation," IEEE Trans. Signal Process., vol. 54, no. 11, pp. 4311-4322, Nov. 2006. (Pubitemid 44637761)
    • (2006) IEEE Transactions on Signal Processing , vol.54 , Issue.11 , pp. 4311-4322
    • Aharon, M.1    Elad, M.2    Bruckstein, A.3
  • 46
    • 38049021850
    • Convolutive speech bases and their application to supervised speech separation
    • Jan
    • P. Smaragdis, "Convolutive speech bases and their application to supervised speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 1-12, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 1-12
    • Smaragdis, P.1
  • 47
    • 36348966695
    • On the convergence of multiplicative update algorithms for nonnegative matrix factorization
    • DOI 10.1109/TNN.2007.895831
    • C.-J. Lin, "On the convergence of multiplicative update algorithms for nonnegative matrix factorization," IEEE Trans. Neural Netw., vol. 18, no. 6, pp. 1589-1596, Nov. 2007. (Pubitemid 350148414)
    • (2007) IEEE Transactions on Neural Networks , vol.18 , Issue.6 , pp. 1589-1596
    • Lin, C.-J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.