Volume 25, Issue 2, 2011, Pages 462-479

Sparse imputation for large vocabulary noise robust ASR

Author keywords

Automatic speech recognition; Missing data techniques; Noise robustness; Sparse imputation

Indexed keywords

A-FRAMES; AUTOMATIC SPEECH RECOGNITION; CLEAN SPEECH; CONNECTED DIGITS; ERROR PRONES; FEATURE RELIABILITY; IMPUTATION TECHNIQUES; LARGE VOCABULARY; MISSING DATA TECHNIQUES; NOISE CONDITIONS; NOISE ROBUSTNESS; NOISY SPEECH; PARAMETRIC MODELS; ROBUST ASR; SPARSE IMPUTATION; TIME FRAME;

EID: 78049527664     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2010.06.004     Document Type: Article
Times cited: 26

References (52)
  • 1
    • 85009063707
    • Soft decisions in missing data techniques for robust automatic speech recognition
    • Beijing, China
    • J. Barker, L. Josifovski, M. Cooke, and P. Green Soft decisions in missing data techniques for robust automatic speech recognition Proc. ICSLP Beijing, China 2000 373 376
    • (2000) Proc. ICSLP , pp. 373-376
    • Barker, J.1    Josifovski, L.2    Cooke, M.3    Green, P.4
  • 2
    • 85009106519
    • Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise
    • Aalborg, Denmark
    • J. Barker, M. Cooke, and P. Green Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise Proc. EUROSPEECH Aalborg, Denmark 2001 213 216
    • (2001) Proc. EUROSPEECH , pp. 213-216
    • Barker, J.1    Cooke, M.2    Green, P.3
  • 3
    • 11144316019
    • Decoding speech in the presence of other sources
    • J. Barker, M. Cooke, and D. Ellis Decoding speech in the presence of other sources Speech Communication 45 1 2005 5 25
    • (2005) Speech Communication , vol.45 , Issue.1 , pp. 5-25
    • Barker, J.1    Cooke, M.2    Ellis, D.3
  • 5
    • 33847629729
    • On noise masking for automatic missing data speech recognition: A survey and discussion
    • C. Cerisara, S. Demange, and J.-P. Haton On noise masking for automatic missing data speech recognition: a survey and discussion Computer Speech and Language 21 3 2007 443 457
    • (2007) Computer Speech and Language , vol.21 , Issue.3 , pp. 443-457
    • Cerisara, C.1    Demange, S.2    Haton, J.-P.3
  • 6
    • 84869001637
    • Handling missing data in speech recognition
    • Yokohama, Japan
    • M. Cooke, P. Green, and M. Crawford Handling missing data in speech recognition Proc. ICSLP Yokohama, Japan 1994 1555 1558
    • (1994) Proc. ICSLP , pp. 1555-1558
    • Cooke, M.1    Green, P.2    Crawford, M.3
  • 7
    • 0035342414
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho Robust automatic speech recognition with missing and unreliable acoustic data Speech Communication 34 3 2001 267 285
    • (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 8
    • 33644661135
    • A glimpsing model of speech perception in noise
    • M. Cooke A glimpsing model of speech perception in noise J. Acoust. Soc. Am. 119 3 2006 1562 1573
    • (2006) J. Acoust. Soc. Am. , vol.119 , Issue.3 , pp. 1562-1573
    • Cooke, M.1
  • 10
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S.B. Davis, and P. Mermelstein Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Transactions on Acoustics, Speech, and Signal Processing 28 4 1980 357 366
    • (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 12
    • 33646365077
    • For most large underdetermined systems of linear equations the minimal L1-norm solution is also the sparsest solution
    • D.L. Donoho For most large underdetermined systems of linear equations the minimal L1-norm solution is also the sparsest solution Communications on Pure and Applied Mathematics 59 6 2006 797 829
    • (2006) Communications on Pure and Applied Mathematics , vol.59 , Issue.6 , pp. 797-829
    • Donoho, D.L.1
  • 13
    • 0032638856
    • Semi-tied covariance matrices for hidden Markov models
    • M. Gales Semi-tied covariance matrices for hidden Markov models IEEE Transactions on Speech and Audio Processing 7 3 1999 272 281
    • (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , Issue.3 , pp. 272-281
    • Gales, M.1
  • 14
    • 70349196731
    • Using sparse representations for missing data imputation in noise robust speech recognition
    • Lausanne, Switzerland
    • J. Gemmeke, and B. Cranen Using sparse representations for missing data imputation in noise robust speech recognition Proc. EUSIPCO Lausanne, Switzerland 2008
    • (2008) Proc. EUSIPCO
    • Gemmeke, J.1    Cranen, B.2
  • 15
    • 70449584968
    • Missing data imputation using compressive sensing techniques for connected digit recognition
    • Santorini, Greece
    • J. Gemmeke, and B. Cranen Missing data imputation using compressive sensing techniques for connected digit recognition Proc. DSP Santorini, Greece 2009 1 8
    • (2009) Proc. DSP , pp. 1-8
    • Gemmeke, J.1    Cranen, B.2
  • 16
    • 70349227603
    • Sparse imputation for noise robust speech recognition using soft masks
    • Taipei, Taiwan
    • J. Gemmeke, and B. Cranen Sparse imputation for noise robust speech recognition using soft masks Proc. ICASSP Taipei, Taiwan 2009 4645 4648
    • (2009) Proc. ICASSP , pp. 4645-4648
    • Gemmeke, J.1    Cranen, B.2
  • 17
    • 70450163899
    • Application of noise robust MDT speech recognition on the SPEECON and SpeechDat-Car databases
    • Brighton, UK
    • J.F. Gemmeke, Y. Wang, M. Van Segbroeck, B. Cranen, and H. Van hamme Application of noise robust MDT speech recognition on the SPEECON and SpeechDat-Car databases Proc. INTERSPEECH Brighton, UK 2009
    • (2009) Proc. INTERSPEECH
    • Gemmeke, J.F.1    Wang, Y.2    Van Segbroeck, M.3    Cranen, B.4    Van Hamme, H.5
  • 19
    • 79959814198
    • Observation uncertainty measures for sparse imputation
    • Gemmeke, J.F., Remes, U., Palomäki, K.J., 2010. Observation uncertainty measures for sparse imputation. In: Accepted to INTERSPEECH.
    • (2010) Accepted to INTERSPEECH
    • Gemmeke, J.F.1    Remes, U.2    Palomäki, K.J.3
  • 21
    • 0038669544
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Paris, France
    • H. Hirsch, and D. Pearce The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions Proc. ISCA Tutorial and Research Workshop ASR2000 Paris, France 2000 181 188
    • (2000) Proc. ISCA Tutorial and Research Workshop ASR2000 , pp. 181-188
    • Hirsch, H.1    Pearce, D.2
  • 23
    • 84910032186
    • SPEECON - Speech databases for consumer devices: Database specification and validation
    • Las Palmas, Canary Islands, Spain
    • D. Iskra, B. Grosskopf, K. Marasek, H.V.D. Heuvel, F. Diehl, and A. Kiessling SPEECON - speech databases for consumer devices: database specification and validation Proc. LREC Las Palmas, Canary Islands, Spain 2002 329 333
    • (2002) Proc. LREC , pp. 329-333
    • Iskra, D.1    Grosskopf, B.2    Marasek, K.3    Heuvel, H.V.D.4    Diehl, F.5    Kiessling, A.6
  • 24
    • 0037841203
    • State based imputation of missing data for robust speech recognition and speech enhancement
    • Budapest, Hungary
    • L. Josifovski, M. Cooke, P. Green, and A. Vizinho State based imputation of missing data for robust speech recognition and speech enhancement Proc. EUROSPEECH Budapest, Hungary 1999 2837 2840
    • (1999) Proc. EUROSPEECH , pp. 2837-2840
    • Josifovski, L.1    Cooke, M.2    Green, P.3    Vizinho, A.4
  • 26
    • 33947703708
    • Band-independent mask estimation for missing-feature reconstruction in the presence of unknown background noise
    • Toulouse, France
    • W. Kim, and R.M. Stern Band-independent mask estimation for missing-feature reconstruction in the presence of unknown background noise Proc. ICASSP Toulouse, France 2006 305 308
    • (2006) Proc. ICASSP , pp. 305-308
    • Kim, W.1    Stern, R.M.2
  • 28
    • 40249103761
    • Issues with uncertainty decoding for noise robust automatic speech recognition
    • H. Liao, and M.J.F. Gales Issues with uncertainty decoding for noise robust automatic speech recognition Speech Communication 50 4 2008 265 277
    • (2008) Speech Communication , vol.50 , Issue.4 , pp. 265-277
    • Liao, H.1    Gales, M.J.F.2
  • 29
    • 34748817500
    • Exploiting correlogram structure for robust speech recognition with multiple speech sources
    • N. Ma, P. Green, J. Barker, and A. Coy Exploiting correlogram structure for robust speech recognition with multiple speech sources Speech Communication 49 12 2007 874 891
    • (2007) Speech Communication , vol.49 , Issue.12 , pp. 874-891
    • Ma, N.1    Green, P.2    Barker, J.3    Coy, A.4
  • 32
    • 33745202179
    • An efficient one-pass decoder for Finnish large vocabulary continuous speech recognition
    • Tallinn, Estonia
    • J. Pylkkönen An efficient one-pass decoder for Finnish large vocabulary continuous speech recognition Proc. 2nd Baltic Conference on Human Language Technologies Tallinn, Estonia 2005 167 172
    • (2005) Proc. 2nd Baltic Conference on Human Language Technologies , pp. 167-172
    • Pylkkönen, J.1
  • 33
    • 85009089669
    • Duration modeling techniques for continuous speech recognition
    • Jeju Island, Korea
    • J. Pylkkönen, and M. Kurimo Duration modeling techniques for continuous speech recognition Proc. INTERSPEECH Jeju Island, Korea 2004 385 388
    • (2004) Proc. INTERSPEECH , pp. 385-388
    • Pylkkönen, J.1    Kurimo, M.2
  • 34
    • 85032752225
    • Missing-feature approaches in speech recognition
    • B. Raj, and R.M. Stern Missing-feature approaches in speech recognition IEEE Signal Processing Magazine 22 5 2005 101 116
    • (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 101-116
    • Raj, B.1    Stern, R.M.2
  • 35
    • 0001774275
    • Inference of missing spectrographic features for robust automatic speech recognition
    • Sydney, Australia
    • B. Raj, R. Singh, and R. Stern Inference of missing spectrographic features for robust automatic speech recognition Proc. ICSLP Sydney, Australia 1998 1491 1494
    • (1998) Proc. ICSLP , pp. 1491-1494
    • Raj, B.1    Singh, R.2    Stern, R.3
  • 36
    • 4644336054
    • Reconstruction of missing features for robust speech recognition
    • B. Raj, M. Seltzer, and R. Stern Reconstruction of missing features for robust speech recognition Speech Communication 43 4 2004 275 296
    • (2004) Speech Communication , vol.43 , Issue.4 , pp. 275-296
    • Raj, B.1    Seltzer, M.2    Stern, R.3
  • 38
    • 70450147392
    • Missing feature reconstruction and acoustic model adaptation combined for large vocabulary continuous speech recognition
    • Lausanne, Switzerland
    • U. Remes, K.J. Palomäki, and M. Kurimo Missing feature reconstruction and acoustic model adaptation combined for large vocabulary continuous speech recognition Proc. EUSIPCO Lausanne, Switzerland 2008
    • (2008) Proc. EUSIPCO
    • Remes, U.1    Palomäki, K.J.2    Kurimo, M.3
  • 40
    • 4644317224
    • A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
    • M. Seltzer, B. Raj, and R. Stern A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition Speech Communication 43 4 2004 379 393
    • (2004) Speech Communication , vol.43 , Issue.4 , pp. 379-393
    • Seltzer, M.1    Raj, B.2    Stern, R.3
  • 41
    • 33745217822
    • Growing an n-gram language model
    • Lisbon, Portugal
    • V. Siivola, and B. Pellom Growing an n-gram language model Proc. INTERSPEECH Lisbon, Portugal 2005 1309 1312
    • (2005) Proc. INTERSPEECH , pp. 1309-1312
    • Siivola, V.1    Pellom, B.2
  • 42
    • 33750311718
    • Binary and ratio time-frequency masks for robust speech recognition
    • S. Srinivasan, N. Roman, and D. Wang Binary and ratio time-frequency masks for robust speech recognition Speech Communication 48 11 2006 1486 1501
    • (2006) Speech Communication , vol.48 , Issue.11 , pp. 1486-1501
    • Srinivasan, S.1    Roman, N.2    Wang, D.3
  • 44
    • 42549131394
    • Exploiting temporal correlation of speech for error-robust and bandwidth-flexible distributed speech recognition
    • Z.-H. Tan, P. Dalsgaard, and B. Lindberg Exploiting temporal correlation of speech for error-robust and bandwidth-flexible distributed speech recognition IEEE Transactions on Audio, Speech and Language Processing 15 4 2007 1391 1403
    • (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , Issue.4 , pp. 1391-1403
    • Tan, Z.-H.1    Dalsgaard, P.2    Lindberg, B.3
  • 45
    • 78049530897
    • CSC Tieteellinen laskenta Oy
    • CSC Tieteellinen laskenta Oy, The language bank of Finland, 2001. www.csc.fi/languagebank/.
    • (2001) The Language Bank of Finland
  • 46
    • 4544315110
    • Robust speech recognition using cepstral domain missing data techniques and noisy masks
    • Montreal, Quebec, Canada
    • H. Van hamme Robust speech recognition using cepstral domain missing data techniques and noisy masks Proc. ICASSP Montreal, Quebec, Canada 2004 213 216
    • (2004) Proc. ICASSP , pp. 213-216
    • Van Hamme, H.1
  • 47
    • 85009128803
    • PROSPECT features and their application to missing data techniques for robust speech recognition
    • Jeju Island, Korea
    • H. Van hamme PROSPECT features and their application to missing data techniques for robust speech recognition Proc. INTERSPEECH Jeju Island, Korea 2004 101 104
    • (2004) Proc. INTERSPEECH , pp. 101-104
    • Van Hamme, H.1
  • 48
    • 70450167189
    • Vector-Quantization based mask estimation for missing data automatic speech recognition
    • Antwerp, Belgium
    • M. Van Segbroeck, and H. Van hamme Vector-Quantization based mask estimation for missing data automatic speech recognition Proc. INTERSPEECH Antwerp, Belgium 2007 910 913
    • (2007) Proc. INTERSPEECH , pp. 910-913
    • Van Segbroeck, M.1    Van Hamme, H.2
  • 49
    • 0027623210
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • A. Varga, and H. Steeneken Assessment for automatic speech recognition: II. NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems Speech Communication 12 3 1993 247 251
    • (1993) Speech Communication , vol.12 , Issue.3 , pp. 247-251
    • Varga, A.1    Steeneken, H.2
  • 50
    • 0002603206
    • Missing data theory, spectral subtraction and signal-to-noise estimation for robust ASR: An integrated study
    • Budapest, Hungary
    • A. Vizinho, P. Green, M. Cooke, and L. Josifovski Missing data theory, spectral subtraction and signal-to-noise estimation for robust ASR: an integrated study Proc. EUROSPEECH Budapest, Hungary 1999 2407 2410
    • (1999) Proc. EUROSPEECH , pp. 2407-2410
    • Vizinho, A.1    Green, P.2    Cooke, M.3    Josifovski, L.4
  • 51
    • 48149090146
    • Estimating single-channel source separation masks: Relevance vector machine classifiers vs. pitch-based masking
    • Pittsburgh, Pennsylvania, USA
    • R. Weiss, and D. Ellis Estimating single-channel source separation masks: relevance vector machine classifiers vs. pitch-based masking Proc. Workshop on Statistical and Perceptual Audition SAPA-06 Pittsburgh, Pennsylvania, USA 2006 31 36
    • (2006) Proc. Workshop on Statistical and Perceptual Audition SAPA-06 , pp. 31-36
    • Weiss, R.1    Ellis, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.