메뉴 건너뛰기




Volumn 4, Issue 2, 2010, Pages 272-287

Compressive sensing for missing data imputation in noise robust speech recognition

Author keywords

Automatic speech recognition (ASR); Compressive sensing (CS); Missing data techniques; Noise robustness

Indexed keywords

A-FRAMES; AUTOMATIC SPEECH RECOGNITION; CLEAN SPEECH; CONVENTIONAL TECHNIQUES; ERROR PRONES; IMPUTATION TECHNIQUES; LINEAR COMBINATIONS; LOW SIGNAL-TO-NOISE RATIO; MISSING DATA; MISSING DATA TECHNIQUES; NOISE ROBUST SPEECH RECOGNITION; NOISE ROBUSTNESS; NOISY OBSERVATIONS; NOISY SPEECH; NON-PARAMETRIC; OVERCOMPLETE DICTIONARIES; PARAMETRIC MODELS; TIME FRAME; TIME WINDOWS;

EID: 77949695902     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2009.2039171     Document Type: Article
Times cited : (97)

References (55)
  • 1
    • 0000652102 scopus 로고
    • Some solutions to the missing feature problem in vision
    • S. Hanson, J. Cowan, and C. Giles, Eds. San Franciso, CA: Morgan Kaufmann
    • S. Ahmad and V. Tresp, "Some solutions to the missing feature problem in vision," in Advances in Neural Information Processing Systems 5, S. Hanson, J. Cowan, and C. Giles, Eds. San Franciso, CA: Morgan Kaufmann, 1993, pp. 393-400.
    • (1993) Advances in Neural Information Processing Systems , vol.5 , pp. 393-400
    • Ahmad, S.1    Tresp, V.2
  • 3
    • 85128375707 scopus 로고    scopus 로고
    • Inference of missing spectrographic features for robust automatic speech recognition
    • B. Raj, R. Singh, and R. Stern, "Inference of missing spectrographic features for robust automatic speech recognition," in Proc. Inf Conf. Spoken Lang. Process., 1998, pp. 1491-1494.
    • (1998) Proc. Inf Conf. Spoken Lang. Process. , pp. 1491-1494
    • Raj, B.1    Singh, R.2    Stern, R.3
  • 4
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke, P. Green, L. Josifovksi, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol.34, pp. 267-285, 2001.
    • (2001) Speech Commun , vol.34 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovksi, L.3    Vizinho, A.4
  • 5
    • 85032752225 scopus 로고    scopus 로고
    • Missing-feature approaches in speech recognition
    • Sep.
    • B. Raj and R. M. Stern, "Missing-feature approaches in speech recognition," Signal Process. Mag. , vol.22, no.5, pp. 101-116, Sep. 2005.
    • (2005) Signal Process. Mag. , vol.22 , Issue.5 , pp. 101-116
    • Raj, B.1    Stern, R.M.2
  • 7
    • 4644336054 scopus 로고    scopus 로고
    • Reconstruction of missing features for robust speech recognition
    • B. Raj, M. Seltzer, and R. Stern, "Reconstruction of missing features for robust speech recognition," Speech Commun., vol.43, pp. 275-296, 2004.
    • (2004) Speech Commun , vol.43 , pp. 275-296
    • Raj, B.1    Seltzer, M.2    Stern, R.3
  • 8
    • 33645712892 scopus 로고    scopus 로고
    • Compressed sensing
    • Apr.
    • D. L. Donoho, "Compressed sensing," IEEE Trans. Inf. Theory, vol.52, no.4, pp. 1289-1306, Apr. 2006.
    • (2006) IEEE Trans. Inf. Theory , vol.52 , Issue.4 , pp. 1289-1306
    • Donoho, D.L.1
  • 9
    • 33745604236 scopus 로고    scopus 로고
    • Stable signal recovery from incomplete and inaccurate measurements
    • E. J. Candès, J. Romberg, and T. Tao, "Stable signal recovery from incomplete and inaccurate measurements," Commun. Pure Appl. Math., vol.59, no.8, pp. 1207-1223, 2006.
    • (2006) Commun. Pure Appl. Math. , vol.59 , Issue.8 , pp. 1207-1223
    • Candès, E.J.1    Romberg, J.2    Tao, T.3
  • 10
    • 67749091294 scopus 로고    scopus 로고
    • When is missing data recoverable?
    • Rice Univ., Houston, TX
    • Y. Zhang, When is missing data recoverable? Rice Univ., Houston, TX, CAAM Tech. Rep. TR06-15, 2006.
    • (2006) CAAM Tech. Rep. TR06-15
    • Zhang, Y.1
  • 12
    • 0038669544 scopus 로고    scopus 로고
    • The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Paris, France
    • H. Hirsch and D. Pearce, "The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ISCA ASR2000 Workshop, Paris, France, 2000, pp. 181-188.
    • (2000) Proc. ISCA ASR2000 Workshop , pp. 181-188
    • Hirsch, H.1    Pearce, D.2
  • 13
    • 4544315110 scopus 로고    scopus 로고
    • Robust speech recognition using cepstral domain missing data techniques and noisy masks
    • H. Van Hamme, "Robust speech recognition using cepstral domain missing data techniques and noisy masks," in Proc. IEEE ICASSP, 2004, vol.1, pp. 213-216.
    • (2004) Proc. IEEE ICASSP , vol.1 , pp. 213-216
    • Van Hamme, H.1
  • 15
    • 11144316019 scopus 로고    scopus 로고
    • Decoding speech in the presence of other sources
    • J. Barker,M. Cooke, and D. Ellis, "Decoding speech in the presence of other sources," Speech Commun., vol.45, pp. 5-25, 2005.
    • (2005) Speech Commun , vol.45 , pp. 5-25
    • Barkerm. Cooke, J.1    Ellis, D.2
  • 17
    • 0002603206 scopus 로고    scopus 로고
    • Missing data theory, spectral subtraction and signal-to-noise estimation for robust asr: An integrated study
    • A. Vizinho, P. Green, M. Cooke, and L. Josifovski, "Missing data theory, spectral subtraction and signal-to-noise estimation for robust asr: An integrated study," in Proc. Interspeech'99, 1999, pp. 2407-2410.
    • (1999) Proc. Interspeech'99 , pp. 2407-2410
    • Vizinho, A.1    Green, P.2    Cooke, M.3    Josifovski, L.4
  • 18
    • 4644317224 scopus 로고    scopus 로고
    • A bayesian classifier for spectrographic mask estimation for missing feature speech recognition
    • M. Seltzer, B. Raj, and R. Stern, "A bayesian classifier for spectrographic mask estimation for missing feature speech recognition," Speech Commun., vol.43, pp. 379-393, 2004.
    • (2004) Speech Commun , vol.43 , pp. 379-393
    • Seltzer, M.1    Raj, B.2    Stern, R.3
  • 19
    • 33947703708 scopus 로고    scopus 로고
    • Band-independent mask estimation for missing-feature reconstruction in the presence of unknown background noise
    • W. Kim and R. M. Stern, "Band-independent mask estimation for missing-feature reconstruction in the presence of unknown background noise," in Proc. IEEE ICASSP, 2006, pp. 305-308.
    • (2006) Proc. IEEE ICASSP , pp. 305-308
    • Kim, W.1    Stern, R.M.2
  • 20
    • 33744971131 scopus 로고    scopus 로고
    • Mask estimation for missing data speech recognition based on statistics of binaural interaction
    • Jan.
    • S. Harding, J. Barker, and G. J. Brown, "Mask estimation for missing data speech recognition based on statistics of binaural interaction," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.1, pp. 58-67, Jan. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.1 , pp. 58-67
    • Harding, S.1    Barker, J.2    Brown, G.J.3
  • 21
    • 34748817500 scopus 로고    scopus 로고
    • Exploiting correlogram structure for robust speech recognition with multiple speech sources
    • N. Ma, P. Green, J. Barker, and A. Coy, "Exploiting correlogram structure for robust speech recognition with multiple speech sources," Speech Commun., vol.49, no.12, pp. 874-891, 2007.
    • (2007) Speech Commun , vol.49 , Issue.12 , pp. 874-891
    • Ma, N.1    Green, P.2    Barker, J.3    Coy, A.4
  • 22
    • 33847629729 scopus 로고    scopus 로고
    • On noise masking for automatic missing data speech recognition: A survey and discussion
    • C. Cerisara, S. Demange, and J.-P. Haton, "On noise masking for automatic missing data speech recognition: A survey and discussion," Comput. Speech Lang., vol.21, no.3, pp. 443-457, 2007.
    • (2007) Comput. Speech Lang. , vol.21 , Issue.3 , pp. 443-457
    • Cerisara, C.1    Demange, S.2    Haton, J.-P.3
  • 23
    • 85009106519 scopus 로고    scopus 로고
    • Robust asr based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise
    • J. Barker, M. Cooke, and P. Green, "Robust asr based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise," in Proc. Eurospeech, 2001.
    • (2001) Proc. Eurospeech
    • Barker, J.1    Cooke, M.2    Green, P.3
  • 24
    • 70349196731 scopus 로고    scopus 로고
    • Using sparse representations for missing data imputation in noise robust speech recognition
    • J. Gemmeke and B. Cranen, "Using sparse representations for missing data imputation in noise robust speech recognition," in Proc. EUSIPCO 2008, 2008.
    • (2008) Proc. EUSIPCO 2008
    • Gemmeke, J.1    Cranen, B.2
  • 25
    • 0029291966 scopus 로고
    • Sparse approximate solutions to linear systems
    • B. K. Natarajan, "Sparse approximate solutions to linear systems," SIAM J. Comput., vol.24, no.2, pp. 227-234, 1995.
    • (1995) SIAM J. Comput. , vol.24 , Issue.2 , pp. 227-234
    • Natarajan, B.K.1
  • 26
    • 33646365077 scopus 로고    scopus 로고
    • For most large underdetermined systems of linear equations the minimal l1-norm solution is also the sparsest solution
    • D. L. Donoho, "For most large underdetermined systems of linear equations the minimal l1-norm solution is also the sparsest solution," Commun. Pure Appl. Math., vol.59, no.6, pp. 797-829, 2006.
    • (2006) Commun. Pure Appl. Math. , vol.59 , Issue.6 , pp. 797-829
    • Donoho, D.L.1
  • 27
    • 0001287271 scopus 로고    scopus 로고
    • Regression shrinkage and selection via the lasso
    • R. Tibshirani, "Regression shrinkage and selection via the lasso," J. R. Statist. Soc. Ser. B (Methodological), vol.58, no.1, pp. 267-288, 1996.
    • (1996) J. R. Statist. Soc. Ser. B (Methodological) , vol.58 , Issue.1 , pp. 267-288
    • Tibshirani, R.1
  • 29
    • 29144439194 scopus 로고    scopus 로고
    • Decoding by linear programming
    • Dec.
    • E. J. Candès and T. Tao, "Decoding by linear programming," IEEE Trans. Inf. Theory, vol.51, no.12, pp. 4203-4215, Dec. 2005.
    • (2005) IEEE Trans. Inf. Theory , vol.51 , Issue.12 , pp. 4203-4215
    • Candès, E.J.1    Tao, T.2
  • 30
    • 34249687049 scopus 로고    scopus 로고
    • Sparsity and incoherence in compressive sampling
    • Jun. [Online].Available:
    • E. J. Candès and J. Romberg, "Sparsity and incoherence in compressive sampling," Inverse Problems vol.23, no.3, pp. 969-985, Jun. 2007 [Online].Available: http://www.acm.caltech.edu/emmanuel/papers/ PartialMeasurements.pdf
    • (2007) Inverse Problems , vol.23 , Issue.3 , pp. 969-985
    • Candès, E.J.1    Romberg, J.2
  • 31
    • 0001654702 scopus 로고
    • Extensions of Lipschitz embeddings into a Hilbert space
    • W. Johnson and J. Lindenstrauss, "Extensions of Lipschitz embeddings into a Hilbert space," Contemporary Math., vol.26, no.10, pp. 189-206, 1984.
    • (1984) Contemporary Math , vol.26 , Issue.10 , pp. 189-206
    • Johnson, W.1    Lindenstrauss, J.2
  • 32
    • 85009128803 scopus 로고    scopus 로고
    • Prospect features and their application to missing data techniques for robust speech recognition
    • H. Van Hamme, "Prospect features and their application to missing data techniques for robust speech recognition," in Proc. Interspeech-04, 2004, pp. 101-104.
    • (2004) Proc. Interspeech-04 , pp. 101-104
    • Van Hamme, H.1
  • 33
    • 33947622695 scopus 로고    scopus 로고
    • Handling time-derivative features in a missing data framework for robust automatic speech recognition
    • H. Van Hamme, "Handling time-derivative features in a missing data framework for robust automatic speech recognition," in Proc. IEEE ICASSP, 2006, pp. 293-296.
    • (2006) Proc. IEEE ICASSP , pp. 293-296
    • Van Hamme, H.1
  • 34
    • 51449106172 scopus 로고    scopus 로고
    • Robust speech recognition using missing data techniqies in the prospect domain and fuzzy masks
    • M. V. Segbroeck and H. Van Hamme, "Robust speech recognition using missing data techniqies in the prospect domain and fuzzy masks," in Proc. IEEE ICASSP, 2008, pp. 4393-4396.
    • (2008) Proc. IEEE ICASSP , pp. 4393-4396
    • Segbroeck, M.V.1    Van Hamme, H.2
  • 35
  • 36
    • 85156264921 scopus 로고    scopus 로고
    • Multiplicative updates for nonnegative quadratic programming in support vector machines
    • Cambridge, MA: MIT Press
    • F. Sha, L. K. Saul, and D. D. Lee, "Multiplicative updates for nonnegative quadratic programming in support vector machines," in Advances in Neural Information Processing Systems 15. Cambridge, MA: MIT Press, 2002, pp. 1041-1048.
    • (2002) Advances in Neural Information Processing Systems 15 , pp. 1041-1048
    • Sha, F.1    Saul, L.K.2    Lee, D.D.3
  • 37
    • 70450167189 scopus 로고    scopus 로고
    • Vector-Quantization based mask estimation for missing data automatic speech recognition
    • Antwerp, Belgium, Aug.
    • M. Van Segbroeck and H. Van Hamme, "Vector-Quantization based mask estimation for missing data automatic speech recognition," in Proc. ICSLP, Antwerp, Belgium, Aug. 2007, pp. 910-913.
    • (2007) Proc. ICSLP , pp. 910-913
    • Van Segbroeck, M.1    Van Hamme, H.2
  • 38
    • 70450163899 scopus 로고    scopus 로고
    • Application of noise robust MDT speech recognition on the SPEECON and SpeechDat-Car databases
    • J. F. Gemmeke, Y. Wang, M. V. Segbroeck, B. Cranen, and H. Van Hamme, "Application of noise robust MDT speech recognition on the SPEECON and SpeechDat-Car databases," in Proc. Interspeech-09, 2009, pp. 1227-1230.
    • (2009) Proc. Interspeech-09 , pp. 1227-1230
    • Gemmeke, J.F.1    Wang, Y.2    Segbroeck, M.V.3    Cranen, B.4    Van Hamme, H.5
  • 40
  • 41
    • 70449584968 scopus 로고    scopus 로고
    • Missing data imputation using compressive sensing techniques for connected digit recognition
    • J. Gemmeke and B. Cranen, "Missing data imputation using compressive sensing techniques for connected digit recognition," in Proc. DSP-09, 2009.
    • (2009) Proc. DSP-09
    • Gemmeke, J.1    Cranen, B.2
  • 42
    • 51949116658 scopus 로고    scopus 로고
    • Motion segmentation via robust subspace separation in the presence of outlying, incomplete, or corrupted trajectories
    • S. Rao, R. Tron, R. Vidal, and Y. Ma, "Motion segmentation via robust subspace separation in the presence of outlying, incomplete, or corrupted trajectories," in Proc. IEEE Inf Conf. Comput. Vis. Pattern Recognition, 2008, pp. 1-8.
    • (2008) Proc. IEEE Inf Conf. Comput. Vis. Pattern Recognition , pp. 1-8
    • Rao, S.1    Tron, R.2    Vidal, R.3    Ma, Y.4
  • 43
    • 27844460570 scopus 로고    scopus 로고
    • Simultaneous cartoon and texture image inpainting using morphological component analysis (mca)
    • Nov.
    • M. Elad, J.-L. Starck, D. Donoho, and P. Querre, "Simultaneous cartoon and texture image inpainting using morphological component analysis (mca)," J. Appl. Comput. Harmonic Anal., vol.19, pp. 340-358, Nov. 2005.
    • (2005) J. Appl. Comput. Harmonic Anal. , vol.19 , pp. 340-358
    • Elad, M.1    Starck, J.-L.2    Donoho, D.3    Querre, P.4
  • 44
    • 60649102899 scopus 로고    scopus 로고
    • Inpainting and zooming using sparse representations
    • M. Fadili, J.-L. Starck, and F. Murtagh, "Inpainting and zooming using sparse representations," Comput. J., vol.52, no.1, pp. 64-79, 2009.
    • (2009) Comput. J. , vol.52 , Issue.1 , pp. 64-79
    • Fadili, M.1    Starck, J.-L.2    Murtagh, F.3
  • 47
    • 34547511508 scopus 로고    scopus 로고
    • Sparse overcomplete decomposition for single channel speaker separation
    • M. V. S. Shashanka, B. Raj, and P. Smaragdis, "Sparse overcomplete decomposition for single channel speaker separation," in Proc. IEEE ICASSP, 2007, pp. 641-644.
    • (2007) Proc. IEEE ICASSP , pp. 641-644
    • Shashanka, M.V.S.1    Raj, B.2    Smaragdis, P.3
  • 48
    • 34247623029 scopus 로고    scopus 로고
    • An automatic speech recognition system based on the scene analysis account of auditory perception
    • A. Coy and J. Barker, "An automatic speech recognition system based on the scene analysis account of auditory perception," Speech Commun., vol.49, no.5, pp. 384-401, 2007.
    • (2007) Speech Commun , vol.49 , Issue.5 , pp. 384-401
    • Coy, A.1    Barker, J.2
  • 49
    • 33745215811 scopus 로고    scopus 로고
    • Prospect features and their application to missing data techniques for vocal tract length normalization
    • W. Jansen and H. Van Hamme, "Prospect features and their application to missing data techniques for vocal tract length normalization," in Proc. Interspeech-05, 2005, pp. 2753-2756.
    • (2005) Proc. Interspeech-05 , pp. 2753-2756
    • Jansen, W.1    Van Hamme, H.2
  • 50
    • 85009063707 scopus 로고    scopus 로고
    • Soft decisions in missing data techniques for robust automatic speech recognition
    • J. Barker, L. Josifovski, M. Cooke, and P. Green, "Soft decisions in missing data techniques for robust automatic speech recognition," in Proc. ICSLP-00, 2000, pp. 373-376.
    • (2000) Proc. ICSLP-00 , pp. 373-376
    • Barker, J.1    Josifovski, L.2    Cooke, M.3    Green, P.4
  • 51
    • 70349227603 scopus 로고    scopus 로고
    • Sparse imputation for noise robust speech recognition using soft masks
    • J. Gemmeke and B. Cranen, "Sparse imputation for noise robust speech recognition using soft masks," in Proc. IEEE ICASSP, 2009, pp. 4645-4648.
    • (2009) Proc. IEEE ICASSP , pp. 4645-4648
    • Gemmeke, J.1    Cranen, B.2
  • 52
    • 70349211664 scopus 로고    scopus 로고
    • 1-minimization
    • [Online]. Available: submitted for publication
    • 1-minimization," IEEE Trans. Inf. Theory 2009 [Online]. Available: http://perception. csl.uiuc.edu/jnwright/, submitted for publication
    • (2009) IEEE Trans. Inf. Theory
    • Wright, J.1    Ma, Y.2
  • 53
    • 70449614853 scopus 로고    scopus 로고
    • Shift invariant sparse coding of image and music data
    • submitted for publication
    • M. Mørup, M. N. Schmidt, and L. K. Hansen, "Shift invariant sparse coding of image and music data," Neural Netw., 2008, submitted for publication.
    • (2008) Neural Netw
    • Mørup, M.1    Schmidt, M.N.2    Hansen, L.K.3
  • 55
    • 33750383209 scopus 로고    scopus 로고
    • The K-SVD: An algorithm for designing of overcomplete dictionaries for sparse representations
    • Nov.
    • M. Aharon, M. Elad, and A. M. Bruckstein, "The K-SVD: An algorithm for designing of overcomplete dictionaries for sparse representations," IEEE Trans. Signal Process., vol.54, no.11, pp. 4311-4322, Nov. 2006.
    • (2006) IEEE Trans. Signal Process. , vol.54 , Issue.11 , pp. 4311-4322
    • Aharon, M.1    Elad, M.2    Bruckstein, A.M.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.