SCOPUS Information Retrieval Platform

Computer Speech and Language

Volume 25, Issue 2, 2011, Pages 462-479

Sparse imputation for large vocabulary noise robust ASR

(3) Gemmeke, Jort Florent a; Cranen, Bert a; Remes, Ulpu b

a RADBOUD UNIVERSITY NIJMEGEN (Netherlands)

b AALTO UNIVERSITY (Finland)

Author keywords

Automatic speech recognition; Missing data techniques; Noise robustness; Sparse imputation

Indexed keywords

A-FRAMES; AUTOMATIC SPEECH RECOGNITION; CLEAN SPEECH; CONNECTED DIGITS; ERROR PRONES; FEATURE RELIABILITY; IMPUTATION TECHNIQUES; LARGE VOCABULARY; MISSING DATA TECHNIQUES; NOISE CONDITIONS; NOISE ROBUSTNESS; NOISY SPEECH; PARAMETRIC MODELS; ROBUST ASR; SPARSE IMPUTATION; TIME FRAME;

ACOUSTIC NOISE; FEATURE EXTRACTION; VOCABULARY CONTROL;

SPEECH RECOGNITION;

EID: 78049527664 PISSN: 08852308 EISSN: 10958363 Source Type: Journal
DOI: 10.1016/j.csl.2010.06.004 Document Type: Article

Times cited : (26)

References (52)

1
- 85009063707
- Soft decisions in missing data techniques for robust automatic speech recognition
- Beijing, China
- J. Barker, L. Josifovski, M. Cooke, and P. Green Soft decisions in missing data techniques for robust automatic speech recognition Proc. ICSLP Beijing, China 2000 373 376
- (2000) Proc. ICSLP , pp. 373-376
- Barker, J.¹ Josifovski, L.² Cooke, M.³ Green, P.⁴

2
- 85009106519
- Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise
- Aalborg, Denmark
- J. Barker, M. Cooke, and P. Green Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise Proc. EUROSPEECH Aalborg, Denmark 2001 213 216
- (2001) Proc. EUROSPEECH , pp. 213-216
- Barker, J.¹ Cooke, M.² Green, P.³

3
- 11144316019
- Decoding speech in the presence of other sources
- J. Barker, M. Cooke, and D. Ellis Decoding speech in the presence of other sources Speech Communication 45 1 2005 5 25
- (2005) Speech Communication , vol.45 , Issue.1 , pp. 5-25
- Barker, J.¹ Cooke, M.² Ellis, D.³

4
- 33745604236
- Stable signal recovery from incomplete and inaccurate measurements
- E.J. Candès, J. Romberg, and T. Tao Stable signal recovery from incomplete and inaccurate measurements Communications on Pure and Applied Mathematics 59 8 2006 1207 1223
- (2006) Communications on Pure and Applied Mathematics , vol.59 , Issue.8 , pp. 1207-1223
- Candès, E.J.¹ Romberg, J.² Tao, T.³

5
- 33847629729
- On noise masking for automatic missing data speech recognition: A survey and discussion
- C. Cerisara, S. Demange, and J.-P. Haton On noise masking for automatic missing data speech recognition: a survey and discussion Computer Speech and Language 21 3 2007 443 457
- (2007) Computer Speech and Language , vol.21 , Issue.3 , pp. 443-457
- Cerisara, C.¹ Demange, S.² Haton, J.-P.³

6
- 84869001637
- Handling missing data in speech recognition
- Yokohama, Japan
- M. Cooke, P. Green, and M. Crawford Handling missing data in speech recognition Proc. ICSLP Yokohama, Japan 1994 1555 1558
- (1994) Proc. ICSLP , pp. 1555-1558
- Cooke, M.¹ Green, P.² Crawford, M.³

7
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho Robust automatic speech recognition with missing and unreliable acoustic data Speech Communication 34 3 2001 267 285
- (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

8
- 33644661135
- A glimpsing model of speech perception in noise
- M. Cooke A glimpsing model of speech perception in noise J. Acoust. Soc. Am. 119 3 2006 1562 1573
- (2006) J. Acoust. Soc. Am. , vol.119 , Issue.3 , pp. 1562-1573
- Cooke, M.¹

9
- 11844273505
- Unsupervised discovery of morphemes
- Philadelphia, Pennsylvania, USA
- M. Creutz, and K. Lagus Unsupervised discovery of morphemes Proc. ACL-02 Workshop on Morphological and Phonological Learning Philadelphia, Pennsylvania, USA 2002 21 30
- (2002) Proc. ACL-02 Workshop on Morphological and Phonological Learning , pp. 21-30
- Creutz, M.¹ Lagus, K.²

10
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- S.B. Davis, and P. Mermelstein Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Transactions on Acoustics, Speech, and Signal Processing 28 4 1980 357 366
- (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

11
- 33645712892
- Compressed sensing
- D.L. Donoho Compressed sensing IEEE Transactions on Information Theory 52 4 2006 1289 1306
- (2006) IEEE Transactions on Information Theory , vol.52 , Issue.4 , pp. 1289-1306
- Donoho, D.L.¹

12
- 33646365077
- For most large underdetermined systems of linear equations the minimal L1-norm solution is also the sparsest solution
- D.L. Donoho For most large underdetermined systems of linear equations the minimal L1-norm solution is also the sparsest solution Communications on Pure and Applied Mathematics 59 6 2006 797 829
- (2006) Communications on Pure and Applied Mathematics , vol.59 , Issue.6 , pp. 797-829
- Donoho, D.L.¹

13
- 0032638856
- Semi-tied covariance matrices for hidden Markov models
- M. Gales Semi-tied covariance matrices for hidden Markov models IEEE Transactions on Speech and Audio Processing 7 3 1999 272 281
- (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , Issue.3 , pp. 272-281
- Gales, M.¹

14
- 70349196731
- Using sparse representations for missing data imputation in noise robust speech recognition
- Lausanne, Switzerland
- J. Gemmeke, and B. Cranen Using sparse representations for missing data imputation in noise robust speech recognition Proc. EUSIPCO Lausanne, Switzerland 2008
- (2008) Proc. EUSIPCO
- Gemmeke, J.¹ Cranen, B.²

15
- 70449584968
- Missing data imputation using compressive sensing techniques for connected digit recognition
- Santorini, Greece
- J. Gemmeke, and B. Cranen Missing data imputation using compressive sensing techniques for connected digit recognition Proc. DSP Santorini, Greece 2009 1 8
- (2009) Proc. DSP , pp. 1-8
- Gemmeke, J.¹ Cranen, B.²

16
- 70349227603
- Sparse imputation for noise robust speech recognition using soft masks
- Taipei, Taiwan
- J. Gemmeke, and B. Cranen Sparse imputation for noise robust speech recognition using soft masks Proc. ICASSP Taipei, Taiwan 2009 4645 4648
- (2009) Proc. ICASSP , pp. 4645-4648
- Gemmeke, J.¹ Cranen, B.²

17
- 70450163899
- Application of noise robust MDT speech recognition on the SPEECON and SpeechDat-Car databases
- Brighton, UK
- J.F. Gemmeke, Y. Wang, M. Van Segbroeck, B. Cranen, and H. Van hamme Application of noise robust MDT speech recognition on the SPEECON and SpeechDat-Car databases Proc. INTERSPEECH Brighton, UK 2009
- (2009) Proc. INTERSPEECH
- Gemmeke, J.F.¹ Wang, Y.² Van Segbroeck, M.³ Cranen, B.⁴ Van Hamme, H.⁵

18
- 77949695902
- Compressive sensing for missing data imputation in noise robust speech recognition
- J.F. Gemmeke, H. Van hamme, B. Cranen, and L. Boves Compressive sensing for missing data imputation in noise robust speech recognition IEEE Journal of Selected Topics in Signal Processing 4 2 2010 272 287
- (2010) IEEE Journal of Selected Topics in Signal Processing , vol.4 , Issue.2 , pp. 272-287
- Gemmeke, J.F.¹ Van Hamme, H.² Cranen, B.³ Boves, L.⁴

19
- 79959814198
- Observation uncertainty measures for sparse imputation
- Gemmeke, J.F., Remes, U., Palomäki, K.J., 2010. Observation uncertainty measures for sparse imputation. In: Accepted to INTERSPEECH.
- (2010) Accepted to INTERSPEECH
- Gemmeke, J.F.¹ Remes, U.² Palomäki, K.J.³

20
- 33744971131
- Mask estimation for missing data speech recognition based on statistics of binaural interaction
- S. Harding, J. Barker, and G.J. Brown Mask estimation for missing data speech recognition based on statistics of binaural interaction IEEE Transactions on Audio, Speech and Language Processing 14 1 2006 58 67
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 58-67
- Harding, S.¹ Barker, J.² Brown, G.J.³

21
- 0038669544
- The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- Paris, France
- H. Hirsch, and D. Pearce The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions Proc. ISCA Tutorial and Research Workshop ASR2000 Paris, France 2000 181 188
- (2000) Proc. ISCA Tutorial and Research Workshop ASR2000 , pp. 181-188
- Hirsch, H.¹ Pearce, D.²

22
- 33746524944
- Unlimited vocabulary speech recognition with morph language models applied to Finnish
- T. Hirsimäki, M. Creutz, V. Siivola, M. Kurimo, S. Virpioja, and J. Pylkkönen Unlimited vocabulary speech recognition with morph language models applied to Finnish Computer Speech and Language 20 4 2006 515 541
- (2006) Computer Speech and Language , vol.20 , Issue.4 , pp. 515-541
- Hirsimäki, T.¹ Creutz, M.² Siivola, V.³ Kurimo, M.⁴ Virpioja, S.⁵ Pylkkönen, J.⁶

23
- 84910032186
- SPEECON - Speech databases for consumer devices: Database specification and validation
- Las Palmas, Canary Islands, Spain
- D. Iskra, B. Grosskopf, K. Marasek, H.V.D. Heuvel, F. Diehl, and A. Kiessling SPEECON - speech databases for consumer devices: database specification and validation Proc. LREC Las Palmas, Canary Islands, Spain 2002 329 333
- (2002) Proc. LREC , pp. 329-333
- Iskra, D.¹ Grosskopf, B.² Marasek, K.³ Heuvel, H.V.D.⁴ Diehl, F.⁵ Kiessling, A.⁶

24
- 0037841203
- State based imputation of missing data for robust speech recognition and speech enhancement
- Budapest, Hungary
- L. Josifovski, M. Cooke, P. Green, and A. Vizinho State based imputation of missing data for robust speech recognition and speech enhancement Proc. EUROSPEECH Budapest, Hungary 1999 2837 2840
- (1999) Proc. EUROSPEECH , pp. 2837-2840
- Josifovski, L.¹ Cooke, M.² Green, P.³ Vizinho, A.⁴

25
- 0003770709
- Kluwer Academic Publishers
- J.-C. Junqua, and J.-P. Haton Robustness in Automatic Speech Recognition: Fundamentals and Applications 1996 Kluwer Academic Publishers
- (1996) Robustness in Automatic Speech Recognition: Fundamentals and Applications
- Junqua, J.-C.¹ Haton, J.-P.²

26
- 33947703708
- Band-independent mask estimation for missing-feature reconstruction in the presence of unknown background noise
- Toulouse, France
- W. Kim, and R.M. Stern Band-independent mask estimation for missing-feature reconstruction in the presence of unknown background noise Proc. ICASSP Toulouse, France 2006 305 308
- (2006) Proc. ICASSP , pp. 305-308
- Kim, W.¹ Stern, R.M.²

27
- 39449109476
- An interior-point method for large-scale l1-regularized least squares
- S. Kim, K. Koh, M. Lustig, S. Boyd, and D. Gorinevsky An interior-point method for large-scale l1-regularized least squares IEEE Journal of Selected Topics in Signal Processing 1 4 2007 606 617
- (2007) IEEE Journal of Selected Topics in Signal Processing , vol.1 , Issue.4 , pp. 606-617
- Kim, S.¹ Koh, K.² Lustig, M.³ Boyd, S.⁴ Gorinevsky, D.⁵

28
- 40249103761
- Issues with uncertainty decoding for noise robust automatic speech recognition
- H. Liao, and M.J.F. Gales Issues with uncertainty decoding for noise robust automatic speech recognition Speech Communication 50 4 2008 265 277
- (2008) Speech Communication , vol.50 , Issue.4 , pp. 265-277
- Liao, H.¹ Gales, M.J.F.²

29
- 34748817500
- Exploiting correlogram structure for robust speech recognition with multiple speech sources
- N. Ma, P. Green, J. Barker, and A. Coy Exploiting correlogram structure for robust speech recognition with multiple speech sources Speech Communication 49 12 2007 874 891
- (2007) Speech Communication , vol.49 , Issue.12 , pp. 874-891
- Ma, N.¹ Green, P.² Barker, J.³ Coy, A.⁴

30
- 0024753593
- Speech recognition using noise-adaptive prototypes
- A. Nadas, D. Nahamoo, and M. Picheny Speech recognition using noise-adaptive prototypes IEEE Transactions on Acoustics, Speech, and Signal Processing 37 10 1989 1495 1503
- (1989) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.37 , Issue.10 , pp. 1495-1503
- Nadas, A.¹ Nahamoo, D.² Picheny, M.³

31
- 0003805597
- Ph.D. Thesis. University of Cambridge
- Odell, J.J., 1995. The use of context in large vocabulary speech recognition. Ph.D. Thesis. University of Cambridge.
- (1995) The Use of Context in Large Vocabulary Speech Recognition
- Odell, J.J.¹

32
- 33745202179
- An efficient one-pass decoder for Finnish large vocabulary continuous speech recognition
- Tallinn, Estonia
- J. Pylkkönen An efficient one-pass decoder for Finnish large vocabulary continuous speech recognition Proc. 2nd Baltic Conference on Human Language Technologies Tallinn, Estonia 2005 167 172
- (2005) Proc. 2nd Baltic Conference on Human Language Technologies , pp. 167-172
- Pylkkönen, J.¹

33
- 85009089669
- Duration modeling techniques for continuous speech recognition
- Jeju Island, Korea
- J. Pylkkönen, and M. Kurimo Duration modeling techniques for continuous speech recognition Proc. INTERSPEECH Jeju Island, Korea 2004 385 388
- (2004) Proc. INTERSPEECH , pp. 385-388
- Pylkkönen, J.¹ Kurimo, M.²

34
- 85032752225
- Missing-feature approaches in speech recognition
- B. Raj, and R.M. Stern Missing-feature approaches in speech recognition IEEE Signal Processing Magazine 22 5 2005 101 116
- (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 101-116
- Raj, B.¹ Stern, R.M.²

35
- 0001774275
- Inference of missing spectrographic features for robust automatic speech recognition
- Sydney, Australia
- B. Raj, R. Singh, and R. Stern Inference of missing spectrographic features for robust automatic speech recognition Proc. ICSLP Sydney, Australia 1998 1491 1494
- (1998) Proc. ICSLP , pp. 1491-1494
- Raj, B.¹ Singh, R.² Stern, R.³

36
- 4644336054
- Reconstruction of missing features for robust speech recognition
- B. Raj, M. Seltzer, and R. Stern Reconstruction of missing features for robust speech recognition Speech Communication 43 4 2004 275 296
- (2004) Speech Communication , vol.43 , Issue.4 , pp. 275-296
- Raj, B.¹ Seltzer, M.² Stern, R.³

37
- 0038331253
- Ph.D. Thesis, Carnegie Mellon University
- Raj, B., 2000. Reconstruction of incomplete spectrograms for robust speech recognition. Ph.D. Thesis, Carnegie Mellon University.
- (2000) Reconstruction of Incomplete Spectrograms for Robust Speech Recognition
- Raj, B.¹

38
- 70450147392
- Missing feature reconstruction and acoustic model adaptation combined for large vocabulary continuous speech recognition
- Lausanne, Switzerland
- U. Remes, K.J. Palomäki, and M. Kurimo Missing feature reconstruction and acoustic model adaptation combined for large vocabulary continuous speech recognition Proc. EUSIPCO Lausanne, Switzerland 2008
- (2008) Proc. EUSIPCO
- Remes, U.¹ Palomäki, K.J.² Kurimo, M.³

39
- 78049530924
- Ph.D. Thesis, K.U. Leuven
- Segbroeck, M.V., 2010. Robust large vocabulary continuous speech recognition using missing data techniques. Ph.D. Thesis, K.U. Leuven.
- (2010) Robust Large Vocabulary Continuous Speech Recognition Using Missing Data Techniques
- Segbroeck, M.V.¹

40
- 4644317224
- A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
- M. Seltzer, B. Raj, and R. Stern A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition Speech Communication 43 4 2004 379 393
- (2004) Speech Communication , vol.43 , Issue.4 , pp. 379-393
- Seltzer, M.¹ Raj, B.² Stern, R.³

41
- 33745217822
- Growing an n-gram language model
- Lisbon, Portugal
- V. Siivola, and B. Pellom Growing an n-gram language model Proc. INTERSPEECH Lisbon, Portugal 2005 1309 1312
- (2005) Proc. INTERSPEECH , pp. 1309-1312
- Siivola, V.¹ Pellom, B.²

42
- 33750311718
- Binary and ratio time-frequency masks for robust speech recognition
- S. Srinivasan, N. Roman, and D. Wang Binary and ratio time-frequency masks for robust speech recognition Speech Communication 48 11 2006 1486 1501
- (2006) Speech Communication , vol.48 , Issue.11 , pp. 1486-1501
- Srinivasan, S.¹ Roman, N.² Wang, D.³

43
- 50449097354
- Ph.D. Thesis, K.U. Leuven
- Stouten, V., 2006. Robust automatic speech recognition in time-varying environments. Ph.D. Thesis, K.U. Leuven.
- (2006) Robust Automatic Speech Recognition in Time-varying Environments
- Stouten, V.¹

44
- 42549131394
- Exploiting temporal correlation of speech for error-robust and bandwidth-flexible distributed speech recognition
- Z.-H. Tan, P. Dalsgaard, and B. Lindberg Exploiting temporal correlation of speech for error-robust and bandwidth-flexible distributed speech recognition IEEE Transactions on Audio, Speech and Language Processing 15 4 2007 1391 1403
- (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , Issue.4 , pp. 1391-1403
- Tan, Z.-H.¹ Dalsgaard, P.² Lindberg, B.³

45
- 78049530897
- CSC Tieteellinen laskenta Oy
- CSC Tieteellinen laskenta Oy, The language bank of Finland, 2001. www.csc.fi/languagebank/.
- (2001) The Language Bank of Finland

46
- 4544315110
- Robust speech recognition using cepstral domain missing data techniques and noisy masks
- Montreal, Quebec, Canada
- H. Van hamme Robust speech recognition using cepstral domain missing data techniques and noisy masks Proc. ICASSP Montreal, Quebec, Canada 2004 213 216
- (2004) Proc. ICASSP , pp. 213-216
- Van Hamme, H.¹

47
- 85009128803
- PROSPECT features and their application to missing data techniques for robust speech recognition
- Jeju Island, Korea
- H. Van hamme PROSPECT features and their application to missing data techniques for robust speech recognition Proc. INTERSPEECH Jeju Island, Korea 2004 101 104
- (2004) Proc. INTERSPEECH , pp. 101-104
- Van Hamme, H.¹

48
- 70450167189
- Vector-Quantization based mask estimation for missing data automatic speech recognition
- Antwerp, Belgium
- M. Van Segbroeck, and H. Van hamme Vector-Quantization based mask estimation for missing data automatic speech recognition Proc. INTERSPEECH Antwerp, Belgium 2007 910 913
- (2007) Proc. INTERSPEECH , pp. 910-913
- Van Segbroeck, M.¹ Van Hamme, H.²

49
- 0027623210
- Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
- A. Varga, and H. Steeneken Assessment for automatic speech recognition: II. NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems Speech Communication 12 3 1993 247 251
- (1993) Speech Communication , vol.12 , Issue.3 , pp. 247-251
- Varga, A.¹ Steeneken, H.²

50
- 0002603206
- Missing data theory, spectral subtraction and signal-to-noise estimation for robust ASR: An integrated study
- Budapest, Hungary
- A. Vizinho, P. Green, M. Cooke, and L. Josifovski Missing data theory, spectral subtraction and signal-to-noise estimation for robust ASR: an integrated study Proc. EUROSPEECH Budapest, Hungary 1999 2407 2410
- (1999) Proc. EUROSPEECH , pp. 2407-2410
- Vizinho, A.¹ Green, P.² Cooke, M.³ Josifovski, L.⁴

51
- 48149090146
- Estimating single-channel source separation masks: Relevance vector machine classifiers vs. pitch-based masking
- Pittsburgh, Pennsylvania, USA
- R. Weiss, and D. Ellis Estimating single-channel source separation masks: relevance vector machine classifiers vs. pitch-based masking Proc. Workshop on Statistical and Perceptual Audition SAPA-06 Pittsburgh, Pennsylvania, USA 2006 31 36
- (2006) Proc. Workshop on Statistical and Perceptual Audition SAPA-06 , pp. 31-36
- Weiss, R.¹ Ellis, D.²

52
- 61549128441
- Robust face recognition via sparse representation
- J. Wright, A.Y. Yang, A. Ganesh, S.S. Sastry, and Y. Ma Robust face recognition via sparse representation IEEE Transactions on Pattern Analysis and Machine Intelligence 31 2 2009 210 227
- (2009) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.31 , Issue.2 , pp. 210-227
- Wright, J.¹ Yang, A.Y.² Ganesh, A.³ Sastry, S.S.⁴ Ma, Y.⁵

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.