



Volume 52, Issue 1, 2010, Pages 72-81

Robust speech recognition by integrating speech separation and hypothesis testing

Author keywords

Ideal binary mask; Missing data recognizer; Robust speech recognition; Speech segregation; Top down processing

Indexed keywords

IDEAL BINARY MASK; MISSING-DATA RECOGNIZER; ROBUST SPEECH RECOGNITION; SPEECH SEGREGATION; TOP-DOWN PROCESSING

EID: 70350038037     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2009.08.008     Document Type: Article
Times cited: 21

References (39)
  • 3
    • 70350032356
    • Boersma, P., Weenink, D., 2002. Praat: doing phonetics by computer. Version 4.0.26. Last viewed on 24 October 2007. URL .
  • 7
    • 0035342414
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • Cooke M.P., Green P., Josifovski L., and Vizinho A. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Communication 34 (2001) 267-285
    • (2001) Speech Communication , vol.34 , pp. 267-285
    • Cooke, M.P.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 9
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis S.B., and Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing ASSP 28 4 (1980) 357-366
    • (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing ASSP , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 12
    • 0026843273
    • A Bayesian estimation approach for speech enhancement using hidden Markov models
    • Ephraim Y. A Bayesian estimation approach for speech enhancement using hidden Markov models. IEEE Transactions on Signal Processing 40 4 (1992) 725-735
    • (1992) IEEE Transactions on Signal Processing , vol.40 , Issue.4 , pp. 725-735
    • Ephraim, Y.1
  • 13
    • 70349227947
    • The application of Hidden Markov models in speech recognition
    • Gales M., and Young S. The application of Hidden Markov models in speech recognition. Foundations and Trends in Signal Processing 1 3 (2007) 195-304
    • (2007) Foundations and Trends in Signal Processing , vol.1 , Issue.3 , pp. 195-304
    • Gales, M.1    Young, S.2
  • 14
    • 0029288202
    • Speech recognition in noisy environments: a survey
    • Gong Y. Speech recognition in noisy environments: a survey. Speech Communication 16 (1995) 261-291
    • (1995) Speech Communication , vol.16 , pp. 261-291
    • Gong, Y.1
  • 15
    • 4644265990
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • Hu G., and Wang D.L. Monaural speech segregation based on pitch tracking and amplitude modulation. IEEE Transactions on Neural Networks 15 (2004) 1135-1150
    • (2004) IEEE Transactions on Neural Networks , vol.15 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 19
    • 0142063407
    • Novelty detection: a review-part 1: statistical approaches
    • Markou M., and Singh S. Novelty detection: a review-part 1: statistical approaches. Signal Processing 83 12 (2003) 2481-2497
    • (2003) Signal Processing , vol.83 , Issue.12 , pp. 2481-2497
    • Markou, M.1    Singh, S.2
  • 28
    • 33750311718
    • Binary and ratio time-frequency masks for robust speech recognition
    • Srinivasan S., Roman N., and Wang D.L. Binary and ratio time-frequency masks for robust speech recognition. Speech Communication 48 (2006) 1486-1501
    • (2006) Speech Communication , vol.48 , pp. 1486-1501
    • Srinivasan, S.1    Roman, N.2    Wang, D.L.3
  • 30
    • 11144339352
    • A schema-based model for phonemic restoration
    • Srinivasan S., and Wang D.L. A schema-based model for phonemic restoration. Speech Communication 45 (2005) 63-87
    • (2005) Speech Communication , vol.45 , pp. 63-87
    • Srinivasan, S.1    Wang, D.L.2
  • 32
    • 84947734535
    • Outlier detection using classifier instability
    • Tax, D.M.J., Duin, R.P.W., 1998. Outlier detection using classifier instability. In: Amin, A., Dori, D., Pudil, P., Freeman, H. (Eds.), Proceedings of the Joint IAPR International Workshops on Advances in Pattern Recognition, Lecture Notes in Computer Science, vol. 1451, Springer, Berlin, pp. 593-601.
    • (1998) Lecture Notes in Computer Science , vol.1451 , pp. 593-601
    • Tax, D.M.J.1    Duin, R.P.W.2
  • 35
    • 0004319968
    • The NOISEX-92 study on the effect of additive noise on automatic speech recognition
    • Technical Report, Speech Research Unit, Defense Research Agency, Malvern, UK
    • Varga, A.P., Steeneken, H.J.M., Tomlinson, M., Jones, D., 1992. The NOISEX-92 study on the effect of additive noise on automatic speech recognition. Technical Report, Speech Research Unit, Defense Research Agency, Malvern, UK.
    • (1992)
    • Varga, A.P.1    Steeneken, H.J.M.2    Tomlinson, M.3    Jones, D.4
  • 36
    • 0032682770
    • Separation of speech from interfering sounds based on oscillatory correlation
    • Wang D.L., and Brown G.J. Separation of speech from interfering sounds based on oscillatory correlation. IEEE Transactions on Neural Networks 10 3 (1999) 684-697
    • (1999) IEEE Transactions on Neural Networks , vol.10 , Issue.3 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 38
    • 0033344872
    • Confidence measures from local posterior probability estimates
    • Williams G., and Renals S. Confidence measures from local posterior probability estimates. Computer Speech and Language 13 (1999) 395-413
    • (1999) Computer Speech and Language , vol.13 , pp. 395-413
    • Williams, G.1    Renals, S.2
  • 39
    • 70350021067
    • Young, S., Kershaw, D., Odell, J., Valtchev, V., Woodland, P., 2000. The HTK Book (for HTK Version 3.0). Microsoft Corporation.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.