메뉴 건너뛰기




Volumn , Issue , 2012, Pages 4685-4688

ASR-driven top-down binary mask estimation using spectral priors

Author keywords

ideal binary mask; mask estimation; robust automatic speech recognition

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; BASELINE RECOGNITION SYSTEMS; BINARY MASKS; ESTIMATION ALGORITHM; IDEAL BINARY MASK; LINGUISTIC INFORMATION; LOW-LEVEL FEATURES; MODEL SELECTION; PILOT STUDIES; SNR IMPROVEMENT; TOP-DOWN APPROACH; TOPDOWN;

EID: 84867589172     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2012.6288964     Document Type: Conference Paper
Times cited : (3)

References (13)
  • 1
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary mask as the computational goal of auditory scene analysis
    • P. Divenyi, Ed., Kluwer Academic, Norwell MA
    • D. L.Wang, "On ideal binary mask as the computational goal of auditory scene analysis," in Speech separation by humans and machines, P. Divenyi, Ed., pp. 181-197. Kluwer Academic, Norwell MA, 2005.
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 2
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Communication, vol. 34, pp. 267-285, 2001.
    • (2001) Speech Communication , vol.34 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 4
    • 79956289561 scopus 로고    scopus 로고
    • A novel mask estimation method employing posterior-based representative mean estimate for missing-feature speech recognition
    • July
    • W. Kim and J. H. L. Hansen, "A novel mask estimation method employing posterior-based representative mean estimate for missing-feature speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 5, pp. 1434-1443, July 2011.
    • (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.5 , pp. 1434-1443
    • Kim, W.1    Hansen, J.H.L.2
  • 5
    • 11144316019 scopus 로고    scopus 로고
    • Decoding speech in the presence of other sources
    • J. Barker, M. Cooke, and D. P. W. Ellis, "Decoding speech in the presence of other sources," Speech Communication, vol. 45, pp. 5-25, 2005.
    • (2005) Speech Communication , vol.45 , pp. 5-25
    • Barker, J.1    Cooke, M.2    Ellis, D.P.W.3
  • 6
    • 70350038037 scopus 로고    scopus 로고
    • Robust speech recognition by integrating speech separation and hypothesis testing
    • S. Srinivasan and D. L. Wang, "Robust speech recognition by integrating speech separation and hypothesis testing," Speech Communication, vol. 52, pp. 72-81, 2010.
    • (2010) Speech Communication , vol.52 , pp. 72-81
    • Srinivasan, S.1    Wang, D.L.2
  • 9
    • 4644336054 scopus 로고    scopus 로고
    • Reconstruction of missing features for robust speech recognition
    • B. Raj, M. L. Seltzer, and R. M. Stern, "Reconstruction of missing features for robust speech recognition," Speech Communication, vol. 43, pp. 275-296, 2004.
    • (2004) Speech Communication , vol.43 , pp. 275-296
    • Raj, B.1    Seltzer, M.L.2    Stern, R.M.3
  • 13
    • 0032626792 scopus 로고    scopus 로고
    • Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures
    • D. P. W. Ellis, "Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures," Speech Communication, vol. 27, pp. 281-298, 1999.
    • (1999) Speech Communication , vol.27 , pp. 281-298
    • Ellis, D.P.W.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.