메뉴 건너뛰기




Volumn 1, Issue , 2006, Pages 73-76

A computational auditory scene analysis system for robust speech recognition

Author keywords

Binary time frequency mask; Computational auditory scene analysis; Robust speech recognition; Speech segregation

Indexed keywords

BINS; DEEP NEURAL NETWORKS; PATIENT REHABILITATION; SPEECH; SPEECH COMMUNICATION;

EID: 40749137520     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (11)

References (19)
  • 2
    • 4644257621 scopus 로고    scopus 로고
    • Single microphone source separation using high resolution signal reconstruction
    • T. Kristjansson, H. Attias, and J. Hershey, "Single microphone source separation using high resolution signal reconstruction," in Proc. ICASSP '04, vol. 2, 2004, pp. 817-820.
    • (2004) Proc. ICASSP '04 , vol.2 , pp. 817-820
    • Kristjansson, T.1    Attias, H.2    Hershey, J.3
  • 3
    • 84899014722 scopus 로고    scopus 로고
    • A probabilistic approach to single channel blind signal separation
    • S. Becker, S. Thrun, and K. Obermayer, Eds. Cambridge, MA: MIT Press
    • G-J Jang and T-W Lee, "A probabilistic approach to single channel blind signal separation," in Advances in Neural Information Processing Systems 15, S. Becker, S. Thrun, and K. Obermayer, Eds. Cambridge, MA: MIT Press, 2003, pp. 1173-1180.
    • (2003) Advances in Neural Information Processing Systems 15 , pp. 1173-1180
    • Jang, G.-J.1    Lee, T.-W.2
  • 4
    • 33745190244 scopus 로고    scopus 로고
    • Recognizing speech from simultaneous speakers
    • B. Raj, R. Singh, and P. Smaragdis, "Recognizing speech from simultaneous speakers," in Proc. Interspeech '05, 2005, pp. 3317-3320.
    • (2005) Proc. Interspeech '05 , pp. 3317-3320
    • Raj, B.1    Singh, R.2    Smaragdis, P.3
  • 5
    • 4544369701 scopus 로고    scopus 로고
    • A factorial HMM approach to simultaneous recognition of isolated digits spoken by multiple talkers on one audio channel
    • A. N. Deoras and M. Hasegawa-Johnson, "A factorial HMM approach to simultaneous recognition of isolated digits spoken by multiple talkers on one audio channel," in Proc. ICASSP '04, vol. 1, 2004, pp. 861-864.
    • (2004) Proc. ICASSP '04 , vol.1 , pp. 861-864
    • Deoras, A.N.1    Hasegawa-Johnson, M.2
  • 6
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary mask as the computational goal of auditory scene analysis
    • P. Divenyi, Ed, Norwell, MA
    • D. L. Wang, "On ideal binary mask as the computational goal of auditory scene analysis," in Speech separation by humans and machines, P. Divenyi, Ed., Norwell, MA, 2005, pp. 181-197.
    • (2005) Speech separation by humans and machines , pp. 181-197
    • Wang, D.L.1
  • 7
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Comm., vol. 34, pp. 267-285, 2001.
    • (2001) Speech Comm , vol.34 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 8
    • 0142026377 scopus 로고    scopus 로고
    • Speech segregation based on sound localization
    • N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Am., vol. 114, pp. 2236-2252, 2003.
    • (2003) J. Acoust. Soc. Am , vol.114 , pp. 2236-2252
    • Roman, N.1    Wang, D.L.2    Brown, G.J.3
  • 9
    • 85022115206 scopus 로고    scopus 로고
    • A. S. Bregman, Auditory scene analysis. Cambridge, MA: The MIT Press, 1990.
    • A. S. Bregman, Auditory scene analysis. Cambridge, MA: The MIT Press, 1990.
  • 10
    • 0032682770 scopus 로고    scopus 로고
    • Separation of speech from interfering sounds based on oscillatory correlation
    • D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. on Neural Networks, vol. 10, no. 3, pp. 684-697, 1999.
    • (1999) IEEE Trans. on Neural Networks , vol.10 , Issue.3 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 12
    • 4644336054 scopus 로고    scopus 로고
    • Reconstruction of missing features for robust speech recognition
    • B. Raj, M. L. Seltzer, and R. M. Stem, "Reconstruction of missing features for robust speech recognition," Speech Communication, vol. 43, pp. 275-296, 2004.
    • (2004) Speech Communication , vol.43 , pp. 275-296
    • Raj, B.1    Seltzer, M.L.2    Stem, R.M.3
  • 14
    • 4644265990 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • G. Hu and D. L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. on Neural Networks, vol. 15, pp. 1135-1150, 2004.
    • (2004) IEEE Trans. on Neural Networks , vol.15 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 15
    • 85045165251 scopus 로고    scopus 로고
    • Monaural speech organization and segregation,
    • Ph.D. dissertation, Biophysics Program, The Ohio State University
    • G. Hu, "Monaural speech organization and segregation," Ph.D. dissertation, Biophysics Program, The Ohio State University, 2006.
    • (2006)
    • Hu, G.1
  • 17
    • 33947649051 scopus 로고    scopus 로고
    • Robust speaker recognition using binary time-frequency masks
    • _, "Robust speaker recognition using binary time-frequency masks," in Proc. ICASSP '06, vol. I, 2006, pp. 645-648.
    • (2006) Proc. ICASSP '06 , vol.1 , pp. 645-648
    • Shao, Y.1    Wang, D.L.2
  • 18
    • 0026172104 scopus 로고
    • Watersheds in digital spaces: An efficient algorithm based on immersion simulations
    • L. Vincent and P. Soille, "Watersheds in digital spaces: An efficient algorithm based on immersion simulations," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 13, no. 6, pp. 583-598, 1991.
    • (1991) IEEE Trans. on Pattern Analysis and Machine Intelligence , vol.13 , Issue.6 , pp. 583-598
    • Vincent, L.1    Soille, P.2
  • 19
    • 33947644911 scopus 로고    scopus 로고
    • A supervised learning approach to uncertainty decoding for robust speech recognition
    • S. Srinivasan and D. L. Wang, "A supervised learning approach to uncertainty decoding for robust speech recognition," in Proc. ICASSP '06, vol. I, 2006, pp. 297-300.
    • (2006) Proc. ICASSP '06 , vol.1 , pp. 297-300
    • Srinivasan, S.1    Wang, D.L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.