



Volume 120, Issue 1, 2006, Pages 458-469

Pitch-based monaural segregation of reverberant speech

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC NOISE; ALGORITHMS; HARMONIC GENERATION; HEARING AIDS; REVERBERATION; SENSORY PERCEPTION; SIGNAL TO NOISE RATIO; SPEECH PROCESSING; SPEECH RECOGNITION;

EID: 33745761651     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.2204590     Document Type: Article
Times cited: 35

References (50)
  • 1
    • 0018455820
    • Image method for efficiently simulating small-room acoustics
    • Allen, J. B., and Berkley, D. A. (1979). "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Am. 65, 943-950.
    • (1979) J. Acoust. Soc. Am. , vol.65 , pp. 943-950
    • Allen, J.B.1    Berkley, D.A.2
  • 2
    • 0036649241
    • Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets
    • Barros, A. K., Rutkowski, T., Itakura, F., and Ohnishi, N. (2002). "Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets," IEEE Trans. Neural Netw. 13, 888-893.
    • (2002) IEEE Trans. Neural Netw. , vol.13 , pp. 888-893
    • Barros, A.K.1    Rutkowski, T.2    Itakura, F.3    Ohnishi, N.4
  • 6
    • 0039334758
    • The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions
    • Bronkhorst, A. (2000). "The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions," Acustica 86, 117-128.
    • (2000) Acustica , vol.86 , pp. 117-128
    • Bronkhorst, A.1
  • 7
    • 0028531926
    • Computational auditory scene analysis
    • Brown, G. J., and Cooke, M. (1994). "Computational auditory scene analysis," Comput. Speech Lang. 8, 297-336.
    • (1994) Comput. Speech Lang. , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.2
  • 8
    • 33644639591
    • Separation of speech by computational auditory scene analysis
    • J. Benesty, S. Makino, and J. Chen, eds. (Springer, New York)
    • Brown, G. J., and Wang, D. L. (2005). "Separation of speech by computational auditory scene analysis," in Speech Enhancement, J. Benesty, S. Makino, and J. Chen, eds. (Springer, New York), pp. 371-402.
    • (2005) Speech Enhancement , pp. 371-402
    • Brown, G.J.1    Wang, D.L.2
  • 10
    • 0016522748
    • Anthropometric manikin for acoustic research
    • Burkhard, M. D., and Sachs, R. M. (1975). "Anthropometric manikin for acoustic research," J. Acoust. Soc. Am. 58, 214-222.
    • (1975) J. Acoust. Soc. Am. , vol.58 , pp. 214-222
    • Burkhard, M.D.1    Sachs, R.M.2
  • 12
    • 0035342414
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • Cooke, M. P., Green, P., Josifovski, L., and Vizinho, A. (2001). "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun. 34, 267-285.
    • (2001) Speech Commun. , vol.34 , pp. 267-285
    • Cooke, M.P.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 13
    • 0242440783
    • Effects of reverberation on perceptual segregation of competing voices
    • Culling, J. F., Hodder, K. I., and Toh, C. Y. (2003). "Effects of reverberation on perceptual segregation of competing voices," J. Acoust. Soc. Am. 114, 2871-2876.
    • (2003) J. Acoust. Soc. Am. , vol.114 , pp. 2871-2876
    • Culling, J.F.1    Hodder, K.I.2    Toh, C.Y.3
  • 14
    • 0001698589
    • Auditory grouping
    • B. C. J. Moore, ed. (Academic, London)
    • Darwin, C. J., and Carlyon, R. P. (1995). "Auditory grouping," in The Handbook of Perception and Cognition, vol. 6, B. C. J. Moore, ed. (Academic, London), pp. 387-424.
    • (1995) The Handbook of Perception and Cognition , vol.6 , pp. 387-424
    • Darwin, C.J.1    Carlyon, R.P.2
  • 15
    • 0033939839
    • Effects of reverberation on spatial, prosodic, and vocal-tract size cues to selective attention
    • Darwin, C. J., and Hukin, R. W. (2000). "Effects of reverberation on spatial, prosodic, and vocal-tract size cues to selective attention," J. Acoust. Soc. Am. 108, 335-342.
    • (2000) J. Acoust. Soc. Am. , vol.108 , pp. 335-342
    • Darwin, C.J.1    Hukin, R.W.2
  • 16
    • 0029345417
    • A signal subspace approach for speech enhancement
    • Ephraim, Y., and Van Trees, H. L. (1995). "A signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Process. 3, 251-266.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 251-266
    • Ephraim, Y.1    Van Trees, H.L.2
  • 17
    • 0030674105
    • Two-channel blind deconvolution for non-minimum phase impulse responses
    • Furuya, K., and Kaneda, Y. (1997). "Two-channel blind deconvolution for non-minimum phase impulse responses," Proc. ICASSP, pp. 1315-1318.
    • (1997) Proc. ICASSP , pp. 1315-1318
    • Furuya, K.1    Kaneda, Y.2
  • 19
    • 0036289676
    • Acoustic diversity for improved speech recognition in reverberant environments
    • Gillespie, B. W., and Atlas, L. E. (2002). "Acoustic diversity for improved speech recognition in reverberant environments," Proc. ICASSP, pp. 557-560.
    • (2002) Proc. ICASSP , pp. 557-560
    • Gillespie, B.W.1    Atlas, L.E.2
  • 20
    • 0034857681
    • Speech dereverberation via maximum-kurtosis subband adaptive filtering
    • Gillespie, B. W., Malvar, H. S., and Florencio, D. A. F. (2001). "Speech dereverberation via maximum-kurtosis subband adaptive filtering," Proc. ICASSP, vol. 6, pp. 3701-3704.
    • (2001) Proc. ICASSP , vol.6 , pp. 3701-3704
    • Gillespie, B.W.1    Malvar, H.S.2    Florencio, D.A.F.3
  • 22
    • 0141788523
    • Separation of stop consonants
    • Hu, G., and Wang, D. L. (2003). "Separation of stop consonants," Proc. ICASSP, vol. 2, pp. 749-752.
    • (2003) Proc. ICASSP , vol.2 , pp. 749-752
    • Hu, G.1    Wang, D.L.2
  • 23
    • 4644265990
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • Hu, G., and Wang, D. L. (2004). "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw. 15, 1135-1150.
    • (2004) IEEE Trans. Neural Netw. , vol.15 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 24
    • 33646786460
    • Separation of fricatives and affricates
    • Hu, G., and Wang, D. L. (2005). "Separation of fricatives and affricates," Proc. ICASSP, vol. 1, pp. 1101-1104.
    • (2005) Proc. ICASSP , vol.1 , pp. 1101-1104
    • Hu, G.1    Wang, D.L.2
  • 25
    • 0038630563
    • Single channel signal separation using time-domain basis functions
    • Jang, G.-J., Lee, T.-W., and Oh, Y.-H. (2003). "Single channel signal separation using time-domain basis functions," IEEE Signal Process. Lett. 10(6), 168-171.
    • (2003) IEEE Signal Process. Lett. , vol.10 , Issue.6 , pp. 168-171
    • Jang, G.-J.1    Lee, T.-W.2    Oh, Y.-H.3
  • 26
    • 0001463644
    • A duplex theory of pitch perception
    • Licklider, J. C. R. (1951). "A duplex theory of pitch perception," Experientia 7, 128-134.
    • (1951) Experientia , vol.7 , pp. 128-134
    • Licklider, J.C.R.1
  • 27
    • 0342571948
    • A speech separation system that is robust to reverberation
    • Luo, H. Y., and Denbigh, P. N. (1994). "A speech separation system that is robust to reverberation," Proc. ISSIPNN, pp. 339-342.
    • (1994) Proc. ISSIPNN , pp. 339-342
    • Luo, H.Y.1    Denbigh, P.N.2
  • 28
    • 4544267645
    • Perceptual Kalman filtering for speech enhancement in colored noise
    • Ma, N., Bouchard, M., and Goubran, R. (2004). "Perceptual Kalman filtering for speech enhancement in colored noise," Proc. ICASSP, vol. 1, pp. 717-720.
    • (2004) Proc. ICASSP , vol.1 , pp. 717-720
    • Ma, N.1    Bouchard, M.2    Goubran, R.3
  • 29
    • 0035396555
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • Martin, R. (2001). "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process. 9, 504-512.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 504-512
    • Martin, R.1
  • 31
    • 0020325263
    • Monaural and binaural speech perception in reverberation for listeners of various ages
    • Nabelek, A. K., and Robinson, P. K. (1982). "Monaural and binaural speech perception in reverberation for listeners of various ages," J. Acoust. Soc. Am. 71, 1242-1248.
    • (1982) J. Acoust. Soc. Am. , vol.71 , pp. 1242-1248
    • Nabelek, A.K.1    Robinson, P.K.2
  • 32
    • 0141830958
    • Blind dereverberation of single channel speech signal based on harmonic structure
    • Nakatani, T., and Miyoshi, M. (2003). "Blind dereverberation of single channel speech signal based on harmonic structure," Proc. ICASSP, pp. 92-95.
    • (2003) Proc. ICASSP , pp. 92-95
    • Nakatani, T.1    Miyoshi, M.2
  • 33
    • 0018494073
    • Invertibility of a room impulse response
    • Neely, S. T., and Allen, J. B. (1979). "Invertibility of a room impulse response," J. Acoust. Soc. Am. 66, 165-169.
    • (1979) J. Acoust. Soc. Am. , vol.66 , pp. 165-169
    • Neely, S.T.1    Allen, J.B.2
  • 35
    • 4644304197
    • A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation
    • Palomaki, K. J., Brown, G. J., and Wang, D. L. (2004). "A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation," Speech Commun. 43, 361-378.
    • (2004) Speech Commun. , vol.43 , pp. 361-378
    • Palomaki, K.J.1    Brown, G.J.2    Wang, D.L.3
  • 37
    • 0016916948
    • Binaural and monaural speech intelligibility of connected discourse in reverberation as a function of a single competing sound source (speech or noise)
    • Plomp, R. (1976). "Binaural and monaural speech intelligibility of connected discourse in reverberation as a function of a single competing sound source (speech or noise)," Acustica 34, 200-211.
    • (1976) Acustica , vol.34 , pp. 200-211
    • Plomp, R.1
  • 38
    • 0142026377
    • Speech segregation based on sound localization
    • Roman, N., Wang, D. L., and Brown, G. J. (2003). "Speech segregation based on sound localization," J. Acoust. Soc. Am. 114, 2236-2252.
    • (2003) J. Acoust. Soc. Am. , vol.114 , pp. 2236-2252
    • Roman, N.1    Wang, D.L.2    Brown, G.J.3
  • 39
    • 0031124228
    • A pitch determination and voiced/unvoiced decision algorithm for noisy speech
    • Rouat, J., Liu, Y. C., and Morissette, D. (1997). "A pitch determination and voiced/unvoiced decision algorithm for noisy speech," Speech Commun. 21, 191-207.
    • (1997) Speech Commun. , vol.21 , pp. 191-207
    • Rouat, J.1    Liu, Y.C.2    Morissette, D.3
  • 40
    • 33744996003
    • Model-based sequential organization in cochannel speech
    • Shao, Y., and Wang, D. L. (2006). "Model-based sequential organization in cochannel speech," IEEE Trans. Audio, Speech, Lang. Proc. 14, 289-298.
    • (2006) IEEE Trans. Audio, Speech, Lang. Proc. , vol.14 , pp. 289-298
    • Shao, Y.1    Wang, D.L.2
  • 41
    • 0035254668
    • A sound segregation algorithm for reverberant conditions
    • Shamsoddini, A., and Denbigh, P. N. (2001). "A sound segregation algorithm for reverberant conditions," Speech Commun. 33, 179-196.
    • (2001) Speech Commun. , vol.33 , pp. 179-196
    • Shamsoddini, A.1    Denbigh, P.N.2
  • 42
    • 0002296637
    • On the importance of time - A temporal representation of sound
    • M. P. Cooke, S. Beet, and M. Crawford, eds. (Wiley, New York)
    • Slaney, M., and Lyon, R. F. (1993). "On the importance of time - A temporal representation of sound," in Visual Representations of Speech Signals, M. P. Cooke, S. Beet, and M. Crawford, eds. (Wiley, New York), pp. 95-116.
    • (1993) Visual Representations of Speech Signals , pp. 95-116
    • Slaney, M.1    Lyon, R.F.2
  • 43
    • 85009151411
    • On binary and ratio time-frequency masks for robust speech recognition
    • Srinivasan, S., Roman, N., and Wang, D. L. (2004). "On binary and ratio time-frequency masks for robust speech recognition," Proc. ICSLP, pp. 2541-2544.
    • (2004) Proc. ICSLP , pp. 2541-2544
    • Srinivasan, S.1    Roman, N.2    Wang, D.L.3
  • 44
    • 84892233308
    • On ideal binary mask as the computational goal of auditory scene analysis
    • P. Divenyi, ed. (Kluwer Academic, Norwell, MA)
    • Wang, D. L. (2005). "On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, ed. (Kluwer Academic, Norwell, MA), pp. 181-197.
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 45
    • 0032682770
    • Separation of speech from interfering sounds based on oscillatory correlation
    • Wang, D. L., and Brown, G. J. (1999). "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw. 10, 684-697.
    • (1999) IEEE Trans. Neural Netw. , vol.10 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 48
    • 33745761716
    • A two-stage algorithm for one-microphone reverberant speech enhancement
    • Wu, M., and Wang, D. L. (2006). "A two-stage algorithm for one-microphone reverberant speech enhancement," IEEE Trans. Audio, Speech, Lang. Proc. 14, 774-784.
    • (2006) IEEE Trans. Audio, Speech, Lang. Proc. , vol.14 , pp. 774-784
    • Wu, M.1    Wang, D.L.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.