SCOPUS Information Retrieval Platform

Journal of the Acoustical Society of America

Volume 120, Issue 1, 2006, Pages 458-469

Pitch-based monaural segregation of reverberant speech

(2) Roman, Nicoleta (a); Wang, DeLiang (a)

(a) Ohio State University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC NOISE; ALGORITHMS; HARMONIC GENERATION; HEARING AIDS; REVERBERATION; SENSORY PERCEPTION; SIGNAL TO NOISE RATIO; SPEECH PROCESSING; SPEECH RECOGNITION;

ADDITIVE NOISE; PITCH-BASED SEGREGATION ALGORITHM; SPEECH PERCEPTION; SPEECH SEGREGATION;

SPEECH ANALYSIS;

ALGORITHM; ARTICLE; HEARING AID; HISTOGRAM; HUMAN; HUMAN EXPERIMENT; MATHEMATICAL MODEL; MONAURAL HEARING; NOISE; NORMAL HUMAN; PERIODICITY; PITCH; PRIORITY JOURNAL; PSYCHOPHYSICS; SIGNAL NOISE RATIO; SOUND DETECTION; SPEECH AUDIOMETRY; SPEECH DISCRIMINATION; SPEECH PERCEPTION; WHITE NOISE;

ALGORITHMS; CONDITIONING (PSYCHOLOGY); ENVIRONMENT; FEMALE; HUMANS; MALE; MODELS, BIOLOGICAL; NOISE; PITCH PERCEPTION; SOUND SPECTROGRAPHY; SPEECH PERCEPTION; SPEECH PRODUCTION MEASUREMENT; SPEECH RECEPTION THRESHOLD TEST;

EID: 33745761651 PISSN: 00014966 EISSN: None Source Type: Journal
DOI: 10.1121/1.2204590 Document Type: Article

Times cited : (35)

References (50)

1
- 0018455820
- Image method for efficiently simulating small-room acoustics
- Allen, J. B., and Berkley, D. A. (1979). "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Am. 65, 943-950.
- (1979) J. Acoust. Soc. Am. , vol.65 , pp. 943-950
- Allen, J.B.¹ Berkley, D.A.²

2
- 0036649241
- Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets
- Barros, A. K., Rutkowski, T., Itakura, F., and Ohnishi, N. (2002). "Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets," IEEE Trans. Neural Netw. 13, 888-893.
- (2002) IEEE Trans. Neural Netw. , vol.13 , pp. 888-893
- Barros, A.K.¹ Rutkowski, T.² Itakura, F.³ Ohnishi, N.⁴

3
- 0038028639
- AR processes and sources can be reconstructed from degenerate mixtures
- Balan, R., Jourjine, A., and Rosca, J. (1999). "AR processes and sources can be reconstructed from degenerate mixtures," Proc. 1st Int. Workshop on Independent Component Analysis and Signal Separation, pp. 467-472.
- (1999) Proc. 1st Int. Workshop on Independent Component Analysis and Signal Separation , pp. 467-472
- Balan, R.¹ Jourjine, A.² Rosca, J.³

4
- 0038120523
- Boersma, P., and Weenink, D. (2002). Praat: doing Phonetics by Computer, Version 4.0.26 (http://www.fon.hum.uva.nl/praat).
- (2002) Praat: Doing Phonetics by Computer, Version 4.0.26
- Boersma, P.¹ Weenink, D.²

5
- 0003684441
- MIT Press, Cambridge, MA
- Bregman, A. S. (1990). Auditory Scene Analysis (MIT Press, Cambridge, MA).
- (1990) Auditory Scene Analysis
- Bregman, A.S.¹

6
- 0039334758
- The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions
- Bronkhorst, A. (2000). "The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions," Acustica 86, 117-128.
- (2000) Acustica , vol.86 , pp. 117-128
- Bronkhorst, A.¹

7
- 0028531926
- Computational auditory scene analysis
- Brown, G. J., and Cooke, M. (1994). "Computational auditory scene analysis," Comput. Speech Lang. 8, 297-336.
- (1994) Comput. Speech Lang. , vol.8 , pp. 297-336
- Brown, G.J.¹ Cooke, M.²

8
- 33644639591
- Separation of speech by computational auditory scene analysis
- J. Benesty, S. Makino, and J. Chen, eds. (Springer, New York)
- Brown, G. J., and Wang, D. L. (2005). "Separation of speech by computational auditory scene analysis," in Speech Enhancement, J. Benesty, S. Makino, and J. Chen, eds. (Springer, New York), pp. 371-402.
- (2005) Speech Enhancement , pp. 371-402
- Brown, G.J.¹ Wang, D.L.²

9
- 33745741350
- unpublished
- Brungart, D., Chang, P., Simpson, B., and Wang, D. L. (2006). "Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask," (unpublished).
- (2006) Isolating the Energetic Component of Speech-on-speech Masking with An Ideal Binary Time-frequency Mask
- Brungart, D.¹ Chang, P.² Simpson, B.³ Wang, D.L.⁴

10
- 0016522748
- Anthropometric manikin for acoustic research
- Burkhard, M. D., and Sachs, R. M. (1975). "Anthropometric manikin for acoustic research," J. Acoust. Soc. Am. 58, 214-222.
- (1975) J. Acoust. Soc. Am. , vol.58 , pp. 214-222
- Burkhard, M.D.¹ Sachs, R.M.²

11
- 0003479143
- Cambridge University Press, Cambridge, U.K.
- Cooke, M. P. (1993). Modeling Auditory Processing and Organization (Cambridge University Press, Cambridge, U.K.).
- (1993) Modeling Auditory Processing and Organization
- Cooke, M.P.¹

12
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- Cooke, M. P., Green, P., Josifovski, L., and Vizinho, A. (2001). "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun. 34, 267-285.
- (2001) Speech Commun. , vol.34 , pp. 267-285
- Cooke, M.P.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

13
- 0242440783
- Effects of reverberation on perceptual segregation of competing voices
- Culling, J. F., Hodder, K. I., and Toh, C. Y. (2003). "Effects of reverberation on perceptual segregation of competing voices," J. Acoust. Soc. Am. 114, 2871-2876.
- (2003) J. Acoust. Soc. Am. , vol.114 , pp. 2871-2876
- Culling, J.F.¹ Hodder, K.I.² Toh, C.Y.³

14
- 0001698589
- Auditory grouping
- B. C. J. Moore, ed. (Academic, London)
- Darwin, C. J., and Carlyon, R. P. (1995). "Auditory grouping," in The Handbook of Perception and Cognition, vol. 6, B. C. J. Moore, ed. (Academic, London), pp. 387-424.
- (1995) The Handbook of Perception and Cognition , vol.6 , pp. 387-424
- Darwin, C.J.¹ Carlyon, R.P.²

15
- 0033939839
- Effects of reverberation on spatial, prosodic, and vocal-tract size cues to selective attention
- Darwin, C. J., and Hukin, R. W. (2000). "Effects of reverberation on spatial, prosodic, and vocal-tract size cues to selective attention," J. Acoust. Soc. Am. 108, 335-342.
- (2000) J. Acoust. Soc. Am. , vol.108 , pp. 335-342
- Darwin, C.J.¹ Hukin, R.W.²

16
- 0029345417
- A signal subspace approach for speech enhancement
- Ephraim, Y., and Trees, H. L. (1995). "A signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Process. 3, 251-266.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 251-266
- Ephraim, Y.¹ Trees, H.L.²

17
- 0030674105
- Two-channel blind deconvolution for non-minimum phase impulse responses
- Furuya, K., and Kaneda, Y. (1997). "Two-channel blind deconvolution for non-minimum phase impulse responses," Proc. ICASSP, pp. 1315-1318.
- (1997) Proc. ICASSP , pp. 1315-1318
- Furuya, K.¹ Kaneda, Y.²

18
- 0004089083
- HRTF measurements of a KEMAR dummy-head microphone
- Gardner, W. G., and Martin, K. D. (1994). "HRTF measurements of a KEMAR dummy-head microphone," MIT Media Lab Perceptual Computing Technical Report #280.
- (1994) MIT Media Lab Perceptual Computing Technical Report #280
- Gardner, W.G.¹ Martin, K.D.²

19
- 0036289676
- Acoustic diversity for improved speech recognition in reverberant environments
- Gillespie, B. W., and Atlas, L. E. (2002). "Acoustic diversity for improved speech recognition in reverberant environments," Proc. ICASSP, pp. 557-560.
- (2002) Proc. ICASSP , pp. 557-560
- Gillespie, B.W.¹ Atlas, L.E.²

20
- 0034857681
- Speech dereverberation via maximum-kurtosis subband adaptive filtering
- Gillespie, B. W., Malvar, H. S., and Florencio, D. A. F. (2001). "Speech dereverberation via maximum-kurtosis subband adaptive filtering," Proc. ICASSP, vol. 6, pp. 3701-3704.
- (2001) Proc. ICASSP , vol.6 , pp. 3701-3704
- Gillespie, B.W.¹ Malvar, H.S.² Florencio, D.A.F.³

21
- 0003807773
- Prentice-Hall, Upper Saddle River, NJ
- Haykin, S. (2002). Adaptive Filter Theory, 4th ed. (Prentice-Hall, Upper Saddle River, NJ).
- (2002) Adaptive Filter Theory, 4th Ed.
- Haykin, S.¹

22
- 0141788523
- Separation of stop consonants
- Hu, G., and Wang, D. L. (2003). "Separation of stop consonants," Proc. ICASSP, vol. 2, pp. 749-752.
- (2003) Proc. ICASSP , vol.2 , pp. 749-752
- Hu, G.¹ Wang, D.L.²

23
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- Hu, G., and Wang, D. L. (2004). "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw. 15, 1135-1150.
- (2004) IEEE Trans. Neural Netw. , vol.15 , pp. 1135-1150
- Hu, G.¹ Wang, D.L.²

24
- 33646786460
- Separation of fricatives and affricates
- Hu, G., and Wang, D. L. (2005). "Separation of fricatives and affricates," Proc. ICASSP, vol. 1, pp. 1101-1104.
- (2005) Proc. ICASSP , vol.1 , pp. 1101-1104
- Hu, G.¹ Wang, D.L.²

25
- 0038630563
- Single channel signal separation using time-domain basis functions
- Jang, G.-J., Lee, T.-W., and Oh, Y.-H. (2003). "Single channel signal separation using time-domain basis functions," IEEE Signal Process. Lett. 10(6), 168-171.
- (2003) IEEE Signal Process. Lett. , vol.10 , Issue.6 , pp. 168-171
- Jang, G.-J.¹ Lee, T.-W.² Oh, Y.-H.³

26
- 0001463644
- A duplex theory of pitch perception
- Licklider, J. C. R. (1951). "A duplex theory of pitch perception," Experientia 7, 128-134.
- (1951) Experientia , vol.7 , pp. 128-134
- Licklider, J.C.R.¹

27
- 0342571948
- A speech separation system that is robust to reverberation
- Luo, H. Y., and Denbigh, P. N. (1994). "A speech separation system that is robust to reverberation," Proc. ISSIPNN, pp. 339-342.
- (1994) Proc. ISSIPNN , pp. 339-342
- Luo, H.Y.¹ Denbigh, P.N.²

28
- 4544267645
- Perceptual Kalman filtering for speech enhancement in colored noise
- Ma, N., Bouchard, M., and Goubran, R. (2004). "Perceptual Kalman filtering for speech enhancement in colored noise," Proc. ICASSP, vol. 1, pp. 717-720.
- (2004) Proc. ICASSP , vol.1 , pp. 717-720
- Ma, N.¹ Bouchard, M.² Goubran, R.³

29
- 0035396555
- Noise power spectral density estimation based on optimal smoothing and minimum statistics
- Martin, R. (2001). "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process. 9, 504-512.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 504-512
- Martin, R.¹

30
- 0003789815
- Academic, San Diego, CA
- Moore, B. C. J. (2003). An Introduction to the Psychology of Hearing, 5th ed. (Academic, San Diego, CA).
- (2003) An Introduction to the Psychology of Hearing, 5th Ed.
- Moore, B.C.J.¹

31
- 0020325263
- Monaural and binaural speech perception in reverberation for listeners of various ages
- Nabelek, A. K., and Robinson, P. K. (1982). "Monaural and binaural speech perception in reverberation for listeners of various ages," J. Acoust. Soc. Am. 71, 1242-1248.
- (1982) J. Acoust. Soc. Am. , vol.71 , pp. 1242-1248
- Nabelek, A.K.¹ Robinson, P.K.²

32
- 0141830958
- Blind dereverberation of single channel speech signal based on harmonic structure
- Nakatani, T., and Miyoshi, M. (2003). "Blind dereverberation of single channel speech signal based on harmonic structure," Proc. ICASSP, pp. 92-95.
- (2003) Proc. ICASSP , pp. 92-95
- Nakatani, T.¹ Miyoshi, M.²

33
- 0018494073
- Invertibility of a room impulse response
- Neely, S. T., and Allen, J. B. (1979). "Invertibility of a room impulse response," J. Acoust. Soc. Am. 66, 165-169.
- (1979) J. Acoust. Soc. Am. , vol.66 , pp. 165-169
- Neely, S.T.¹ Allen, J.B.²

34
- 0003522449
- IEEE Press, Piscataway, NJ
- O'Shaughnessy, D. (2000). Speech Communications: Human and Machine, 2nd ed. (IEEE Press, Piscataway, NJ).
- (2000) Speech Communications: Human and Machine, 2nd Ed.
- O'Shaughnessy, D.¹

35
- 4644304197
- A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation
- Palomaki, K. J., Brown, G. J., and Wang, D. L. (2004). "A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation," Speech Commun. 43, 361-378.
- (2004) Speech Commun. , vol.43 , pp. 361-378
- Palomaki, K.J.¹ Brown, G.J.² Wang, D.L.³

36
- 0003548690
- Applied Psychology Unit, Cambridge
- Patterson, R. D., Nimmo-Smith, I., Holdsworth, J., and Price, P. (1988). "APU Report 2341: An efficient auditory filterbank based on the gamma-tone function," Applied Psychology Unit, Cambridge.
- (1988) APU Report 2341: An Efficient Auditory Filterbank Based on the Gamma-tone Function
- Patterson, R.D.¹ Nimmo-Smith, I.² Holdsworth, J.³ Price, P.⁴

37
- 0016916948
- Binaural and monaural speech intelligibility of connected discourse in reverberation as a function of a single competing sound source (speech or noise)
- Plomp, R. (1976). "Binaural and monaural speech intelligibility of connected discourse in reverberation as a function of a single competing sound source (speech or noise)," Acustica 34, 200-211.
- (1976) Acustica , vol.34 , pp. 200-211
- Plomp, R.¹

38
- 0142026377
- Speech segregation based on sound localization
- Roman, N., Wang, D. L., and Brown, G. J. (2003). "Speech segregation based on sound localization," J. Acoust. Soc. Am. 114, 2236-2252.
- (2003) J. Acoust. Soc. Am. , vol.114 , pp. 2236-2252
- Roman, N.¹ Wang, D.L.² Brown, G.J.³

39
- 0031124228
- A pitch determination and voiced/unvoiced decision algorithm for noisy speech
- Rouat, J., Liu, Y. C., and Morissette, D. (1997). "A pitch determination and voiced/unvoiced decision algorithm for noisy speech," Speech Commun. 21, 191-207.
- (1997) Speech Commun. , vol.21 , pp. 191-207
- Rouat, J.¹ Liu, Y.C.² Morissette, D.³

40
- 33744996003
- Model-based sequential organization in cochannel speech
- Shao, Y., and Wang, D. L. (2006). "Model-based sequential organization in cochannel speech," IEEE Trans. Audio, Speech, Lang. Proc. 14, 289-298.
- (2006) IEEE Trans. Audio, Speech, Lang. Proc. , vol.14 , pp. 289-298
- Shao, Y.¹ Wang, D.L.²

41
- 0035254668
- A sound segregation algorithm for reverberant conditions
- Shamsoddini, A., and Denbigh, P. N. (2001). "A sound segregation algorithm for reverberant conditions," Speech Commun. 33, 179-196.
- (2001) Speech Commun. , vol.33 , pp. 179-196
- Shamsoddini, A.¹ Denbigh, P.N.²

42
- 0002296637
- On the importance of time - A temporal representation of sound
- M. P. Cooke, S. Beet, and M. Crawford, eds. (Wiley, New York)
- Slaney, M., and Lyon, R. F. (1993). "On the importance of time - A temporal representation of sound," in Visual Representations of Speech Signals, M. P. Cooke, S. Beet, and M. Crawford, eds. (Wiley, New York), pp. 95-116.
- (1993) Visual Representations of Speech Signals , pp. 95-116
- Slaney, M.¹ Lyon, R.F.²

43
- 85009151411
- On binary and ratio time-frequency masks for robust speech recognition
- Srinivasan, S., Roman, N., and Wang, D. L. (2004). "On binary and ratio time-frequency masks for robust speech recognition," Proc. ICSLP, pp. 2541-2544.
- (2004) Proc. ICSLP , pp. 2541-2544
- Srinivasan, S.¹ Roman, N.² Wang, D.L.³

44
- 84892233308
- On ideal binary mask as the computational goal of auditory scene analysis
- P. Divenyi, ed. (Kluwer Academic, Norwell, MA)
- Wang, D. L. (2005). "On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, ed. (Kluwer Academic, Norwell, MA), pp. 181-197.
- (2005) Speech Separation by Humans and Machines , pp. 181-197
- Wang, D.L.¹

45
- 0032682770
- Separation of speech from interfering sounds based on oscillatory correlation
- Wang, D. L., and Brown, G. J. (1999). "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw. 10, 684-697.
- (1999) IEEE Trans. Neural Netw. , vol.10 , pp. 684-697
- Wang, D.L.¹ Brown, G.J.²

46
- 0003982501
- Ph.D. dissertation, Stanford University Department of Electrical Engineering
- Weintraub, M. (1985). "A theory and computational model of auditory monaural sound separation," Ph.D. dissertation, Stanford University Department of Electrical Engineering.
- (1985) A Theory and Computational Model of Auditory Monaural Sound Separation
- Weintraub, M.¹

47
- 4644333729
- PhD thesis, The Ohio State University, Department of Computer and Information Science
- Wu, M. (2003). "Pitch tracking and speech enhancement in noisy and reverberant environments," PhD thesis, The Ohio State University, Department of Computer and Information Science.
- (2003) Pitch Tracking and Speech Enhancement in Noisy and Reverberant Environments
- Wu, M.¹

48
- 33745761716
- A two-stage algorithm for one-microphone reverberant speech enhancement
- Wu, M., and Wang, D. L. (2006). "A two-stage algorithm for one-microphone reverberant speech enhancement," IEEE Trans. Audio, Speech, Lang. Proc. 10, 774-784.
- (2006) IEEE Trans. Audio, Speech, Lang. Proc. , vol.10 , pp. 774-784
- Wu, M.¹ Wang, D.L.²

49
- 0037767686
- A multipitch tracking algorithm for noisy speech
- Wu, M., Wang, D. L., and Brown, G. J. (2003). "A multipitch tracking algorithm for noisy speech," IEEE Trans. Speech Audio Process. 11, 229-241.
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 229-241
- Wu, M.¹ Wang, D.L.² Brown, G.J.³

50
- 1242316819
- Blind source separation by sparse decomposition
- S. J. Roberts and R. M. Everson, eds. (Cambridge University Press, Cambridge)
- Zibulevsky, M., Pearlmutter, B. A., Bofill, P., and Kisilev, P. (2001). "Blind source separation by sparse decomposition," in Independent Component Analysis: Principles and Practice, S. J. Roberts and R. M. Everson, eds. (Cambridge University Press, Cambridge).
- (2001) Independent Component Analysis: Principles and Practice
- Zibulevsky, M.¹ Pearlmutter, B.A.² Bofill, P.³ Kisilev, P.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.