SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 2, 2007, Pages 396-405

Auditory segmentation based on onset and offset analysis

(2) Hu, Guoning a Wang, Deliang b

a OHIO STATE UNIVERSITY (United States)

b The Ohio State University (United States)

Author keywords

Auditory segmentation; Event detection; Multiscale analysis; Onset and offset

Indexed keywords

AUDITORY SCENE ANALYSIS; AUDITORY SEGMENTATION; AUDITORY SYSTEMS; EVENT DETECTION; MULTI-SCALE APPROACHES; MULTIPLE SOURCES; MULTISCALE ANALYSIS; NATURAL ENVIRONMENTS; ONSET AND OFFSET; QUANTITATIVE MEASURES; SEGMENTATION EVALUATIONS; SYSTEMATIC EVALUATIONS; TARGET SPEECH; UNVOICED SPEECH;

PATIENT REHABILITATION;

EID: 38849102154 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.881700 Document Type: Article

Times cited : (112)

References (34)

1
- 4544333241
- Underdetermined blind separation for speech in real environments with sparseness and ICA
- S. Araki, S. Makino, A. Blin, R. Mukai, and H. Sawada, "Underdetermined blind separation for speech in real environments with sparseness and ICA," in Proc. ICASSP, 2004, vol. 3, pp. 881-884.
- (2004) Proc. ICASSP , vol.3 , pp. 881-884
- Araki, S.¹ Makino, S.² Blin, A.³ Mukai, R.⁴ Sawada, H.⁵

2
- 11144316019
- Decoding speech in the presence of other sources
- J. P. Barker, M. P. Cooke, and D. P. W. Ellis, "Decoding speech in the presence of other sources," in Speech Commun., 2005, vol. 45, pp. 5-25.
- (2005) Speech Commun , vol.45 , pp. 5-25
- Barker, J.P.¹ Cooke, M.P.² Ellis, D.P.W.³

3
- 64149112366
- P. Boersma and D. Weenink, Praat: Doing phonetics by computer, Version 4.2.31 2004 [Online, Available
- P. Boersma and D. Weenink, Praat: Doing phonetics by computer, Version 4.2.31 2004 [Online]. Available: http://www.fon.hum.uva.nl/praat/

4
- 0003684441
- Cambridge, MA: MIT Press
- A. S. Bregman, Auditory Scene Analysis. Cambridge, MA: MIT Press, 1990.
- (1990) Auditory Scene Analysis
- Bregman, A.S.¹

5
- 0028531926
- Computational auditory scene analysis
- G. J. Brown and M. P. Cooke, "Computational auditory scene analysis," Comput. Speech Lang., vol. 8, pp. 297-336, 1994.
- (1994) Comput. Speech Lang , vol.8 , pp. 297-336
- Brown, G.J.¹ Cooke, M.P.²

6
- 33644639591
- Separation of speech by computational auditory scene analysis
- J. Benesty, S. Makino, and J. Chen, Eds. New York: Springer
- G. J. Brown and D. L. Wang, "Separation of speech by computational auditory scene analysis," in Speech Enhancement, J. Benesty, S. Makino, and J. Chen, Eds. New York: Springer, 2005, pp. 371-402.
- (2005) Speech Enhancement , pp. 371-402
- Brown, G.J.¹ Wang, D.L.²

7
- 64149132173
- P. S. Chang, Exploration of behavioral, physiological, and computational approaches to auditory scene analysis, M.S. thesis, Dept. Comput. Sci. Eng., The Ohio State Univ., Columbus, 2004.
- P. S. Chang, "Exploration of behavioral, physiological, and computational approaches to auditory scene analysis," M.S. thesis, Dept. Comput. Sci. Eng., The Ohio State Univ., Columbus, 2004.

8
- 0003479143
- Cambridge, U.K, Cambridge Univ. Press
- M. P. Cooke, Modelling Auditory Processing and Organisation. Cambridge, U.K.: Cambridge Univ. Press, 1993.
- (1993) Modelling Auditory Processing and Organisation
- Cooke, M.P.¹

9
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- M. P. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," in Speech Commun., 2001, vol. 34, pp. 267-285.
- (2001) Speech Commun , vol.34 , pp. 267-285
- Cooke, M.P.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

10
- 0021743658
- Perceiving vowels in the presence of another sound: Constraints on formant perception
- C. J. Darwin, "Perceiving vowels in the presence of another sound: Constraints on formant perception," J. Acoust. Soc. Amer., vol. 76, pp. 1636-1647, 1984.
- (1984) J. Acoust. Soc. Amer , vol.76 , pp. 1636-1647
- Darwin, C.J.¹

11
- 0003424145
- New York: Macmillan
- J. R. Deller, J. G. Proakis, and J. H. L. Hansen, Discrete-Time Processing of Speech Signals. New York: Macmillan, 1993.
- (1993) Discrete-Time Processing of Speech Signals
- Deller, J.R.¹ Proakis, J.G.² Hansen, J.H.L.³

12
- 84892300819
- P. Divenyi, Ed, Norwell, MA: Kluwer
- P. Divenyi, Ed., Speech Separation by Humans and Machines. Norwell, MA: Kluwer, 2005.
- (2005) Speech Separation by Humans and Machines

13
- 0027957839
- Effect of temporal envelope smearing on speech reception
- R. Drullman, J. M. Festen, and R. Plomp, "Effect of temporal envelope smearing on speech reception," J. Acoust. Soc. Amer., vol. 95, pp. 1053-1064, 1994.
- (1994) J. Acoust. Soc. Amer , vol.95 , pp. 1053-1064
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

14
- 0028287770
- Effect of reducing slow temporal modulations on speech reception
- --, "Effect of reducing slow temporal modulations on speech reception," J. Acoust. Soc. Amer., vol. 95, pp. 2670-2680, 1994.
- (1994) J. Acoust. Soc. Amer , vol.95 , pp. 2670-2680
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

15
- 0003794341
- Prediction-driven computational auditory scene analysis,
- Ph.D. dissertation, Dept. Elec. Eng. and Comput. Sci, Mass. Inst. Technol, Cambridge
- D. P. W. Ellis, "Prediction-driven computational auditory scene analysis," Ph.D. dissertation, Dept. Elec. Eng. and Comput. Sci., Mass. Inst. Technol., Cambridge, 1996.
- (1996)
- Ellis, D.P.W.¹

16
- 0030193096
- An experimental comparison of range image segmentation algorithms
- Jul
- A. Hoover et al., "An experimental comparison of range image segmentation algorithms," IEEE Trans. Pattern Anal. Mach. Intell., vol. 18, no. 7, pp. 673-689, Jul. 1996.
- (1996) IEEE Trans. Pattern Anal. Mach. Intell , vol.18 , Issue.7 , pp. 673-689
- Hoover, A.¹

17
- 0141788523
- Separation of stop consonants
- G. Hu and D. L. Wang, "Separation of stop consonants," in Proc. ICASSP, 2003, vol. 2, pp. 749-752.
- (2003) Proc. ICASSP , vol.2 , pp. 749-752
- Hu, G.¹ Wang, D.L.²

18
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- Sep
- --, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
- (2004) IEEE Trans. Neural Netw , vol.15 , Issue.5 , pp. 1135-1150
- Hu, G.¹ Wang, D.L.²

19
- 85116246624
- Auditory segmentation based on event detection
- --, "Auditory segmentation based on event detection," in Proc. ISCA Tutorial and Research Workshop on Stat. Percept. Audio Process., 2004.
- (2004) Proc. ISCA Tutorial and Research Workshop on Stat. Percept. Audio Process
- Hu, G.¹ Wang, D.L.²

20
- 0004056285
- Upper Saddle River, NJ: Prentice-Hall
- X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing: A Guide to Theory, Algorithms, and System Development. Upper Saddle River, NJ: Prentice-Hall, 2001.
- (2001) Spoken Language Processing: A Guide to Theory, Algorithms, and System Development
- Huang, X.¹ Acero, A.² Hon, H.-W.³

21
- 0035472866
- Speech enhancement using a constrained iterative sinusoidal model
- Oct
- J. Jensen and J. H. L. Hansen, "Speech enhancement using a constrained iterative sinusoidal model," IEEE Trans. Speech Audio Process., vol. 9, no. 7, pp. 731-740, Oct. 2001.
- (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.7 , pp. 731-740
- Jensen, J.¹ Hansen, J.H.L.²

22
- 0017966541
- Revised estimate of minimal audible pressure: Where is the "missing 6 dB"?
- M. C. Killion, "Revised estimate of minimal audible pressure: Where is the "missing 6 dB"?," J. Acoust. Soc. Amer., vol. 63, pp. 1501-1510, 1978.
- (1978) J. Acoust. Soc. Amer , vol.63 , pp. 1501-1510
- Killion, M.C.¹

23
- 79251542316
- A computational model of filtering, detection, and compression in the cochlea
- R. F. Lyon, "A computational model of filtering, detection, and compression in the cochlea," in Proc. ICASSP, 1982, vol. 2, pp. 1282-1285.
- (1982) Proc. ICASSP , vol.2 , pp. 1282-1285
- Lyon, R.F.¹

24
- 0023244573
- Speech recognition in scale space
- --, "Speech recognition in scale space," in Proc. ICASSP, 1987, vol. 12, pp. 1265-1268.
- (1987) Proc. ICASSP , vol.12 , pp. 1265-1268
- Lyon, R.F.¹

25
- 0003789815
- 5th ed. San Diego, CA: Academic
- B. C. J. Moore, An Introduction to the Psychology of Hearing, 5th ed. San Diego, CA: Academic, 2003.
- (2003) An Introduction to the Psychology of Hearing
- Moore, B.C.J.¹

26
- 0141624530
- An efficient auditory filterbank based on the gammatone function
- R. D. Patterson, I. Nimmo-Smith, J. Holdsworth, and P. Rice, "An efficient auditory filterbank based on the gammatone function,"MRCAppl. Psychol. Unit., 1988.
- (1988) MRCAppl. Psychol. Unit
- Patterson, R.D.¹ Nimmo-Smith, I.² Holdsworth, J.³ Rice, P.⁴

27
- 0004106903
- 2nd ed. London, U.K, Academic
- J. O. Pickles, An Introduction to the Physiology of Hearing, 2nd ed. London, U.K.: Academic, 1988.
- (1988) An Introduction to the Physiology of Hearing
- Pickles, J.O.¹

28
- 0142026377
- Speech segregation based on sound localization
- N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol. 114, pp. 2236-2252, 2003.
- (2003) J. Acoust. Soc. Amer , vol.114 , pp. 2236-2252
- Roman, N.¹ Wang, D.L.² Brown, G.J.³

29
- 0003538256
- B. Romeny, L. Florack, J. Koenderink, and M. Viergever, Eds, New York: Springer
- B. Romeny, L. Florack, J. Koenderink, and M. Viergever, Eds., Scale- Space Theory in Computer Vision. New York: Springer, 1997.
- (1997) Scale- Space Theory in Computer Vision

30
- 0032166087
- HMM-based strategies for enhancement of speech signals embedded in nonstationary noise
- Sep
- H. Sameti, H. Sheikhzadeh, L. Deng, and R. L. Brennan, "HMM-based strategies for enhancement of speech signals embedded in nonstationary noise," IEEE Trans. Speech Audio Process., vol. 6, no. 5, pp. 445-455, Sep. 1998.
- (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.5 , pp. 445-455
- Sameti, H.¹ Sheikhzadeh, H.² Deng, L.³ Brennan, R.L.⁴

31
- 0036216713
- Rhythmic masking release: Contribution of cues for perceptual organization to the cross-spectral fusion of concurrent narrow-band noises
- M. Turgeon, A. S. Bregman, and P. A. Ahad, "Rhythmic masking release: Contribution of cues for perceptual organization to the cross-spectral fusion of concurrent narrow-band noises," J. Acoust. Soc. Amer., vol. 111, pp. 1819-1831, 2002.
- (2002) J. Acoust. Soc. Amer , vol.111 , pp. 1819-1831
- Turgeon, M.¹ Bregman, A.S.² Ahad, P.A.³

32
- 84892233308
- On ideal binary mask as the computational goal of auditory scene analysis
- D. L. Wang, P. Divenyi, Ed
- D. L. Wang, P. Divenyi, Ed., "On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, 2005, pp. 181-197.
- (2005) Speech Separation by Humans and Machines , pp. 181-197

33
- 0032682770
- Separation of speech from interfering sounds based on oscillatory correlation
- May
- D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw., vol. 10, no. 3, pp. 684-697, May 1999.
- (1999) IEEE Trans. Neural Netw , vol.10 , Issue.3 , pp. 684-697
- Wang, D.L.¹ Brown, G.J.²

34
- 0003982501
- A theory and computational model of auditory monaural sound separation,
- Ph.D. dissertation, Dept. Elect. Eng, Stanford Univ, Stanford, CA
- M. Weintraub, "A theory and computational model of auditory monaural sound separation," Ph.D. dissertation, Dept. Elect. Eng., Stanford Univ., Stanford, CA, 1985.
- (1985)
- Weintraub, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.