Volume 15, Issue 2, 2007, Pages 396-405

Auditory segmentation based on onset and offset analysis

Author keywords

Auditory segmentation; Event detection; Multiscale analysis; Onset and offset

Indexed keywords

AUDITORY SCENE ANALYSIS; AUDITORY SEGMENTATION; AUDITORY SYSTEMS; EVENT DETECTION; MULTI-SCALE APPROACHES; MULTIPLE SOURCES; MULTISCALE ANALYSIS; NATURAL ENVIRONMENTS; ONSET AND OFFSET; QUANTITATIVE MEASURES; SEGMENTATION EVALUATIONS; SYSTEMATIC EVALUATIONS; TARGET SPEECH; UNVOICED SPEECH

EID: 38849102154     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.881700     Document Type: Article
Times cited: 112

References (34)
  • 1
    • 4544333241
    • Underdetermined blind separation for speech in real environments with sparseness and ICA
    • S. Araki, S. Makino, A. Blin, R. Mukai, and H. Sawada, "Underdetermined blind separation for speech in real environments with sparseness and ICA," in Proc. ICASSP, 2004, vol. 3, pp. 881-884.
    • (2004) Proc. ICASSP , vol.3 , pp. 881-884
    • Araki, S.1    Makino, S.2    Blin, A.3    Mukai, R.4    Sawada, H.5
  • 2
    • 11144316019
    • Decoding speech in the presence of other sources
    • J. P. Barker, M. P. Cooke, and D. P. W. Ellis, "Decoding speech in the presence of other sources," in Speech Commun., 2005, vol. 45, pp. 5-25.
    • (2005) Speech Commun , vol.45 , pp. 5-25
    • Barker, J.P.1    Cooke, M.P.2    Ellis, D.P.W.3
  • 3
    • 64149112366
    • Praat: Doing phonetics by computer
    • P. Boersma and D. Weenink, Praat: Doing phonetics by computer, Version 4.2.31 2004 [Online]. Available: http://www.fon.hum.uva.nl/praat/
  • 5
    • 0028531926
    • Computational auditory scene analysis
    • G. J. Brown and M. P. Cooke, "Computational auditory scene analysis," Comput. Speech Lang., vol. 8, pp. 297-336, 1994.
    • (1994) Comput. Speech Lang , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.P.2
  • 6
    • 33644639591
    • Separation of speech by computational auditory scene analysis
    • G. J. Brown and D. L. Wang, "Separation of speech by computational auditory scene analysis," in Speech Enhancement, J. Benesty, S. Makino, and J. Chen, Eds. New York: Springer, 2005, pp. 371-402.
    • (2005) Speech Enhancement , pp. 371-402
    • Brown, G.J.1    Wang, D.L.2
  • 7
    • 64149132173
    • Exploration of behavioral, physiological, and computational approaches to auditory scene analysis
    • P. S. Chang, "Exploration of behavioral, physiological, and computational approaches to auditory scene analysis," M.S. thesis, Dept. Comput. Sci. Eng., The Ohio State Univ., Columbus, 2004.
  • 9
    • 0035342414
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. P. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," in Speech Commun., 2001, vol. 34, pp. 267-285.
    • (2001) Speech Commun , vol.34 , pp. 267-285
    • Cooke, M.P.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 10
    • 0021743658
    • Perceiving vowels in the presence of another sound: Constraints on formant perception
    • C. J. Darwin, "Perceiving vowels in the presence of another sound: Constraints on formant perception," J. Acoust. Soc. Amer., vol. 76, pp. 1636-1647, 1984.
    • (1984) J. Acoust. Soc. Amer , vol.76 , pp. 1636-1647
    • Darwin, C.J.1
  • 13
    • 0027957839
    • Effect of temporal envelope smearing on speech reception
    • R. Drullman, J. M. Festen, and R. Plomp, "Effect of temporal envelope smearing on speech reception," J. Acoust. Soc. Amer., vol. 95, pp. 1053-1064, 1994.
    • (1994) J. Acoust. Soc. Amer , vol.95 , pp. 1053-1064
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 14
    • 0028287770
    • Effect of reducing slow temporal modulations on speech reception
    • --, "Effect of reducing slow temporal modulations on speech reception," J. Acoust. Soc. Amer., vol. 95, pp. 2670-2680, 1994.
    • (1994) J. Acoust. Soc. Amer , vol.95 , pp. 2670-2680
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 15
    • 0003794341
    • Prediction-driven computational auditory scene analysis
    • D. P. W. Ellis, "Prediction-driven computational auditory scene analysis," Ph.D. dissertation, Dept. Elec. Eng. and Comput. Sci., Mass. Inst. Technol., Cambridge, 1996.
    • (1996)
    • Ellis, D.P.W.1
  • 16
    • 0030193096
    • An experimental comparison of range image segmentation algorithms
    • A. Hoover et al., "An experimental comparison of range image segmentation algorithms," IEEE Trans. Pattern Anal. Mach. Intell., vol. 18, no. 7, pp. 673-689, Jul. 1996.
    • (1996) IEEE Trans. Pattern Anal. Mach. Intell , vol.18 , Issue.7 , pp. 673-689
    • Hoover, A.1
  • 17
    • 0141788523
    • Separation of stop consonants
    • G. Hu and D. L. Wang, "Separation of stop consonants," in Proc. ICASSP, 2003, vol. 2, pp. 749-752.
    • (2003) Proc. ICASSP , vol.2 , pp. 749-752
    • Hu, G.1    Wang, D.L.2
  • 18
    • 4644265990
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • --, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
    • (2004) IEEE Trans. Neural Netw , vol.15 , Issue.5 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 21
    • 0035472866
    • Speech enhancement using a constrained iterative sinusoidal model
    • J. Jensen and J. H. L. Hansen, "Speech enhancement using a constrained iterative sinusoidal model," IEEE Trans. Speech Audio Process., vol. 9, no. 7, pp. 731-740, Oct. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.7 , pp. 731-740
    • Jensen, J.1    Hansen, J.H.L.2
  • 22
    • 0017966541
    • Revised estimate of minimal audible pressure: Where is the "missing 6 dB"?
    • M. C. Killion, "Revised estimate of minimal audible pressure: Where is the "missing 6 dB"?," J. Acoust. Soc. Amer., vol. 63, pp. 1501-1510, 1978.
    • (1978) J. Acoust. Soc. Amer , vol.63 , pp. 1501-1510
    • Killion, M.C.1
  • 23
    • 79251542316
    • A computational model of filtering, detection, and compression in the cochlea
    • R. F. Lyon, "A computational model of filtering, detection, and compression in the cochlea," in Proc. ICASSP, 1982, vol. 2, pp. 1282-1285.
    • (1982) Proc. ICASSP , vol.2 , pp. 1282-1285
    • Lyon, R.F.1
  • 24
    • 0023244573
    • Speech recognition in scale space
    • --, "Speech recognition in scale space," in Proc. ICASSP, 1987, vol. 12, pp. 1265-1268.
    • (1987) Proc. ICASSP , vol.12 , pp. 1265-1268
    • Lyon, R.F.1
  • 28
    • 0142026377
    • Speech segregation based on sound localization
    • N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol. 114, pp. 2236-2252, 2003.
    • (2003) J. Acoust. Soc. Amer , vol.114 , pp. 2236-2252
    • Roman, N.1    Wang, D.L.2    Brown, G.J.3
  • 29
    • 0003538256
    • Scale-Space Theory in Computer Vision
    • B. Romeny, L. Florack, J. Koenderink, and M. Viergever, Eds., Scale-Space Theory in Computer Vision. New York: Springer, 1997.
    • (1997) Scale-Space Theory in Computer Vision
  • 30
    • 0032166087
    • HMM-based strategies for enhancement of speech signals embedded in nonstationary noise
    • H. Sameti, H. Sheikhzadeh, L. Deng, and R. L. Brennan, "HMM-based strategies for enhancement of speech signals embedded in nonstationary noise," IEEE Trans. Speech Audio Process., vol. 6, no. 5, pp. 445-455, Sep. 1998.
    • (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.5 , pp. 445-455
    • Sameti, H.1    Sheikhzadeh, H.2    Deng, L.3    Brennan, R.L.4
  • 31
    • 0036216713
    • Rhythmic masking release: Contribution of cues for perceptual organization to the cross-spectral fusion of concurrent narrow-band noises
    • M. Turgeon, A. S. Bregman, and P. A. Ahad, "Rhythmic masking release: Contribution of cues for perceptual organization to the cross-spectral fusion of concurrent narrow-band noises," J. Acoust. Soc. Amer., vol. 111, pp. 1819-1831, 2002.
    • (2002) J. Acoust. Soc. Amer , vol.111 , pp. 1819-1831
    • Turgeon, M.1    Bregman, A.S.2    Ahad, P.A.3
  • 32
    • 84892233308
    • On ideal binary mask as the computational goal of auditory scene analysis
    • D. L. Wang, "On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed., 2005, pp. 181-197.
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
  • 33
    • 0032682770
    • Separation of speech from interfering sounds based on oscillatory correlation
    • D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw., vol. 10, no. 3, pp. 684-697, May 1999.
    • (1999) IEEE Trans. Neural Netw , vol.10 , Issue.3 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 34
    • 0003982501
    • A theory and computational model of auditory monaural sound separation
    • M. Weintraub, "A theory and computational model of auditory monaural sound separation," Ph.D. dissertation, Dept. Elect. Eng., Stanford Univ., Stanford, CA, 1985.
    • (1985)
    • Weintraub, M.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.