Volume 19, Issue 6, 2011, Pages 1556-1568

Sound event recognition with probabilistic distance SVMs

Author keywords

Divergence distance; probabilistic distance; sound characterization; sound event recognition; subband temporal envelope (STE); support vector machine (SVM)

Indexed keywords

DIVERGENCE DISTANCE; PROBABILISTIC DISTANCE; SOUND CHARACTERIZATION; SOUND EVENT RECOGNITION; SUB-BANDS;

EID: 79957687384     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2093519     Document Type: Article
Times cited: 58

References (40)
  • 3
    • 0030242072
    • Content-based classification, search, and retrieval of audio
    • E. Wold, T. Blum, D. Keislar, and J. Wheaton, "Content-based classification, search, and retrieval of audio," IEEE Multimedia, vol. 3, no. 3, pp. 27-36, Fall, 1996. (Pubitemid 126571576)
    • (1996) IEEE Multimedia , vol.3 , Issue.3 , pp. 27-36
    • Wold, E.1    Blum, T.2    Keislar, D.3    Wheaton, J.4
  • 4
    • 0141855203
    • Automatic identification of bird species based on sinusoidal modeling of syllables
    • A. Härmä, "Automatic identification of bird species based on sinusoidal modeling of syllables," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2003, pp. 545-548.
    • (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 545-548
    • Härmä, A.1
  • 5
    • 34347345718
    • Parametric representations of bird sounds for automatic species recognition
    • P. Somervuo, A. Härmä, and S. Fagerlund, "Parametric representations of bird sounds for automatic species recognition," IEEE Trans. Speech Audio Process., vol. 14, pp. 2252-2263, 2006.
    • (2006) IEEE Trans. Speech Audio Process. , vol.14 , pp. 2252-2263
    • Somervuo, P.1    Härmä, A.2    Fagerlund, S.3
  • 7
    • 68149163531
    • Environmental sound recognition with time-frequency audio features
    • S. Chu, S. Narayanan, and C. J. Kuo, "Environmental sound recognition with time-frequency audio features," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 6, pp. 1142-1158, Aug. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.6 , pp. 1142-1158
    • Chu, S.1    Narayanan, S.2    Kuo, C.J.3
  • 8
    • 0036648502
    • Musical genre classification of audio signals
    • DOI 10.1109/TSA.2002.800560, PII 1011092002800560
    • G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," IEEE Trans. Speech Audio Process., vol. 10, no. 5, pp. 293-302, Jul. 2002. (Pubitemid 34950067)
    • (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.5 , pp. 293-302
    • Tzanetakis, G.1    Cook, P.2
  • 9
    • 17444399233
    • Musical instrument timbres classification with spectral features
    • DOI 10.1155/S1110865703210118
    • G. Agostini, M. Longari, and E. Pollastri, "Musical instrument timbres classification with spectral features," EURASIP J. Appl. Signal Process. , pp. 5-14, Jan. 2003. (Pubitemid 41283787)
    • (2003) Eurasip Journal on Applied Signal Processing , vol.2003 , Issue.1 , pp. 5-14
    • Agostini, G.1    Longari, M.2    Pollastri, E.3
  • 10
    • 76949083398
    • Dynamic spectral envelope modeling for timbre analysis of musical instrument sounds
    • J. J. Burred, A. Röbel, and T. Sikora, "Dynamic spectral envelope modeling for timbre analysis of musical instrument sounds," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 663-674, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 663-674
    • Burred, J.J.1    Röbel, A.2    Sikora, T.3
  • 12
    • 0035758890
    • Footstep detection and tracking
    • G. Succi, D. Clapp, R. Gampert, and G. Prado, "Footstep detection and tracking," in Proc. SPIE, 2001, vol. 4393, pp. 22-26.
    • (2001) Proc. SPIE , vol.4393 , pp. 22-26
    • Succi, G.1    Clapp, D.2    Gampert, R.3    Prado, G.4
  • 17
    • 0032828464
    • A model of auditory perception as front end for automatic speech recognition
    • J. Tchorz and B. Kollmeier, "A model of auditory perception as front end for automatic speech recognition," J. Acoust. Soc. Amer., vol. 106, no. 4, pp. 2040-2050, 1999.
    • (1999) J. Acoust. Soc. Amer. , vol.106 , Issue.4 , pp. 2040-2050
    • Tchorz, J.1    Kollmeier, B.2
  • 19
    • 84898982939
    • Exploiting generative models in discriminative classifiers
    • Cambridge, MA: MIT Press
    • T. Jaakkola and D. Haussler, "Exploiting generative models in discriminative classifiers," in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 1998, vol. 11, pp. 487-493.
    • (1998) Advances in Neural Information Processing Systems. , vol.11 , pp. 487-493
    • Jaakkola, T.1    Haussler, D.2
  • 20
    • 14644412368
    • Speaker verification using sequence discriminant support vector machines
    • DOI 10.1109/TSA.2004.841042
    • V. Wan and S. Renals, "Speaker verification using sequence discriminant support vector machines," IEEE Trans. Speech Audio Process., vol. 13, no. 2, pp. 203-210, Mar. 2005. (Pubitemid 40320239)
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.2 , pp. 203-210
    • Wan, V.1    Renals, S.2
  • 21
    • 17644380073
    • A Kullback-Leibler divergence based kernel for SVM classification in multimedia applications
    • Cambridge, MA: MIT Press
    • P. J. Moreno, P. P. Ho, and N. Vasconcelos, "A Kullback-Leibler divergence based kernel for SVM classification in multimedia applications," in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 2003, vol. 16.
    • (2003) Advances in Neural Information Processing Systems , vol.16
    • Moreno, P.J.1    Ho, P.P.2    Vasconcelos, N.3
  • 22
    • 0034313871
    • Earth mover's distance as a metric for image retrieval
    • DOI 10.1023/A:1026543900054
    • Y. Rubner, C. Tomasi, and L. Guibas, "The Earth Mover's distance as a metric for image retrieval," Int. J. Comput. Vis., vol. 40, no. 2, pp. 99-121, 2000. (Pubitemid 32136368)
    • (2000) International Journal of Computer Vision , vol.40 , Issue.2 , pp. 99-121
    • Rubner, Y.1    Tomasi, C.2    Guibas, L.J.3
  • 23
    • 0032594951
    • Support vector machines for histogram-based image classification
    • O. Chapelle, P. Haffner, and V. Vapnik, "Support vector machines for histogram-based image classification," IEEE Trans. Neural Netw., vol. 10, no. 5, pp. 1055-1064, Sep. 1999.
    • (1999) IEEE Trans. Neural Netw. , vol.10 , Issue.5 , pp. 1055-1064
    • Chapelle, O.1    Haffner, P.2    Vapnik, V.3
  • 24
    • 9444269199
    • Bhattacharyya and expected likelihood kernels
    • Learning Theory and Kernel Machines
    • T. Jebara and R. Kondor, "Bhattacharyya and expected likelihood kernels," Lecture Notes in Computer Science, vol. 2777, pp. 57-71, 2003. (Pubitemid 37053195)
    • (2003) Lecture Notes in Computer Science , Issue.2777 , pp. 57-71
    • Jebara, T.1    Kondor, R.2
  • 25
    • 33645887246
    • Support vector machines using GMM supervectors for speaker verification
    • W. M. Campbell, D. E. Sturim, and D. A. Reynolds, "Support vector machines using GMM supervectors for speaker verification," IEEE Signal Process. Lett., vol. 13, no. 5, pp. 308-311, May 2006.
    • (2006) IEEE Signal Process. Lett. , vol.13 , Issue.5 , pp. 308-311
    • Campbell, W.M.1    Sturim, D.E.2    Reynolds, D.A.3
  • 26
    • 85008056687
    • A SVM Kernel with GMM-supervector based on the Bhattacharyya distance for speaker recognition
    • C. H. You, K.-A. Lee, and H. Li, "A SVM Kernel with GMM-supervector based on the Bhattacharyya distance for speaker recognition," IEEE Signal Process. Lett., vol. 16, no. 1, pp. 49-52, 2009.
    • (2009) IEEE Signal Process. Lett. , vol.16 , Issue.1 , pp. 49-52
    • You, C.H.1    Lee, K.-A.2    Li, H.3
  • 33
    • 33947678069
    • Multichannel speech enhancement based on speech spectral magnitude estimation using generalized gamma prior distribution
    • T. H. Dat, K. Takeda, and F. Itakura, "Multichannel speech enhancement based on speech spectral magnitude estimation using generalized gamma prior distribution," in Proc. 31st IEEE Int. Conf. Acoust. Speech Signal Process., ICASSP'06, vol. 4, pp. 439-447.
    • Proc. 31st IEEE Int. Conf. Acoust. Speech Signal Process., ICASSP'06 , vol.4 , pp. 439-447
    • Dat, T.H.1    Takeda, K.2    Itakura, F.3
  • 34
    • 0000238336
    • A simplex method for function minimization
    • J. A. Nelder and R. Mead, "A simplex method for function minimization," Comput. J., vol. 7, pp. 308-313, 1965.
    • (1965) Comput. J. , vol.7 , pp. 308-313
    • Nelder, J.A.1    Mead, R.2
  • 38
    • 33750380834
    • On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement
    • DOI 10.1016/j.specom.2006.06.009, PII S016763930600080X
    • T. H. Dat, K. Takeda, and F. Itakura, "On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement," Speech Commun., vol. 48, no. 11, pp. 1515-1527, 2006. (Pubitemid 44634771)
    • (2006) Speech Communication , vol.48 , Issue.11 , pp. 1515-1527
    • Dat, T.H.1    Takeda, K.2    Itakura, F.3
  • 40
    • 0028430538
    • Fast Gabor-like windowed Fourier and continuous wavelet transforms
    • M. Unser, "Fast Gabor-like windowed Fourier and continuous wavelet transforms," IEEE Signal Process. Lett., vol. 1, no. 5, pp. 76-79, May 1994.
    • (1994) IEEE Signal Process. Lett. , vol.1 , Issue.5 , pp. 76-79
    • Unser, M.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.