메뉴 건너뛰기




Volumn 21, Issue 2, 2013, Pages 367-377

Image feature representation of the subband power distribution for robust sound event classification

Author keywords

missing feature theory; Sound event classification; spectrogram; subband power distribution (SPD)

Indexed keywords

ACOUSTIC SURVEILLANCE; ENVIRONMENTAL SOUNDS; FEATURE CLASSIFICATION; IMAGE FEATURE REPRESENTATION; IMAGE FEATURES; MISSING FEATURE THEORIES; NEAREST NEIGHBOR CLASSIFIER; NOISE CONDITIONS; NONSTATIONARY NOISE; POWER DISTRIBUTIONS; SOUND EVENT CLASSIFICATION; SPECTRAL POWER DISTRIBUTION; SPECTROGRAMS; SUBBANDS;

EID: 84871391219     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2226160     Document Type: Article
Times cited : (82)

References (35)
  • 2
    • 80051605016 scopus 로고    scopus 로고
    • Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations
    • F. Weninger and B. Schuller, "Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations, " in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2011, pp. 337-340.
    • (2011) Proc IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP , pp. 337-340
    • Weninger, F.1    Schuller, B.2
  • 3
    • 68149163531 scopus 로고    scopus 로고
    • Environmental sound recognition with time-frequency audio features
    • Aug
    • S. Chu, S. Narayanan, and C. Kuo, "Environmental sound recognition with time-frequency audio features, " IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 6, pp. 1142-1158, Aug. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.6 , pp. 1142-1158
    • Chu, S.1    Narayanan, S.2    Kuo, C.3
  • 4
    • 85008548582 scopus 로고    scopus 로고
    • Time-frequency matrix feature extraction and classification of environmental audio signals
    • Sep.
    • B. Ghoraani and S. Krishnan, "Time-frequency matrix feature extraction and classification of environmental audio signals, " IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2197-2209, Sep. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.7 , pp. 2197-2220
    • Ghoraani, B.1    Krishnan, S.2
  • 5
    • 85032753469 scopus 로고    scopus 로고
    • Machine hearing: An emerging field
    • Sep.
    • R. Lyon, "Machine hearing: An emerging field, " IEEE Signal Process. Mag., vol. 27, no. 5, pp. 131-139, Sep. 2010.
    • (2010) IEEE Signal Process. Mag , vol.27 , Issue.5 , pp. 131-139
    • Lyon, R.1
  • 6
    • 78650982481 scopus 로고    scopus 로고
    • Spectrogram image feature for sound event classification in mismatched conditions
    • J. Dennis, H. Tran, and H. Li, "Spectrogram image feature for sound event classification in mismatched conditions, " IEEE Signal Process. Lett., vol. 18, no. 2, pp. 130-133, 2011.
    • (2011) IEEE Signal Process. Lett , vol.18 , Issue.2 , pp. 130-133
    • Dennis, J.1    Tran, H.2    Li, H.3
  • 8
    • 85032752225 scopus 로고    scopus 로고
    • Missing-feature approaches in speech recognition
    • DOI 10.1109/MSP.2005.1511828
    • B. Raj and R. Stern, "Missing-feature approaches in speech recognition, " IEEE Signal Process. Mag., vol. 22, no. 5, pp. 101-116, Sep. 2005. (Pubitemid 41488524)
    • (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 101-116
    • Raj, B.1    Stern, R.M.2
  • 9
    • 84865804537 scopus 로고    scopus 로고
    • Image representation of the subband power distribution for robust sound classification
    • Aug
    • J. Dennis, H. Tran, and H. Li, "Image representation of the subband power distribution for robust sound classification, " in Proc. 12 Annu. Conf. Int. Speech Commun. Assoc., Aug. 2011, pp. 2437-2440.
    • (2011) Proc. 12 Annu. Conf. Int. Speech Commun. Assoc. , pp. 2437-2440
    • Dennis, J.1    Tran, H.2    Li, H.3
  • 10
    • 0042830801 scopus 로고    scopus 로고
    • Comparison of techniques for environmental sound recognition
    • DOI 10.1016/S0167-8655(03)00147-8
    • M. Cowling and R. Sitte, "Comparison of techniques for environmental sound recognition, " Pattern Recognit. Lett., vol. 24, no. 15, pp. 2895-2907, 2003. (Pubitemid 37027809)
    • (2003) Pattern Recognition Letters , vol.24 , Issue.15 , pp. 2895-2907
    • Cowling, M.1    Sitte, R.2
  • 12
    • 34347345718 scopus 로고    scopus 로고
    • Parametric representations of bird sounds for automatic species recognition
    • Nov
    • P. Somervuo, A. Harma, and S. Fagerlund, "Parametric representations of bird sounds for automatic species recognition, " IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 6, pp. 2252-2263, Nov. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.6 , pp. 2252-2263
    • Somervuo, P.1    Harma, A.2    Fagerlund, S.3
  • 13
    • 76949107820 scopus 로고    scopus 로고
    • Sound indexing using morphological description
    • Mar.
    • G. Peeters and E. Deruty, "Sound indexing using morphological description, " IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 675-687, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process , vol.18 , Issue.3 , pp. 675-687
    • Peeters, G.1    Deruty, E.2
  • 14
    • 79957687384 scopus 로고    scopus 로고
    • Sound event recognition with probabilistic distance SVMs
    • Aug.
    • H. Tran and L. Haizhou, "Sound event recognition with probabilistic distance SVMs, " IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 6, pp. 1556-1568, Aug. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.6 , pp. 1556-1568
    • Tran, H.1    Haizhou, L.2
  • 15
    • 14244272507 scopus 로고    scopus 로고
    • Methods for capturing spectro-temporal modulations in automatic speech recognition
    • M. Kleinschmidt, "Methods for capturing spectro-temporal modulations in automatic speech recognition, " Acta Acustica United With Acustica, vol. 88, no. 3, pp. 416-422, 2002. (Pubitemid 34732124)
    • (2002) Acta Acustica united with Acustica , vol.88 , Issue.3 , pp. 416-422
    • Kleinschmidt, M.1
  • 20
    • 84863744672 scopus 로고    scopus 로고
    • Gradient-based musical feature extraction based on scale-invariant feature transform
    • T. Matsui, M. Goto, J. Vert, and Y. Uchiyama, "Gradient-based musical feature extraction based on scale-invariant feature transform, " in Proc. 19th Eur. Signal Process. Conf., 2011, pp. 724-728.
    • (2011) Proc. 19th Eur. Signal Process. Conf , pp. 724-728
    • Matsui, T.1    Goto, M.2    Vert, J.3    Uchiyama, Y.4
  • 21
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • DOI 10.1016/S0167-6393(00)00034-0, PII S0167639300000340
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data, " Speech Commun., vol. 34, no. 3, pp. 267-285, 2001. (Pubitemid 32284867)
    • (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 22
    • 4644317224 scopus 로고    scopus 로고
    • A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
    • M. Seltzer, B. Raj, and R. Stern, "A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition, " Speech Commun., vol. 43, no. 4, pp. 379-393, 2004.
    • (2004) Speech Commun , vol.43 , Issue.4 , pp. 379-393
    • Seltzer, M.1    Raj, B.2    Stern, R.3
  • 23
    • 0141624530 scopus 로고
    • An efficient auditory filterbank based on the gammatone function
    • R. Patterson, I. Nimmo-Smith, J. Holdsworth, and P. Rice, "An efficient auditory filterbank based on the gammatone function, " APU Rep., 1988, vol. 2341.
    • (1988) APU Rep , pp. 2341
    • Patterson, R.1    Nimmo-Smith, I.2    Holdsworth, J.3    Rice, P.4
  • 24
    • 0003913694 scopus 로고
    • An efficient implementation of the Patterson-Holdsworth auditory filter bank
    • Tech. Rep.
    • M. Slaney, "An efficient implementation of the Patterson-Holdsworth auditory filter bank, " Apple Computer, 1993, Tech. Rep. .
    • (1993) Apple Computer
    • Slaney, M.1
  • 25
    • 0003626435 scopus 로고    scopus 로고
    • Upper Saddle River NJ Prentice-Hall ISBN 0-201-18075-8
    • R. Gonzalez and R. Woods, Digital Image Processing. Upper Saddle River, NJ: Prentice-Hall, 2002, ISBN 0-201-18075-8.
    • (2002) Digital Image Processing
    • Gonzalez, R.1    Woods, R.2
  • 26
    • 3042535216 scopus 로고    scopus 로고
    • Distinctive image features from scale-invariant keypoints
    • D. Lowe, "Distinctive image features from scale-invariant keypoints, " Int. J. Comput. Vis., vol. 60, no. 2, pp. 91-110, 2004.
    • (2004) Int. J. Comput. Vis , vol.60 , Issue.2 , pp. 91-110
    • Lowe, D.1
  • 27
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979. (Pubitemid 9467471)
    • (1979) IEEE Trans Acoust Speech Signal Process , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll Steven, F.1
  • 31
    • 78049391669 scopus 로고    scopus 로고
    • Acoustical sound database in real environments for sound scene understanding and hands-free speech recognition
    • S. Nakamura, K. Hiyane, F. Asano, T. Nishiura, and T. Yamada, "Acoustical sound database in real environments for sound scene understanding and hands-free speech recognition, " in Proc. ICLRE, 2000, pp. 965-968.
    • (2000) Proc. ICLRE , pp. 965-968
    • Nakamura, S.1    Hiyane, K.2    Asano, F.3    Nishiura, T.4    Yamada, T.5
  • 34
    • 0027623210 scopus 로고
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • A. Varga and H. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems, " Speech Commun., vol. 12, no. 3, pp. 247-251, 1993.
    • (1993) Speech Commun , vol.12 , Issue.3 , pp. 247-251
    • Varga, A.1    Steeneken, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.