메뉴 건너뛰기




Volumn 17, Issue 10, 2015, Pages 1733-1746

Detection and Classification of Acoustic Scenes and Events

Author keywords

Audio databases; event detection; machine intelligence; pattern recognition

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; ARTIFICIAL INTELLIGENCE; AUDIO ACOUSTICS; AUDIO SYSTEMS; INTELLIGENT SYSTEMS; PATTERN RECOGNITION; SIGNAL PROCESSING;

EID: 84960474809     PISSN: 15209210     EISSN: None     Source Type: Journal    
DOI: 10.1109/TMM.2015.2428998     Document Type: Article
Times cited : (528)

References (66)
  • 2
    • 84878543263 scopus 로고    scopus 로고
    • The PASCAL CHiME speech separation and recognition challenge
    • J. Barker, E. Vincent, N. Ma, H. Christensen, and P. Green, "The PASCAL CHiME speech separation and recognition challenge," Comput. Speech Language, vol. 27, no. 3, pp. 621-633, 2012.
    • (2012) Comput. Speech Language , vol.27 , Issue.3 , pp. 621-633
    • Barker, J.1    Vincent, E.2    Ma, N.3    Christensen, H.4    Green, P.5
  • 3
    • 84874821721 scopus 로고    scopus 로고
    • Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model
    • E. Benetos and S. Dixon, "Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model," J. Acoust. Soc. America, vol. 133, pp. 1727-1741, 2013.
    • (2013) J. Acoust. Soc. America , vol.133 , pp. 1727-1741
    • Benetos, E.1    Dixon, S.2
  • 4
    • 11144341364 scopus 로고    scopus 로고
    • An industrial strength audio search algorithm
    • Oct.
    • A. Wang, "An industrial strength audio search algorithm," in Proc. 4th Int. Conf. Music Inf. Retrieval, Oct. 2003, pp. 7-13.
    • (2003) Proc. 4th Int. Conf. Music Inf. Retrieval , pp. 7-13
    • Wang, A.1
  • 5
    • 3042525207 scopus 로고    scopus 로고
    • Localization of simultaneous moving sound sources for mobile robot using a frequencydomain steered beamformer approach
    • Apr.-May
    • J.-M. Valin, F. Michaud, B. Hadjou, and J. Rouat, "Localization of simultaneous moving sound sources for mobile robot using a frequencydomain steered beamformer approach," in Proc. 2004 IEEE Int. Conf. Robot. Automat., Apr.-May 2004, vol. 1, pp. 1033-1038.
    • (2004) Proc. 2004 IEEE Int. Conf. Robot. Automat. , vol.1 , pp. 1033-1038
    • Valin, J.-M.1    Michaud, F.2    Hadjou, B.3    Rouat, J.4
  • 6
    • 3042638661 scopus 로고    scopus 로고
    • Natural sound archives: Past, present and future
    • R. Ranft, "Natural sound archives: Past, present and future," Anais da Academia Brasileira de Ciências, vol. 76, no. 2, pp. 456-460, 2004.
    • (2004) Anais da Academia Brasileira de Ciências , vol.76 , Issue.2 , pp. 456-460
    • Ranft, R.1
  • 10
    • 34547645414 scopus 로고    scopus 로고
    • The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music
    • J.-J. Aucouturier, B. Defreville, and F. Pachet, "The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music," J. Acoust. Soc. America, vol. 122, no. 2, pp. 881-891, 2007.
    • (2007) J. Acoust. Soc. America , vol.122 , Issue.2 , pp. 881-891
    • Aucouturier, J.-J.1    Defreville, B.2    Pachet, F.3
  • 12
    • 85032752479 scopus 로고    scopus 로고
    • Automatic genre classification of music content: A survey
    • Mar.
    • N. Scaringella, G. Zoia, and D. Mlynek, "Automatic genre classification of music content: A survey," IEEE Signal Process. Mag., vol. 23, no. 2, pp. 133-141, Mar. 2006.
    • (2006) IEEE Signal Process. Mag. , vol.23 , Issue.2 , pp. 133-141
    • Scaringella, N.1    Zoia, G.2    Mlynek, D.3
  • 13
    • 84953683778 scopus 로고
    • Efficient acoustic parameters for speaker recognition
    • Jun.
    • J. J. Wolf, "Efficient acoustic parameters for speaker recognition," J. Acoust. Soc. America, vol. 51, pp. 2044-2056, Jun. 1972.
    • (1972) J. Acoust. Soc. America , vol.51 , pp. 2044-2056
    • Wolf, J.J.1
  • 16
    • 84904341868 scopus 로고    scopus 로고
    • Ph.D. dissertation, School of Electron. Eng. and Comput. Sci., Queen Mary University of London, London, U.K., Dec.
    • E. Benetos, "Automatic transcription of polyphonic music exploiting temporal evolution," Ph.D. dissertation, School of Electron. Eng. and Comput. Sci., Queen Mary University of London, London, U.K., Dec. 2012.
    • (2012) Automatic Transcription of Polyphonic Music Exploiting Temporal Evolution
    • Benetos, E.1
  • 17
    • 68149163531 scopus 로고    scopus 로고
    • Environmental sound recognition with time-frequency audio features
    • Aug.
    • S. Chu, S. Narayanan, and C.-C. Jay Kuo, "Environmental sound recognition with time-frequency audio features," IEEE Trans. Audio, Speech Language Process., vol. 17, no. 6, pp. 1142-1158, Aug. 2009.
    • (2009) IEEE Trans. Audio, Speech Language Process. , vol.17 , Issue.6 , pp. 1142-1158
    • Chu, S.1    Narayanan, S.2    Jay Kuo, C.-C.3
  • 18
    • 34047261805 scopus 로고    scopus 로고
    • An overview of automatic speaker diarization systems
    • Sep.
    • S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Trans. Audio, Speech, Language Process., vol. 14, no. 5, pp. 1557-1565, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech, Language Process. , vol.14 , Issue.5 , pp. 1557-1565
    • Tranter, S.E.1    Reynolds, D.A.2
  • 24
    • 11144316019 scopus 로고    scopus 로고
    • Decoding speech in the presence of other sources
    • J. P. Barker, M. P. Cooke, and D. P. W. Ellis, "Decoding speech in the presence of other sources," Speech Commun., vol. 45, no. 1, pp. 5-25, 2005.
    • (2005) Speech Commun. , vol.45 , Issue.1 , pp. 5-25
    • Barker, J.P.1    Cooke, M.P.2    Ellis, D.P.W.3
  • 25
    • 84918783217 scopus 로고    scopus 로고
    • Acoustic classification of multiple simultaneous bird species: A multi-instance multi-label approach
    • F. Briggs and B. Lakshminarayanan et al., "Acoustic classification of multiple simultaneous bird species: A multi-instance multi-label approach," J. Acoust. Soc. America, vol. 131, pp. 4640-4650, 2012.
    • (2012) J. Acoust. Soc. America , vol.131 , pp. 4640-4650
    • Briggs, F.1    Lakshminarayanan, B.2
  • 27
    • 80052281439 scopus 로고    scopus 로고
    • Automatic extraction of pornographic contents using radon transform based audio features
    • Jun.
    • M. J. Kim and H. Kim, "Automatic extraction of pornographic contents using radon transform based audio features," in Proc. 9th Int.Workshop Content-Based Multimedia Indexing, Jun. 2011, pp. 205-210.
    • (2011) Proc. 9th Int.Workshop Content-Based Multimedia Indexing , pp. 205-210
    • Kim, M.J.1    Kim, H.2
  • 30
    • 84885664567 scopus 로고    scopus 로고
    • Segregating event streams and noise with a Markov renewal process model
    • D. Stowell and M. D. Plumbley, "Segregating event streams and noise with a Markov renewal process model," J. Mach. Learning Res., vol. 14, pp. 1891-1916, 2013.
    • (2013) J. Mach. Learning Res. , vol.14 , pp. 1891-1916
    • Stowell, D.1    Plumbley, M.D.2
  • 36
    • 84907449171 scopus 로고    scopus 로고
    • A simple method to determine if a music information retrieval system is a 'horse'
    • Oct.
    • B. L. Sturm, "A simple method to determine if a music information retrieval system is a 'horse'," IEEE Trans. Multimedia, vol. 16, no. 6, pp. 1636-1644, Oct. 2014.
    • (2014) IEEE Trans. Multimedia , vol.16 , Issue.6 , pp. 1636-1644
    • Sturm, B.L.1
  • 39
    • 84873592866 scopus 로고    scopus 로고
    • Semantic annotation and retrieval of music using a bag of systems representation
    • Oct.
    • K. Ellis, E. Coviello, and G. Lanckriet, "Semantic annotation and retrieval of music using a bag of systems representation," in Proc. 12th Int. Conf. Music Inf. Retrieval, Oct. 2011, pp. 723-728.
    • (2011) Proc. 12th Int. Conf. Music Inf. Retrieval , pp. 723-728
    • Ellis, K.1    Coviello, E.2    Lanckriet, G.3
  • 40
    • 33646023117 scopus 로고    scopus 로고
    • An introduction to ROC analysis
    • T. Fawcett, "An introduction to ROC analysis," Pattern Recog. Lett., vol. 27, no. 8, pp. 861-874, 2006.
    • (2006) Pattern Recog. Lett. , vol.27 , Issue.8 , pp. 861-874
    • Fawcett, T.1
  • 43
    • 38049176869 scopus 로고    scopus 로고
    • CLEAR evaluation of acoustic event detection and classification systems
    • A. Temko, R. Malkin, C. Zieger, D. Macho, C. Nadeu, and M. Omologo, "CLEAR evaluation of acoustic event detection and classification systems," in Proc CLEAR, 2007, pp. 311-322.
    • (2007) Proc CLEAR , pp. 311-322
    • Temko, A.1    Malkin, R.2    Zieger, C.3    Macho, D.4    Nadeu, C.5    Omologo, M.6
  • 44
    • 33847655586 scopus 로고    scopus 로고
    • A generalized divergence measure for nonnegative matrix factorization
    • R. Kompass, "A generalized divergence measure for nonnegative matrix factorization," Neural Comput., vol. 19, no. 3, pp. 780-791, 2007.
    • (2007) Neural Comput. , vol.19 , Issue.3 , pp. 780-791
    • Kompass, R.1
  • 64
    • 85032751587 scopus 로고    scopus 로고
    • Acoustic scene classification: Classifying environments from the sounds they produce
    • May
    • D. Barchiesi, D. Giannoulis, D. Stowell, and M. D. Plumbley, "Acoustic scene classification: Classifying environments from the sounds they produce," IEEE Signal Process. Magazine vol. 32, no. 3, pp. 16-34, May 2015.
    • (2015) IEEE Signal Process. Magazine , vol.32 , Issue.3 , pp. 16-34
    • Barchiesi, D.1    Giannoulis, D.2    Stowell, D.3    Plumbley, M.D.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.