메뉴 건너뛰기




Volumn 24, Issue 3, 2017, Pages 279-283

Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

Author keywords

Deep convolutional neural networks (CNNs); deep learning; environmental sound classification; urban sound dataset

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; CLASSIFICATION (OF INFORMATION); CONVOLUTION; DEEP LEARNING; NETWORK ARCHITECTURE; NEURAL NETWORKS;

EID: 85015238568     PISSN: 10709908     EISSN: None     Source Type: Journal    
DOI: 10.1109/LSP.2017.2657381     Document Type: Article
Times cited : (1333)

References (32)
  • 1
    • 68149163531 scopus 로고    scopus 로고
    • Environmental sound recognition with time-frequency audio features
    • Aug.
    • S. Chu, S. Narayanan, and C.-C. Kuo, "Environmental sound recognition with time-frequency audio features," IEEE Trans. Audio, Speech, Language Process., vol. 17, no. 6, pp. 1142-1158, Aug. 2009
    • (2009) IEEE Trans. Audio, Speech, Language Process. , vol.17 , Issue.6 , pp. 1142-1158
    • Chu, S.1    Narayanan, S.2    Kuo, C.-C.3
  • 3
    • 84997327948 scopus 로고    scopus 로고
    • The implementation of low-cost urban acoustic monitoring devices
    • C. Mydlarz, J. Salamon, and J. P. Bello, "The implementation of low-cost urban acoustic monitoring devices," Appl. Acoust., vol. 117, pp. 207-218, 2016
    • (2016) Appl. Acoust. , vol.117 , pp. 207-218
    • Mydlarz, C.1    Salamon, J.2    Bello, J.P.3
  • 4
    • 84946032854 scopus 로고    scopus 로고
    • Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations
    • Brisbane, Australia, Apr.
    • A. Mesaros, T. Heittola, O. Dikmen, and T. Virtanen, "Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Brisbane, Australia, Apr. 2015, pp. 151-155
    • (2015) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 151-155
    • Mesaros, A.1    Heittola, T.2    Dikmen, O.3    Virtanen, T.4
  • 5
    • 84973400179 scopus 로고    scopus 로고
    • Detection of overlapping acoustic events using a temporally-constrained probabilistic model
    • Shanghai, China, Mar.
    • E. Benetos, G. Lafay, M. Lagrange, and M. D. Plumbley, "Detection of overlapping acoustic events using a temporally-constrained probabilistic model," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Shanghai, China, Mar. 2016, pp. 6450-6454
    • (2016) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 6450-6454
    • Benetos, E.1    Lafay, G.2    Lagrange, M.3    Plumbley, M.D.4
  • 6
    • 84973279159 scopus 로고    scopus 로고
    • Acoustic scene classification with matrix factorization for unsupervised feature learning
    • Shanghai, China, Mar.
    • V. Bisot, R. Serizel, S. Essid, and G. Richard, "Acoustic scene classification with matrix factorization for unsupervised feature learning," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Shanghai, China, Mar. 2016, pp. 6445-6449
    • (2016) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 6445-6449
    • Bisot, V.1    Serizel, R.2    Essid, S.3    Richard, G.4
  • 7
    • 84946051287 scopus 로고    scopus 로고
    • Unsupervised feature learning for urban sound classification
    • Brisbane, Australia, Apr.
    • J. Salamon and J. P. Bello, "Unsupervised feature learning for urban sound classification," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Brisbane, Australia, Apr. 2015, pp. 171-175
    • (2015) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 171-175
    • Salamon, J.1    Bello, J.P.2
  • 8
    • 84963983684 scopus 로고    scopus 로고
    • Feature learning with deep scattering for urban sound analysis
    • Nice, France, Aug.
    • J. Salamon and J. P. Bello, "Feature learning with deep scattering for urban sound analysis," in Proc. 2015 23rd Eur. Signal Process. Conf., Nice, France, Aug. 2015, pp. 724-728
    • (2015) Proc. 2015 23rd Eur. Signal Process. Conf. , pp. 724-728
    • Salamon, J.1    Bello, J.P.2
  • 9
    • 84963954566 scopus 로고    scopus 로고
    • Improving event detection for audio surveillance using gabor filterbank features
    • Nice, France, Aug.
    • J. T. Geiger and K. Helwani, "Improving event detection for audio surveillance using gabor filterbank features," in Proc. 23rd Eur. Signal Process. Conf., Nice, France, Aug. 2015, pp. 714-718
    • (2015) Proc. 23rd Eur. Signal Process. Conf. , pp. 714-718
    • Geiger, J.T.1    Helwani, K.2
  • 11
    • 84960857918 scopus 로고    scopus 로고
    • Environmental sound classification with convolutional neural networks
    • Boston, MA, USA, Sep.
    • K. J. Piczak, "Environmental sound classification with convolutional neural networks," in Proc. 25th Int. Workshop Mach. Learning Signal Process., Boston, MA, USA, Sep. 2015, pp. 1-6
    • (2015) Proc. 25th Int. Workshop Mach. Learning Signal Process. , pp. 1-6
    • Piczak, K.J.1
  • 14
  • 15
    • 0032203257 scopus 로고    scopus 로고
    • Gradient-based learning applied to document recognition
    • Nov.
    • Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proc. IEEE, vol. 86, no. 11, pp. 2278-2324, Nov. 1998
    • (1998) Proc. IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • Lecun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 16
    • 83455255740 scopus 로고    scopus 로고
    • Spectral vs. Spectro-temporal features for acoustic event detection
    • New Paltz, NY, USA, Oct.
    • C. V. Cotton and D. P.W. Ellis, "Spectral vs. spectro-temporal features for acoustic event detection," in Proc. IEEE Workshop Appl. Signal Process. Audio Acoust., New Paltz, NY, USA, Oct. 2011, pp. 69-72
    • (2011) Proc. IEEE Workshop Appl. Signal Process. Audio Acoust. , pp. 69-72
    • Cotton, C.V.1    Ellis, D.P.W.2
  • 17
    • 84913580340 scopus 로고    scopus 로고
    • A dataset and taxonomy for urban sound research
    • Multimedia, Orlando, FL, USA, Nov.
    • J. Salamon, C. Jacoby, and J. P. Bello, "A dataset and taxonomy for urban sound research," in Proc. 22nd ACM Int. Conf. Multimedia, Orlando, FL, USA, Nov. 2014, pp. 1041-1044
    • (2014) Proc. 22nd ACM Int. Conf. , pp. 1041-1044
    • Salamon, J.1    Jacoby, C.2    Bello, J.P.3
  • 18
    • 84962792164 scopus 로고    scopus 로고
    • ESC: Dataset for environmental sound classification
    • Brisbane, Australia, Oct.
    • K. J. Piczak, "ESC: Dataset for environmental sound classification," in Proc. 23rd ACM Int. Conf. Multimedia, Brisbane, Australia, Oct. 2015, pp. 1015-1018
    • (2015) Proc. 23rd ACM Int. Conf. Multimedia , pp. 1015-1018
    • Piczak, K.J.1
  • 21
    • 84945900998 scopus 로고    scopus 로고
    • Best practices for convolutional neural networks applied to visual document analysis
    • Edinburgh, U.K., Aug.
    • P. Y. Simard, D. Steinkraus, and J. C. Platt, "Best practices for convolutional neural networks applied to visual document analysis," in Proc. Int. Conf. Document Anal. Recognit., Edinburgh, U.K., Aug. 2003, vol. 3, pp. 958-962
    • (2003) Proc. Int. Conf. Document Anal. Recognit. , vol.3 , pp. 958-962
    • Simard, P.Y.1    Steinkraus, D.2    Platt, J.C.3
  • 23
    • 84973278436 scopus 로고    scopus 로고
    • Recurrent neural networks for polyphonic sound event detection in real life recordings
    • Shanghai, China, Mar.
    • G. Parascandolo, H. Huttunen, and T. Virtanen, "Recurrent neural networks for polyphonic sound event detection in real life recordings," in Proc. Int. Conf. Acoust., Speech, Signal Process., Shanghai, China, Mar. 2016, pp. 6440-6444
    • (2016) Proc. Int. Conf. Acoust., Speech, Signal Process. , pp. 6440-6444
    • Parascandolo, G.1    Huttunen, H.2    Virtanen, T.3
  • 24
    • 85019537868 scopus 로고    scopus 로고
    • ESSENTIA: An audio analysis library for music information retrieval
    • Curitiba, Brazil, Nov.
    • D. Bogdanov et al. "ESSENTIA: An audio analysis library for music information retrieval," in Proc. 14th Int. Soc. Music Inf. Retrieval Conf., Curitiba, Brazil, Nov. 2013, pp. 493-498
    • (2013) Proc. 14th Int. Soc. Music Inf. Retrieval Conf. , pp. 493-498
    • Bogdanov, D.1
  • 25
    • 84904136037 scopus 로고    scopus 로고
    • Large-scale machine learning with stochastic gradient descent
    • Paris, France, Aug.
    • L. Bottou, "Large-scale machine learning with stochastic gradient descent," in Proc. 19th Int. Conf. Comput. Statist., Paris, France, Aug. 2010, pp. 177-186. [Online]. Available: http://dx.doi.org/10.1007/978-3-7908-2604-3-16
    • (2010) Proc. 19th Int. Conf. Comput. Statist. , pp. 177-186
    • Bottou, L.1
  • 30
    • 85015218858 scopus 로고    scopus 로고
    • Accessed on: Aug. 12
    • "Icecast streaming media server forum," [Online]. Available: http://icecast.imux.net/viewtopic.phpt=3462. Accessed on: Aug. 12, 2016
    • (2016) Icecast Streaming Media Server Forum
  • 32
    • 85015250879 scopus 로고    scopus 로고
    • Pump up the JAMS: V0.2 and beyond
    • New York University, New York, NY, USA, Oct. unpublished
    • B. McFee et al., "Pump up the JAMS: V0.2 and beyond," Music and Audio Research Laboratory, New York University, New York, NY, USA, Oct. 2015, unpublished.
    • (2015) Music and Audio Research Laboratory
    • McFee, B.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.