



Volume 2015-November, 2015

Joint time-frequency scattering for audio classification

Author keywords

audio classification; Convolutional networks; invariant descriptors; time frequency structure; wavelets

Indexed keywords

ARTIFICIAL INTELLIGENCE; AUDIO ACOUSTICS; CLASSIFICATION (OF INFORMATION); COMPLEX NETWORKS; FREQUENCY MODULATION; LEARNING SYSTEMS; WAVELET TRANSFORMS

EID: 84960933893     PISSN: 21610363     EISSN: 21610371     Source Type: Conference Proceeding    
DOI: 10.1109/MLSP.2015.7324385     Document Type: Conference Paper
Times cited: 45

References (14)
  • 1
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S.B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. 28, no. 4, pp. 357-366, 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 2
    • 84896734479
    • Deep scattering spectrum
    • J. Andén and S. Mallat, "Deep scattering spectrum," IEEE Trans. Sig. Proc., vol. 62, pp. 4114-4128, 2014.
    • (2014) IEEE Trans. Sig. Proc. , vol.62 , pp. 4114-4128
    • Andén, J.1    Mallat, S.2
  • 3
    • 84864324516
    • Group invariant scattering
    • S. Mallat, "Group invariant scattering," Comm. Pure Appl. Math., vol. 65, no. 10, pp. 1331-1398, 2012.
    • (2012) Comm. Pure Appl. Math. , vol.65 , Issue.10 , pp. 1331-1398
    • Mallat, S.1
  • 4
    • 23744508888
    • Multiresolution spectrotemporal analysis of complex sounds
    • T. Chi, P. Ru, and S. Shamma, "Multiresolution spectrotemporal analysis of complex sounds," J. Acoust. Soc. Am., vol. 118, no. 2, pp. 887-906, 2005.
    • (2005) J. Acoust. Soc. Am. , vol.118 , Issue.2 , pp. 887-906
    • Chi, T.1    Ru, P.2    Shamma, S.3
  • 5
    • 0141520589
    • A non-uniform modulation transform for audio coding with increased time resolution
    • J. Thompson and L. Atlas, "A non-uniform modulation transform for audio coding with increased time resolution," in IEEE Int. Conf. on Acoust. Speech, and Sig. Proc., 2003, vol. 5, pp. V-397.
    • (2003) IEEE Int. Conf. on Acoust. Speech, and Sig. Proc. , vol.5 , pp. V-397
    • Thompson, J.1    Atlas, L.2
  • 7
    • 77956502334
    • Unsupervised feature learning for audio classification using convolutional deep belief networks
    • H. Lee, P. Pham, Y. Largman, and A. Ng, "Unsupervised feature learning for audio classification using convolutional deep belief networks," in Proc. NIPS, 2009.
    • (2009) Proc. NIPS
    • Lee, H.1    Pham, P.2    Largman, Y.3    Ng, A.4
  • 8
    • 34047272330
    • Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations
    • N. Mesgarani, M. Slaney, and S. Shamma, "Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 3, pp. 920-930, 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.3 , pp. 920-930
    • Mesgarani, N.1    Slaney, M.2    Shamma, S.3
  • 9
    • 27144544136
    • Improving word accuracy with Gabor feature extraction
    • M. Kleinschmidt and D. Gelbart, "Improving word accuracy with Gabor feature extraction," in Interspeech, 2002.
    • (2002) Interspeech
    • Kleinschmidt, M.1    Gelbart, D.2
  • 11
    • 80052406394
    • Sound texture perception via statistics of the auditory periphery: Evidence from sound synthesis
    • J. McDermott and E. Simoncelli, "Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis," Neuron, vol. 71, no. 5, pp. 926-940, 2011.
    • (2011) Neuron , vol.71 , Issue.5 , pp. 926-940
    • McDermott, J.1    Simoncelli, E.2
  • 13
    • 79955702502
    • LIBSVM: A library for support vector machines
    • C. Chang and C. Lin, "LIBSVM: A library for support vector machines," ACM Trans. on Intell. Syst., Technol., vol. 2, pp. 27:1-27:27, 2011, Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
    • (2011) ACM Trans. on Intell. Syst., Technol. , vol.2 , pp. 27:1-27:27
    • Chang, C.1    Lin, C.2
  • 14
    • 44849141781
    • Hierarchical large-margin Gaussian mixture models for phonetic classification
    • H. Chang and J. Glass, "Hierarchical large-margin Gaussian mixture models for phonetic classification," in Proc. ASRU. IEEE, 2007, pp. 272-277.
    • (2007) Proc. ASRU. IEEE , pp. 272-277
    • Chang, H.1    Glass, J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.