Volume 20, Issue 5, 2012, Pages 1542-1552

A graphical representation and dissimilarity measure for basic everyday sound events

Author keywords

Audio analysis and synthesis; audio coding

Indexed keywords

AGGREGATE STATE; ATOMIC FILTERS; AUDIO ANALYSIS; AUDIO CODING; DISSIMILARITY FUNCTION; DISSIMILARITY MEASURES; ECOLOGICAL PSYCHOLOGY; EVERYDAY SOUND; FILTER FUNCTION; GRAPH-MATCHING ALGORITHMS; GRAPHICAL REPRESENTATIONS; KERNEL MACHINE; PHYSICAL NATURE; POINT PATTERNS; SPARSE DECOMPOSITION; SPARSE METHODS; TIME FREQUENCY DOMAIN;

EID: 84858650534     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2184752     Document Type: Article
Times cited : (11)

References (30)
  • 1
    • 84963163818
    • How do we hear in the world? Explorations in ecological acoustics
    • W. W. Gaver, "How do we hear in the world? Explorations in ecological acoustics," Ecol. Psychol., vol. 5, no. 4, pp. 285-313, 1993.
    • (1993) Ecol. Psychol. , vol.5 , Issue.4 , pp. 285-313
    • Gaver, W.W.1
  • 4
    • 84857919776
    • Classification of everyday sounds: Influence of the degree of sound source identification
    • O. Houix, G. Lemaitre, N. Misdariis, and P. Susini, "Classification of everyday sounds: Influence of the degree of sound source identification," J. Acoust. Soc. Amer., vol. 123, no. 5, p. 3414, 2008.
    • (2008) J. Acoust. Soc. Amer. , vol.123 , Issue.5 , pp. 3414
    • Houix, O.1    Lemaitre, G.2    Misdariis, N.3    Susini, P.4
  • 5
    • 84858661368
    • Naïve and expert listeners use different strategies to categorize everyday sounds
    • G. Lemaitre, O. Houix, N. Misdariis, and P. Susini, "Naïve and expert listeners use different strategies to categorize everyday sounds," J. Acoust. Soc. Amer., vol. 123, no. 5, p. 3689, 2008.
    • (2008) J. Acoust. Soc. Amer. , vol.123 , Issue.5 , pp. 3689
    • Lemaitre, G.1    Houix, O.2    Misdariis, N.3    Susini, P.4
  • 7
    • 34547645414
    • The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music
    • DOI 10.1121/1.2750160
    • J.-J. Aucouturier, B. Defreville, and F. Pachet, "The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music," J. Acoust. Soc. Amer., vol. 122, no. 2, pp. 881-891, 2007. (Pubitemid 47205542)
    • (2007) Journal of the Acoustical Society of America , vol.122 , Issue.2 , pp. 881-891
    • Aucouturier, J.-J.1    Defreville, B.2    Pachet, F.3
  • 8
    • 51449105193
    • Environmental sound recognition using MP-based features
    • S. Chu, S. Narayanan, and C.-C. J. Kuo, "Environmental sound recognition using MP-based features," in Proc. ICASSP, 2008, pp. 1-4.
    • (2008) Proc. ICASSP , pp. 1-4
    • Chu, S.1    Narayanan, S.2    Kuo, C.-C.J.3
  • 9
    • 0027842081
    • Matching pursuits with time-frequency dictionaries
    • Dec.
    • S. G. Mallat and Z. Zhang, "Matching pursuits with time-frequency dictionaries," IEEE Trans. Signal Process., vol. 41, no. 12, pp. 3397-3415, Dec. 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.12 , pp. 3397-3415
    • Mallat, S.G.1    Zhang, Z.2
  • 10
    • 0037230216
    • Harmonic decomposition of audio signals with matching pursuit
    • Jan.
    • R. Gribonval and E. Bacry, "Harmonic decomposition of audio signals with matching pursuit," IEEE Trans. Signal Process., vol. 51, no. 1, pp. 101-111, Jan. 2003.
    • (2003) IEEE Trans. Signal Process. , vol.51 , Issue.1 , pp. 101-111
    • Gribonval, R.1    Bacry, E.2
  • 11
    • 0026686048
    • Entropy-based algorithms for best basis selection
    • Mar.
    • R. R. Coifman and M. V. Wickerhauser, "Entropy-based algorithms for best basis selection," IEEE Trans. Inf. Theory, vol. 38, no. 2, pp. 713-718, Mar. 1992.
    • (1992) IEEE Trans. Inf. Theory , vol.38 , Issue.2 , pp. 713-718
    • Coifman, R.R.1    Wickerhauser, M.V.2
  • 12
    • 14544277086
    • Efficient coding of time-relative structure using spikes
    • DOI 10.1162/0899766052530839
    • E. Smith and M. S. Lewicki, "Efficient coding of time-relative structure using spikes," Neural Comput., vol. 17, pp. 19-45, 2005. (Pubitemid 40305881)
    • (2005) Neural Computation , vol.17 , Issue.1 , pp. 19-45
    • Smith, E.1    Lewicki, M.S.2
  • 13
    • 84866137432
    • [Online]. Available: http://www.sound-ideas.com
    • "Sound Ideas Sound Database, http://www.sound-ideas.com," [Online]. Available: http://www.sound-ideas.com
    • Sound Ideas Sound Database
  • 14
    • 25444478852
    • A functional model of neural activity patterns and auditory images
    • R. D. Patterson and J. Holdsworth, "A functional model of neural activity patterns and auditory images," Adv. Speech, Hear., Lang. Process., vol. 3, pp. 547-563, 1996.
    • (1996) Adv. Speech, Hear., Lang. Process. , vol.3 , pp. 547-563
    • Patterson, R.D.1    Holdsworth, J.2
  • 15
    • 0025110885
    • Derivation of auditory filter shapes from notched-noise data
    • DOI 10.1016/0378-5955(90)90170-T
    • B. R. Glasberg and B. C. J. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol. 47, pp. 103-138, 1990. (Pubitemid 20244652)
    • (1990) Hearing Research , vol.47 , Issue.1-2 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 16
    • 0024162368
    • Temporal coding of resonances by low-frequency auditory nerve fibers: Single fibre responses and a population model
    • L. H. Carney and C. T. Yin, "Temporal coding of resonances by low-frequency auditory nerve fibers: Single fibre responses and a population model," J. Neurophysiol., vol. 60, pp. 1653-1677, 1988.
    • (1988) J. Neurophysiol. , vol.60 , pp. 1653-1677
    • Carney, L.H.1    Yin, C.T.2
  • 17
    • 0001050571
    • Auditory filters and excitation patterns as representations of frequency resolution
    • B. Moore, Ed. London, U.K.: Academic
    • R. D. Patterson and B. Moore, "Auditory filters and excitation patterns as representations of frequency resolution," in Adv. Speech, Hear. Lang. Process., B. Moore, Ed. London, U.K.: Academic, 1986, pp. 123-177.
    • (1986) Adv. Speech, Hear. Lang. Process. , pp. 123-177
    • Patterson, R.D.1    Moore, B.2
  • 20
    • 0009985115
    • Mel frequency cepstral coefficients for music modeling
    • B. Logan, "Mel frequency cepstral coefficients for music modeling," in Proc. Int. Symp. Music Inf. Retrieval, 2000.
    • (2000) Proc. Int. Symp. Music Inf. Retrieval
    • Logan, B.1
  • 21
    • 33644626634
    • A large set of audio features for sound description (similarity and classification) in the CUIDADO Project
    • Tech. Rep.
    • G. Peeters, "A large set of audio features for sound description (similarity and classification) in the CUIDADO Project," IRCAM, Analysis/Synthesis Team, 2004, Tech. Rep.
    • (2004) IRCAM, Analysis/Synthesis Team
    • Peeters, G.1
  • 22
    • 0002719797
    • The Hungarian method for the assignment problem
    • H. W. Kuhn, "The Hungarian method for the assignment problem," Naval Res. Logist. Quarterly, vol. 2, pp. 83-97, 1955.
    • (1955) Naval Res. Logist. Quarterly , vol.2 , pp. 83-97
    • Kuhn, H.W.1
  • 23
    • 35048848086
    • Learning with distance substitution kernels
    • B. Haasdonk and C. Bahlmann, "Learning with distance substitution kernels," in Proc. 26th DAGM Symp., 2004, pp. 220-227.
    • (2004) Proc. 26th DAGM Symp. , pp. 220-227
    • Haasdonk, B.1    Bahlmann, C.2
  • 25
    • 70350723650
    • A theory of learning with similarity functions
    • M.-F. Balcan, A. Blum, and N. Srebro, "A theory of learning with similarity functions," Mach. Learn. J., vol. 72, no. 1-2, pp. 89-112, 2008.
    • (2008) Mach. Learn. J. , vol.72 , Issue.1-2 , pp. 89-112
    • Balcan, M.-F.1    Blum, A.2    Srebro, N.3
  • 26
    • 33646875697
    • Support vector machines for dyadic data
    • DOI 10.1162/neco.2006.18.6.1472
    • S. Hochreiter and K. Obermayer, "Support vector machines for dyadic data," Neural Comput., vol. 18, no. 6, pp. 1472-1510, 2006. (Pubitemid 43778739)
    • (2006) Neural Computation , vol.18 , Issue.6 , pp. 1472-1510
    • Hochreiter, S.1    Obermayer, K.2
  • 27
    • 37748999259
    • An SMO algorithm for the potential support vector machine
    • S. Hochreiter, T. Knebel, and K. Obermayer, "An SMO algorithm for the potential support vector machine," Neural Comput., vol. 20, no. 1, pp. 271-287, 2008.
    • (2008) Neural Comput. , vol.20 , Issue.1 , pp. 271-287
    • Hochreiter, S.1    Knebel, T.2    Obermayer, K.3
  • 28
    • 0009671230
    • The complex-valued continuous wavelet transform as a preprocessor for auditory scene analysis
    • D. F. Rosenthal and H. G. Okuno, Eds. Hillsdale, NJ, Lawrence Erlbaum Associates
    • L. Solbach, R. Wöhrmann, and J. Kliewer, "The complex-valued continuous wavelet transform as a preprocessor for auditory scene analysis," in Computational Auditory Scene Analysis, D. F. Rosenthal and H. G. Okuno, Eds. Hillsdale, NJ: Lawrence Erlbaum Associates, 1998, pp. 273-291.
    • (1998) Computational Auditory Scene Analysis , pp. 273-291
    • Solbach, L.1    Wöhrmann, R.2    Kliewer, J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.