SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 20, Issue 5, 2012, Pages 1542-1552

A graphical representation and dissimilarity measure for basic everyday sound events

(5) Adiloǧlu, Kamil a Anniés, Robert b Wahlen, Elio c Purwins, Hendrik d Obermayer, Klaus e

a INRIA (France)

b UNIVERSITY OF BERN (Switzerland)

c HAMBURG UNIVERSITY OF APPLIED SCIENCES (Germany)

d UNIVERSITAT POMPEU FABRA (Spain)

e TECHNISCHE UNIVERSITÄT BERLIN (Germany)

Author keywords

Audio analysis and synthesis; audio coding

Indexed keywords

AGGREGATE STATE; ATOMIC FILTERS; AUDIO ANALYSIS; AUDIO CODING; DISSIMILARITY FUNCTION; DISSIMILARITY MEASURES; ECOLOGICAL PSYCHOLOGY; EVERYDAY SOUND; FILTER FUNCTION; GRAPH-MATCHING ALGORITHMS; GRAPHICAL REPRESENTATIONS; KERNEL MACHINE; PHYSICAL NATURE; POINT PATTERNS; SPARSE DECOMPOSITION; SPARSE METHODS; TIME FREQUENCY DOMAIN;

ECOLOGY; PATTERN MATCHING; TAXONOMIES; VISUALIZATION;

AUDIO ACOUSTICS;

EID: 84858650534 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2012.2184752 Document Type: Article

Times cited : (11)

References (30)

1
- 84963163818
- How do we hear in the world? Explorations in ecological acoustics
- W. W. Gaver, "How do we hear in the world? Explorations in ecological acoustics," Ecol. Psychol., vol. 5, no. 4, pp. 285-313, 1993.
- (1993) Ecol. Psychol. , vol.5 , Issue.4 , pp. 285-313
- Gaver, W.W.¹

2
- 1542355285
- Ph.D. dissertation Cornell Univ., Ithaca, NY
- N. J. Vanderveer, "Ecological acoustics: Human perception of environmental sounds," Ph.D. dissertation, Cornell Univ., Ithaca, NY, 1979.
- (1979) Ecological Acoustics: Human Perception of Environmental Sounds
- Vanderveer, N.J.¹

3
- 84858661367
- Commission Européenne, Tech. Rep.
- O. Houix, G. Lemaitre, N. Misdariis, P. Susini, K. Franinovic, D. Hug, J. Otten, J. Scott, Y. Visell, D. Devallez, F. Fontana, S. Papetti, P. Polotti, and D. Rocchesso, "Everyday sound classification. Part 1: State of the art," 2003, Commission Européenne, Tech. Rep..
- (2003) Everyday Sound Classification. Part 1: State of the Art
- Houix, O.¹ Lemaitre, G.² Misdariis, N.³ Susini, P.⁴ Franinovic, K.⁵ Hug, D.⁶ Otten, J.⁷ Scott, J.⁸ Visell, Y.⁹ Devallez, D.¹⁰ Fontana, F.¹¹ Papetti, S.¹² Polotti, P.¹³ Rocchesso, D.¹⁴

4
- 84857919776
- Classification of everyday sounds: Influence of the degree of sound source identification
- O. Houix, G. Lemaitre, N. Misdariis, and P. Susini, "Classification of everyday sounds: Influence of the degree of sound source identification, " J. Acoust. Soc. Amer., vol. 123, no. 5, p. 3414, 2008.
- (2008) J. Acoust. Soc. Amer. , vol.123 , Issue.5 , pp. 3414
- Houix, O.¹ Lemaitre, G.² Misdariis, N.³ Susini, P.⁴

5
- 84858661368
- Naïve and expert listeners use different strategies to categorize everyday sounds
- G. Lemaitre, O. Houix, N. Misdariis, and P. Susini, "Naïve and expert listeners use different strategies to categorize everyday sounds," J. Acoust. Soc. Amer., vol. 123, no. 5, p. 3689, 2008.
- (2008) J. Acoust. Soc. Amer. , vol.123 , Issue.5 , pp. 3689
- Lemaitre, G.¹ Houix, O.² Misdariis, N.³ Susini, P.⁴

6
- 2942720260
- Features for audio and music classification
- Baltimore, MD
- J. Breebaart and M. McKinney, "Features for audio and music classification," in Proc. Int. Conf. Music Inf. Retrieval, Baltimore, MD, 2003.
- (2003) Proc. Int. Conf. Music Inf. Retrieval
- Breebaart, J.¹ McKinney, M.²

7
- 34547645414
- The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music
- DOI 10.1121/1.2750160
- J.-J. Aucouturier, B. Defreville, and F. Pachet, "The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music," J. Acoust. Soc. Amer., vol. 122, no. 2, pp. 881-891, 2007. (Pubitemid 47205542)
- (2007) Journal of the Acoustical Society of America , vol.122 , Issue.2 , pp. 881-891
- Aucouturier, J.-J.¹ Defreville, B.² Pachet, F.³

8
- 51449105193
- Environmental sound recognition using MP-based features
- S. Chu, S. Narayanan, and C.-C. J. Kuo, "Environmental sound recognition using MP-based features," in Proc. ICASSP, 2008, pp. 1-4.
- (2008) Proc. ICASSP , pp. 1-4
- Chu, S.¹ Narayanan, S.² Kuo, C.-C.J.³

9
- 0027842081
- Matching pursuits with time-frequency dictionaries
- Dec.
- S. G. Mallat and Z. Zhang, "Matching pursuits with time-frequency dictionaries," IEEE Trans. Signal Process., vol. 41, no. 12, pp. 3397-3415, Dec. 1993.
- (1993) IEEE Trans. Signal Process. , vol.41 , Issue.12 , pp. 3397-3415
- Mallat, S.G.¹ Zhang, Z.²

10
- 0037230216
- Harmonic decomposition of audio signals with matching pursuit
- Jan.
- R. Gribonval and E. Bacry, "Harmonic decomposition of audio signals with matching pursuit," IEEE Trans. Signal Process., vol. 51, no. 1, pp. 101-111, Jan. 2003.
- (2003) IEEE Trans. Signal Process. , vol.51 , Issue.1 , pp. 101-111
- Gribonval, R.¹ Bacry, E.²

11
- 0026686048
- Entropy-based algorithms for best basis selection
- Mar.
- R. R. Coifman and M. V. Wickerhauser, "Entropy-based algorithms for best basis selection," IEEE Trans. Inf. Theory, vol. 38, no. 2, pp. 713-718, Mar. 1992.
- (1992) IEEE Trans. Inf. Theory , vol.38 , Issue.2 , pp. 713-718
- Coifman, R.R.¹ Wickerhauser, M.V.²

12
- 14544277086
- Efficient coding of time-relative structure using spikes
- DOI 10.1162/0899766052530839
- E. Smith and M. S. Lewicki, "Efficient coding of time-relative structure using spikes," Neural Comput., vol. 17, pp. 19-45, 2005. (Pubitemid 40305881)
- (2005) Neural Computation , vol.17 , Issue.1 , pp. 19-45
- Smith, E.¹ Lewicki, M.S.²

13
- 84866137432
- [Online].Available http://www.sound-ideas.com
- "Sound Ideas Sound Database, http://www.sound-ideas.com," [Online]. Available: http://www.sound-ideas.com
- Sound Ideas Sound Database

14
- 25444478852
- A functional model of neural activity patterns and auditory images
- R. D. Patterson and J. Holdsworth, "A functional model of neural activity patterns and auditory images," Adv. Speech, Hear., Lang. Process., vol. 3, pp. 547-563, 1996.
- (1996) Adv. Speech, Hear., Lang. Process. , vol.3 , pp. 547-563
- Patterson, R.D.¹ Holdsworth, J.²

15
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- DOI 10.1016/0378-5955(90)90170-T
- B. R. Glasberg and B. C. J. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol. 47, pp. 103-138, 1990. (Pubitemid 20244652)
- (1990) Hearing Research , vol.47 , Issue.1-2 , pp. 103-138
- Glasberg, B.R.¹ Moore, B.C.J.²

16
- 0024162368
- Temporal coding of resonances by lowfrequency auditory nerve fibers: Single fibre responses and a population model
- L. H. Carney and C. T. Yin, "Temporal coding of resonances by lowfrequency auditory nerve fibers: Single fibre responses and a population model," J. Neurophysiol., vol. 60, pp. 1653-1677, 1988.
- (1988) J. Neurophysiol. , vol.60 , pp. 1653-1677
- Carney, L.H.¹ Yin, C.T.²

17
- 0001050571
- Auditory filters and excitation patterns as representations of frequency resolution
- B. Moore, Ed. London, U.K.: Academic
- R. D. Patterson and B. Moore, "Auditory filters and excitation patterns as representations of frequency resolution," in Adv. Speech, Hear. Lang. Process., B. Moore, Ed. London, U.K.: Academic, 1986, pp. 123-177.
- (1986) Adv. Speech, Hear. Lang. Process. , pp. 123-177
- Patterson, R.D.¹ Moore, B.²

18
- 0004213132
- Palo Alto, CA: Interval Research Corp.
- M. Slaney, A Matlab Toolbox for Auditory ModelingWork. Palo Alto, CA: Interval Research Corp., 1998.
- (1998) A Matlab Toolbox for Auditory Modeling Work
- Slaney, M.¹

19
- 0003913694
- Cupertino, CA: Apple Computer
- M. Slaney, An Eifficient Implementation of the Patterson-Holdsworth Auditory Filter Bank. Cupertino, CA: Apple Computer, 1993.
- (1993) An Eifficient Implementation of the Patterson-Holdsworth Auditory Filter Bank
- Slaney, M.¹

20
- 0009985115
- Mel frequency cepstral coefficients for music modeling
- B. Logan, "Mel frequency cepstral coefficients for music modeling," in Proc. Int. Symp. Music Inf. Retrieval, 2000.
- (2000) Proc. Int. Symp. Music Inf. Retrieval
- Logan, B.¹

21
- 33644626634
- A large set of audio features for sound description (similarity and classification) in the CUIDADO Project
- Tech. Rep.
- G. Peeters, "A large set of audio features for sound description (similarity and classification) in the CUIDADO Project," IRCAM, Analysis/Synthesis Team, 2004, Tech. Rep..
- (2004) IRCAM, Analysis/Synthesis Team
- Peeters, G.¹

22
- 0002719797
- The hungarian method for the assignment problem
- H. W. Kuhn, "The hungarian method for the assignment problem," Naval Res. Logist. Quarterly, vol. 2, pp. 83-97, 1955.
- (1955) Naval Res. Logist. Quarterly , vol.2 , pp. 83-97
- Kuhn, H.W.¹

23
- 35048848086
- Learning with distance substitution kernels
- B. Haasdonk and C. Bahlmann, "Learning with distance substitution kernels," in Proc. 26th DAGM Symp., 2004, pp. 220-227.
- (2004) Proc. 26th DAGM Symp. , pp. 220-227
- Haasdonk, B.¹ Bahlmann, C.²

24
- 84899020966
- Classification on pairwise proximity data
- T. Graepel, R. Herbrich, P. Bollmann-Sdorra, and K. Obermayer, "Classification on pairwise proximity data," in Adv. Neural Inf. Process. Syst., 1998, pp. 438-444.
- (1998) Adv. Neural Inf. Process. Syst. , pp. 438-444
- Graepel, T.¹ Herbrich, R.² Bollmann-Sdorra, P.³ Obermayer, K.⁴

25
- 70350723650
- A theory of learning with similarity functions
- M.-F. Balcan, A. Blum, and N. Srebro, "A theory of learning with similarity functions," Mach. Learn. J., vol. 72, no. 1-2, pp. 89-112, 2008.
- (2008) Mach. Learn.J. , vol.72 , Issue.1-2 , pp. 89-112
- Balcan, M.-F.¹ Blum, A.² Srebro, N.³

26
- 33646875697
- Support vector machines for dyadic data
- DOI 10.1162/neco.2006.18.6.1472
- S. Hochreiter and K. Obermayer, "Support vector machines for dyadic data," Neural Comput., vol. 18, no. 6, pp. 1472-1510, 2006. (Pubitemid 43778739)
- (2006) Neural Computation , vol.18 , Issue.6 , pp. 1472-1510
- Hochreiter, S.¹ Obermayer, K.²

27
- 37748999259
- An SMO algorithm for the potential support vector machine
- S. Hochreiter, T. Knebel, and K. Obermayer, "An SMO algorithm for the potential support vector machine," Neural Comput., vol. 20, no. 1, pp. 271-287, 2008.
- (2008) Neural Comput. , vol.20 , Issue.1 , pp. 271-287
- Hochreiter, S.¹ Knebel, T.² Obermayer, K.³

28
- 0009671230
- The complex-valued continuous wavelet transform as a preprocessor for auditory scene analysis
- D. F. Rosenthal and H. G. Okuno, Eds. Hillsdale, NJ, Lawrence Erlbaum Associates
- L. Solbach, R. Wöhrmann, and J. Kliewer, "The complex-valued continuous wavelet transform as a preprocessor for auditory scene analysis," in Computational Auditory Scene Analysis, D. F. Rosenthal and H. G. Okuno, Eds. Hillsdale, NJ: Lawrence Erlbaum Associates, 1998, pp. 273-291.
- (1998) Computational Auditory Scene Analysis , pp. 273-291
- Solbach, L.¹ Wöhrmann, R.² Kliewer, J.³

29
- 0034843832
- The Earth Mover's distance is the Mallows distance: Some insights from statistics
- E. Levina and P. Bickel, "The earth mover's distance is the mallows distance: Some insights from statistics," in Proc. IEEE Int. Conf. Comput. Vis., 2001, pp. 251-256. (Pubitemid 32795066)
- (2001) Proceedings of the IEEE International Conference on Computer Vision , vol.2 , pp. 251-256
- Levina, E.¹ Bickel, P.²

30
- 0017930815
- Dynamic programming algorithm optimization for spoken word recognition
- H. Sakoe and S. Chiba, "Dynamic programming algorithm optimization for spoken word recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-26, no. 1, pp. 43-49, Jan. 1978. (Pubitemid 8601900)
- (1978) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-26 , Issue.1 , pp. 43-49
- Sakoe Hiroaki¹ Chiba Seibi²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.