Volume 130, Issue 5, 2011, Pages 2902-2916

The Timbre Toolbox: Extracting audio descriptors from musical signals

Author keywords

[No Author keywords available]

Indexed keywords

AUDITORY MODELS; COMPLEX SOUNDS; CORRELATIONAL ANALYSIS; DESCRIPTIVE STATISTICS; DESCRIPTORS; ENERGETIC PROPERTIES; ENERGY ENVELOPE; HIERARCHICAL CLUSTERING; INFORMATION REDUNDANCIES; MACHINE-LEARNING; MUSIC INFORMATION RETRIEVAL; MUSICAL SIGNALS; SHORT-TERM FOURIER TRANSFORM; SINGLE-VALUE; SINUSOIDAL COMPONENTS; TIME VARYING;

EID: 81355164049     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.3642604     Document Type: Article
Times cited: 307

References (50)
  • 1
    • 0005787306
    • Musical instrument identification using autocorrelation coefficients
    • (Leavenworth, Washington)
    • Brown, J. (1998). Musical instrument identification using autocorrelation coefficients, in Proc. Intern. Symposium on Musical Acoustics (Leavenworth, Washington), pp. 291-295.
    • (1998) Proc. Intern. Symposium on Musical Acoustics , pp. 291-295
    • Brown, J.1
  • 2
    • 0035100239
    • Feature dependence in the automatic identification of musical woodwind instruments
    • DOI 10.1121/1.1342075
    • Brown, J., Houix, O., and McAdams, S. (2001). Feature dependence in the automatic identification of musical woodwind instruments, J. Acoust. Soc. Am. 109, 1064-1072. (Pubitemid 32215912)
    • (2001) Journal of the Acoustical Society of America , vol.109 , Issue.3 , pp. 1064-1072
    • Brown, J.C.1    Houix, O.2    McAdams, S.3
  • 3
    • 22144448560
    • Acoustic correlates of timbre space dimensions: A confirmatory study using synthetic tones
    • DOI 10.1121/1.1929229
    • Caclin, A., McAdams, S., Smith, B., and Winsberg, S. (2005). Acoustic correlates of timbre space dimensions: A confirmatory study using synthetic tones, J. Acoust. Soc. Am. 118, 471-482. 10.1121/1.1929229 (Pubitemid 40981501)
    • (2005) Journal of the Acoustical Society of America , vol.118 , Issue.1 , pp. 471-482
    • Caclin, A.1    McAdams, S.2    Smith, B.K.3    Winsberg, S.4
  • 4
    • 52449117078
    • A sawtooth waveform inspired pitch estimator for speech and music
    • 10.1121/1.2951592
    • Camacho, A., and Harris, J. (2008). A sawtooth waveform inspired pitch estimator for speech and music, J. Acoust. Soc. Am. 124, 1638-1652. 10.1121/1.2951592
    • (2008) J. Acoust. Soc. Am. , vol.124 , pp. 1638-1652
    • Camacho, A.1    Harris, J.2
  • 5
    • 0036214787
    • YIN, a fundamental frequency estimator for speech and music
    • DOI 10.1121/1.1458024
    • de Cheveigné, A., and Kawahara, H. (2002). YIN, a fundamental frequency estimator for speech and music, J. Acoust. Soc. Am. 111, 1917-1930. 10.1121/1.1458024 (Pubitemid 34297247)
    • (2002) Journal of the Acoustical Society of America , vol.111 , Issue.4 , pp. 1917-1930
    • De Cheveigne, A.1    Kawahara, H.2
  • 6
    • 85159570327
    • Machine recognition of timbre using steady-state tone of acoustical musical instruments
    • (University of Michigan, Ann Arbor, MI)
    • Fujinaga, I. (1998). Machine recognition of timbre using steady-state tone of acoustical musical instruments, in Proc. of Int. Computer Music Conference (University of Michigan, Ann Arbor, MI), pp. 207-210.
    • (1998) Proc. of Int. Computer Music Conference , pp. 207-210
    • Fujinaga, I.1
  • 7
    • 85159262837
    • Realtime recognition of orchestral instruments
    • (Berlin, Germany)
    • Fujinaga, I., and MacMillan, K. (2000). Realtime recognition of orchestral instruments, in Proc. of Int. Computer Music Conference (Berlin, Germany), pp. 241-243.
    • (2000) Proc. of Int. Computer Music Conference , pp. 241-243
    • Fujinaga, I.1    MacMillan, K.2
  • 8
    • 77950565560
    • Integration of acoustical information in the perception of impacted sound sources: The role of information accuracy and exploitability
    • Giordano, B. L., Rocchesso, D., and McAdams, S. (2010). Integration of acoustical information in the perception of impacted sound sources: The role of information accuracy and exploitability, J. Exp. Psychol. 36, 462-479.
    • (2010) J. Exp. Psychol. , vol.36 , pp. 462-479
    • Giordano, B.L.1    Rocchesso, D.2    McAdams, S.3
  • 9
    • 0023371251
    • The perceptual attack time of musical tones
    • 10.1121/1.395441
    • Gordon, J. (1987). The perceptual attack time of musical tones, J. Acoust. Soc. Am. 82, 88-105. 10.1121/1.395441
    • (1987) J. Acoust. Soc. Am. , vol.82 , pp. 88-105
    • Gordon, J.1
  • 10
    • 0004241258
    • 2nd ed. (Chapman Hall/CRC, Boca Raton, FL)
    • Gordon, A. D. (1999). Classification, 2nd ed. (Chapman Hall/CRC, Boca Raton, FL), pp. 1-256.
    • (1999) Classification , pp. 1-256
    • Gordon, A.D.1
  • 11
    • 0017595342
    • Multidimensional perceptual scaling of musical timbres
    • Grey, J. (1977). Multidimensional perceptual scaling of musical timbres, J. Acoust. Soc. Am. 61, 1270-1277. 10.1121/1.381428 (Pubitemid 8086808)
    • (1977) Journal of the Acoustical Society of America , vol.61 , Issue.5 , pp. 1270-1277
    • Grey, J.M.1
  • 12
    • 0018139926
    • Perceptual effects of spectral modifications on musical timbres
    • Grey, J., and Gordon, J. (1978). Perceptual effects of spectral modifications on musical timbres, J. Acoust. Soc. Am. 63, 1493-1500. 10.1121/1.381843 (Pubitemid 8346203)
    • (1978) Journal of the Acoustical Society of America , vol.63 , Issue.5 , pp. 1493-1500
    • Grey, J.M.1    Gordon, J.W.2
  • 13
    • 44649115080
    • Towards instrument segmentation for music content description: A critical review of instrument classification techniques
    • Plymouth, MA (University of Massachusetts, Amherst, MA)
    • Herrera, P., Amatriain, X., Batlle, E., and Serra, X. (2000). Towards instrument segmentation for music content description: A critical review of instrument classification techniques, in International Symposium on Music Information Retrieval (ISMIR 2000), Plymouth, MA (University of Massachusetts, Amherst, MA), pp. 23-25.
    • (2000) International Symposium on Music Information Retrieval (ISMIR 2000) , pp. 23-25
    • Herrera, P.1    Amatriain, X.2    Batlle, E.3    Serra, X.4
  • 14
    • 0000008146
    • Comparing partitions
    • Hubert, L. J., and Arabie, P. (1985). Comparing partitions, J. Classif. 2, 193-218.
    • (1985) J. Classif. , vol.2 , pp. 193-218
    • Hubert, L.J.1    Arabie, P.2
  • 17
    • 0023963510
    • Transform coding of audio signals using perceptual noise criteria
    • DOI 10.1109/49.608
    • Johnston, J. (1988). Transform coding of audio signals using perceptual noise criteria, IEEE J. Sel. Areas Commun. 6 (2), 314-323. 10.1109/49.608 (Pubitemid 18596873)
    • (1988) IEEE Journal on Selected Areas in Communications , vol.6 , Issue.2 , pp. 314-323
    • Johnston, J.D.1
  • 18
    • 2942572739
    • Perceptual and acoustical features of natural and synthetic orchestral instrument tones
    • Kendall, R., Carterette, E., and Hajda, J. (1999). Perceptual and acoustical features of natural and synthetic orchestral instrument tones, Music Percept. 16 (3), 327-364.
    • (1999) Music Percept. , vol.16 , Issue.3 , pp. 327-364
    • Kendall, R.1    Carterette, E.2    Hajda, J.3
  • 20
    • 0028428899
    • Caractérisation du timbre des sons complexes. II. Analyses acoustiques et quantification psychophysique (Characterization of the timbre of complex sounds. II Acoustical analysis and psychophysical quantification)
    • Krimphoff, J., McAdams, S., and Winsberg, S. (1994). Caractérisation du timbre des sons complexes. II. Analyses acoustiques et quantification psychophysique (Characterization of the timbre of complex sounds. II Acoustical analysis and psychophysical quantification), J. Phys. 4, 625-628.
    • (1994) J. Phys. , vol.4 , pp. 625-628
    • Krimphoff, J.1    McAdams, S.2    Winsberg, S.3
  • 21
    • 0002477067
    • Why is musical timbre so hard to understand?
    • edited by S. Nielzén and O. Olsson (Excerpta Medica, Amsterdam, Netherlands)
    • Krumhansl, C. L. (1989). Why is musical timbre so hard to understand?, in Structure and Perception of Electroacoustic Sound and Music, edited by S. Nielzén and O. Olsson (Excerpta Medica, Amsterdam, Netherlands), pp. 43-51.
    • (1989) Structure and Perception of Electroacoustic Sound and Music , pp. 43-51
    • Krumhansl, C.L.1
  • 22
    • 0034293572
    • A common perceptual space for harmonic and percussive timbres
    • 10.3758/BF03212144
    • Lakatos, S. (2000). A common perceptual space for harmonic and percussive timbres, Percept. Psychophys. 62 (7), 1426-1439. 10.3758/BF03212144
    • (2000) Percept. Psychophys. , vol.62 , Issue.7 , pp. 1426-1439
    • Lakatos, S.1
  • 23
    • 0028210066
    • Fundamental frequency estimation of musical signals using a two-way mismatch procedure
    • Maher, R., and Beauchamp, J. (1994). Fundamental frequency estimation of musical signals using a two-way mismatch procedure, J. Acoust. Soc. Am. 95 (4), 2254-2263. 10.1121/1.408685 (Pubitemid 24113060)
    • (1994) Journal of the Acoustical Society of America , vol.95 , Issue.4 , pp. 2254-2263
    • Maher, R.C.1    Beauchamp, J.W.2
  • 24
    • 33846212762
    • The effect of fundamental frequency on the brightness dimension of timbre
    • DOI 10.1121/1.2384910
    • Marozeau, J., and de Cheveigné, A. (2007). The effect of fundamental frequency on the brightness dimension of timbre, J. Acoust. Soc. Am. 121 (1), 383-387. 10.1121/1.2384910 (Pubitemid 46102785)
    • (2007) Journal of the Acoustical Society of America , vol.121 , Issue.1 , pp. 383-387
    • Marozeau, J.1    De Cheveigne, A.2
  • 27
    • 0003251346
    • Recognition of sound sources and events
    • edited by S. McAdams and E. Bigand (Oxford University Press, Oxford, UK)
    • McAdams, S. (1993). Recognition of sound sources and events, in Thinking in Sound: The Cognitive Psychology of Human Audition, edited by S. McAdams and E. Bigand (Oxford University Press, Oxford, UK), pp. 146-198.
    • (1993) Thinking in Sound: The Cognitive Psychology of Human Audition , pp. 146-198
    • McAdams, S.1
  • 28
    • 0029442124
    • Perceptual scaling of synthesized musical timbres: Common dimensions, specificities, and latent subject classes
    • 10.1007/BF00419633
    • McAdams, S., Winsberg, S., Donnadieu, S., De Soete, G., and Krimphoff, J. (1995). Perceptual scaling of synthesized musical timbres: Common dimensions, specificities, and latent subject classes, Psychol. Res. 58, 177-192. 10.1007/BF00419633
    • (1995) Psychol. Res. , vol.58 , pp. 177-192
    • McAdams, S.1    Winsberg, S.2    Donnadieu, S.3    De Soete, G.4    Krimphoff, J.5
  • 29
    • 84863772450
    • Speech analysis/synthesis based on a sinusoidal representation
    • 10.1109/TASSP.1986.1164910
    • McAulay, R., and Quatieri, T. (1986). Speech analysis/synthesis based on a sinusoidal representation, IEEE Trans. Acoust. Speech Signal Process. 34 (4), 744-754. 10.1109/TASSP.1986.1164910
    • (1986) IEEE Trans. Acoust. Speech Signal Process. , vol.34 , Issue.4 , pp. 744-754
    • McAulay, R.1    Quatieri, T.2
  • 30
    • 0016556010
    • Perceptual space for musical structures
    • 10.1121/1.380719
    • Miller, J., and Carterette, E. (1975). Perceptual space for musical structures, J. Acoust. Soc. Am. 58, 711-720. 10.1121/1.380719
    • (1975) J. Acoust. Soc. Am. , vol.58 , pp. 711-720
    • Miller, J.1    Carterette, E.2
  • 31
    • 0000228352
    • A Monte Carlo study of thirty internal criterion measures for cluster analysis
    • 10.1007/BF02293899
    • Milligan, G. W. (1981). A Monte Carlo study of thirty internal criterion measures for cluster analysis, Psychometrika 46, 187-199. 10.1007/BF02293899
    • (1981) Psychometrika , vol.46 , pp. 187-199
    • Milligan, G.W.1
  • 32
    • 77952505768
    • Validation of a multidimensional distance model for perceptual dissimilarities among musical timbres
    • 10.1121/1.421751
    • Misdariis, N., Smith, B., Pressnitzer, D., Susini, P., and McAdams, S. (1998). Validation of a multidimensional distance model for perceptual dissimilarities among musical timbres, J. Acoust. Soc. Am. 103, 3005-3006. 10.1121/1.421751
    • (1998) J. Acoust. Soc. Am. , vol.103 , pp. 3005-3006
    • Misdariis, N.1    Smith, B.2    Pressnitzer, D.3    Susini, P.4    McAdams, S.5
  • 33
    • 0020816083
    • Suggested formulae for calculating auditory-filter bandwidths and excitation patterns
    • Moore, B. C. J., and Glasberg, B. R. (1983). Suggested formulae for calculating auditory-filter bandwidths and excitation patterns, J. Acoust. Soc. Am. 74, 750-753. 10.1121/1.389861 (Pubitemid 13019047)
    • (1983) Journal of the Acoustical Society of America , vol.74 , Issue.3 , pp. 750-753
    • Moore, B.C.J.1    Glasberg, B.R.2
  • 37
    • 85159322000
    • Instrument sound description in the context of MPEG-7
    • Berlin, Germany (ICMA, San Francisco)
    • Peeters, G., McAdams, S., and Herrera, P. (2000). Instrument sound description in the context of MPEG-7, in Proc. of Int. Computer Music Conference, Berlin, Germany (ICMA, San Francisco).
    • (2000) Proc. of Int. Computer Music Conference
    • Peeters, G.1    McAdams, S.2    Herrera, P.3
  • 38
    • 0002013896
    • Timbre as a multidimensional attribute of complex tones
    • edited by R. Plomp and G. F. Smoorenburg (Sijthoff, Leiden)
    • Plomp, R. (1970). Timbre as a multidimensional attribute of complex tones, in Frequency Analysis and Periodicity Detection in Hearing, edited by R. Plomp and G. F. Smoorenburg (Sijthoff, Leiden), pp. 397-414.
    • (1970) Frequency Analysis and Periodicity Detection in Hearing , pp. 397-414
    • Plomp, R.1
  • 39
    • 0020165237
    • A tristimulus method for the specification of musical timbre
    • Pollard, H., and Jansson, E. (1982). A tristimulus method for the specification of musical timbre, Acustica 51, 162-171.
    • (1982) Acustica , vol.51 , pp. 162-171
    • Pollard, H.1    Jansson, E.2
  • 41
    • 79955755311
    • SAS Institute Inc. (SAS Institute Inc., Cary, NC)
    • SAS Institute Inc. (2010). SAS/Stat 9.22 User's Guide (SAS Institute Inc., Cary, NC).
    • (2010) SAS/Stat 9.22 User's Guide
  • 42
    • 0030648077
    • Construction and evaluation of a robust multifeature speech/music discriminator
    • Munich, Germany (IEEE Computer Society Press, Los Alamitos, CA), Vol
    • Scheirer, E., and Slaney, M. (1997). Construction and evaluation of a robust multifeature speech/music discriminator, in Proc. of IEEE Int. Conference on Acoustic Speech and Signal Processing, Munich, Germany (IEEE Computer Society Press, Los Alamitos, CA), Vol. 2, pp. 1331-1334.
    • (1997) Proc. of IEEE Int. Conference on Acoustic Speech and Signal Processing , vol.2 , pp. 1331-1334
    • Scheirer, E.1    Slaney, M.2
  • 43
    • 0025544510
    • Spectral modeling synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition
    • Serra, X., and Smith III, J. (1990). Spectral modeling synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition, Comput. Music J. 14, 12-24. (Pubitemid 21695923)
    • (1990) Computer Music Journal , vol.14 , Issue.4 , pp. 12-24
    • Serra, X.1    Smith, J.O.2
  • 44
    • 81355161666
    • Auditory Toolbox, Version 2, Technical Report No: 1998-010 (Interval Research Corporation)
    • Slaney, M. (1998). Auditory Toolbox, Version 2, Technical Report No: 1998-010 (Interval Research Corporation).
    • (1998)
    • Slaney, M.1
  • 46
    • 0015273930
    • Dimension analysis of the perception of instrumental timbre
    • 10.1111/j.1467-9450.1972.tb00071.x
    • Wedin, L., and Goude, G. (1972). Dimension analysis of the perception of instrumental timbre, Scand. J. Psychol. 13, 228-240. 10.1111/j.1467-9450.1972.tb00071.x
    • (1972) Scand. J. Psychol. , vol.13 , pp. 228-240
    • Wedin, L.1    Goude, G.2
  • 47
    • 0001894143
    • Timbre space as a musical control structure
    • 10.2307/3680283
    • Wessel, D. (1979). Timbre space as a musical control structure, Comput. Music J. 3, 45-52. 10.2307/3680283
    • (1979) Comput. Music J. , vol.3 , pp. 45-52
    • Wessel, D.1
  • 48
    • 0008589959
    • Psychoacoustics and music: A report from Michigan State University
    • Wessel, D. L. (1973). Psychoacoustics and music: A report from Michigan State University, PACE: Bull. Comput. Arts Soc. 30, 1-2.
    • (1973) PACE: Bull. Comput. Arts Soc. , vol.30 , pp. 1-2
    • Wessel, D.L.1
  • 50
    • 84953656445
    • Subdivision of the audible frequency range into critical bands (Frequenzgruppen)
    • 10.1121/1.1908630
    • Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands (Frequenzgruppen), J. Acoust. Soc. Am. 33, 248. 10.1121/1.1908630
    • (1961) J. Acoust. Soc. Am. , vol.33 , pp. 248
    • Zwicker, E.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.