메뉴 건너뛰기




Volumn 12, Issue 2, 2004, Pages 100-109

Singing Voice Identification Using Spectral Envelope Estimation

Author keywords

Music information retrieval; Singer identification; Spectral analysis; Vocal tract transfer function

Indexed keywords

COMPUTATIONAL METHODS; DATABASE SYSTEMS; INFORMATION RETRIEVAL; MATHEMATICAL MODELS; NATURAL FREQUENCIES; SPECTRUM ANALYSIS; TRANSFER FUNCTIONS;

EID: 2142696853     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2003.822637     Document Type: Article
Times cited : (40)

References (40)
  • 2
    • 0035212533 scopus 로고    scopus 로고
    • Discrimination functions: Can they be used to classify singing voices
    • M. L. Erickson, S. Handel, and S. Perry, "Discrimination functions: Can they be used to classify singing voices," J. Voice, vol. 15, no. 4, pp. 492-502, 2001.
    • (2001) J. Voice , vol.15 , Issue.4 , pp. 492-502
    • Erickson, M.L.1    Handel, S.2    Perry, S.3
  • 3
    • 0010984660 scopus 로고    scopus 로고
    • A rule of thumb: The bandwidth for timbre invariance is one octave
    • S. Handel and M. L. Erickson, "A rule of thumb: The bandwidth for timbre invariance is one octave," Music Percept., vol. 19, no. 1, pp. 121-127, 2001.
    • (2001) Music Percept. , vol.19 , Issue.1 , pp. 121-127
    • Handel, S.1    Erickson, M.L.2
  • 7
    • 13444308866 scopus 로고    scopus 로고
    • Using voice segments to improve artist classification of music
    • Espoo, Finland
    • A. Berenzweig, D. Ellis, and S. Lawrence, "Using voice segments to improve artist classification of music," in AES 22nd Int. Conf., Espoo, Finland, 2002.
    • (2002) AES 22nd Int. Conf.
    • Berenzweig, A.1    Ellis, D.2    Lawrence, S.3
  • 8
    • 0037818558 scopus 로고    scopus 로고
    • A singer identification technique for content-based classification of MP3 music objects
    • McLean, VA
    • C.-C. Liu and C.-S. Huang, "A singer identification technique for content-based classification of MP3 music objects," in Proc. Conf. Information and Knowledge Management, McLean, VA, 2002, pp. 438-445.
    • (2002) Proc. Conf. Information and Knowledge Management , pp. 438-445
    • Liu, C.-C.1    Huang, C.-S.2
  • 10
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences "IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, no. 4, pp. 357-366, 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 11
    • 0031233424 scopus 로고    scopus 로고
    • Speaker recognition: A tutorial
    • J. P. Campbell Jr, "Speaker recognition: A tutorial," Proc. IEEE, vol. 85, pp. 1437-1462, 1997.
    • (1997) Proc. IEEE , vol.85 , pp. 1437-1462
    • Campbell Jr., J.P.1
  • 12
    • 0030247355 scopus 로고    scopus 로고
    • Robust speaker recognition: A feature based approach
    • R. Mammone, X. Zhang, and R. P. Ramachandran, "Robust speaker recognition: A feature based approach," Signal Process. Mag., vol. 13, no. 5, pp. 57-71, 1996.
    • (1996) Signal Process. Mag. , vol.13 , Issue.5 , pp. 57-71
    • Mammone, R.1    Zhang, X.2    Ramachandran, R.P.3
  • 13
    • 0031223555 scopus 로고    scopus 로고
    • Recent advances in speaker recognition
    • S. Furui, "Recent advances in speaker recognition," Pattern Recognit. Lett., vol. 18, pp. 859-872, 1997.
    • (1997) Pattern Recognit. Lett. , vol.18 , pp. 859-872
    • Furui, S.1
  • 14
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D. Reynolds and R. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Processing, vol. 3, pp. 72-83, 1995.
    • (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 72-83
    • Reynolds, D.1    Rose, R.2
  • 15
    • 0004723933 scopus 로고    scopus 로고
    • Musical instrument identification: A pattern recognition approach
    • K. D. Martin and Y. E. Kim, "Musical instrument identification: A pattern recognition approach," J. Acoust. Soc. Amer., vol. 104, p. 1768, 1998.
    • (1998) J. Acoust. Soc. Amer. , vol.104 , pp. 1768
    • Martin, K.D.1    Kim, Y.E.2
  • 17
    • 0030648077 scopus 로고    scopus 로고
    • Construction and evaluation of a robust multifeature speech/music discriminator
    • Munich, Germany
    • E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. ICASSP, vol. 2, Munich, Germany, 1997, pp. 1331-1334.
    • (1997) Proc. ICASSP , vol.2 , pp. 1331-1334
    • Scheirer, E.1    Slaney, M.2
  • 18
    • 0030242072 scopus 로고    scopus 로고
    • Content-based classification, search, and retrieval of audio
    • E. Wold, T. Blum, D. Keislar, and J. Wheaton, "Content-based classification, search, and retrieval of audio," IEEE Multimedia, pp. 27-36, 1996.
    • (1996) IEEE Multimedia , pp. 27-36
    • Wold, E.1    Blum, T.2    Keislar, D.3    Wheaton, J.4
  • 19
    • 0034273520 scopus 로고    scopus 로고
    • Content-based audio classification and retrieval using the nearest feature line method
    • S. Z. Li, "Content-based audio classification and retrieval using the nearest feature line method," IEEE Trans. Speech Audio Processing, vol. 8, pp. 619-625, 2000.
    • (2000) IEEE Trans. Speech Audio Processing , vol.8 , pp. 619-625
    • Li, S.Z.1
  • 20
    • 0035214634 scopus 로고    scopus 로고
    • Modal distribution analysis, synthesis, and perception of a soprano's sung vowels
    • M. Mellody, F. Herseth, and G. H. Wakefield, "Modal distribution analysis, synthesis, and perception of a soprano's sung vowels," J. Voice, vol. 15, no. 4, pp. 469-82, 2001.
    • (2001) J. Voice , vol.15 , Issue.4 , pp. 469-482
    • Mellody, M.1    Herseth, F.2    Wakefield, G.H.3
  • 21
    • 2142654456 scopus 로고    scopus 로고
    • Signal analysis of the singing voice: Low-order representations of singer identity
    • Berlin, Germany
    • M. Mellody and G. H. Wakefield, "Signal analysis of the singing voice: Low-order representations of singer identity," in Proc. Int. Computer Music Conf. 2000, Berlin, Germany, 2000.
    • (2000) Proc. Int. Computer Music Conf. 2000
    • Mellody, M.1    Wakefield, G.H.2
  • 24
    • 0002646202 scopus 로고
    • Complex-curve fitting
    • E. Levy, "Complex-curve fitting," IRE Trans. Automat. Control, vol. AC-4, pp. 37-44, 1959.
    • (1959) IRE Trans. Automat. Control , vol.AC-4 , pp. 37-44
    • Levy, E.1
  • 26
    • 84863772450 scopus 로고
    • Speech analysis/synthesis based on a sinusoidal representation
    • R. J. McAulay and T. F. Quatieri, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, no. 4, pp. 744-754, 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , Issue.4 , pp. 744-754
    • McAulay, R.J.1    Quatieri, T.F.2
  • 27
    • 0030009735 scopus 로고    scopus 로고
    • A high-resolution time-frequency representation for musical instrument signals
    • W. J. Pielemeier and G. H. Wakefield, "A high-resolution time-frequency representation for musical instrument signals," J. Acoust. Soc. Amer., vol. 99, pp. 2382-96, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.99 , pp. 2382-2396
    • Pielemeier, W.J.1    Wakefield, G.H.2
  • 28
    • 0001835850 scopus 로고
    • Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
    • P. Boersma, "Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound," in Proc. Inst. Phonetic Sciences of the University of Amsterdam, vol. 17, 1993, pp. 97-110.
    • (1993) Proc. Inst. Phonetic Sciences of the University of Amsterdam , vol.17 , pp. 97-110
    • Boersma, P.1
  • 29
  • 33
    • 2142808292 scopus 로고    scopus 로고
    • Automatic Segmentation of Sung Melodies
    • Dept. EECS, Univ. Michigan, Ann Arbor, Tech. Rep. 340
    • N. H. Adams, "Automatic Segmentation of Sung Melodies," Communications and Signal Processing Laboratory (CSPL), Dept. EECS, Univ. Michigan, Ann Arbor, Tech. Rep. 340, 2003.
    • (2003) Communications and Signal Processing Laboratory (CSPL)
    • Adams, N.H.1
  • 35
    • 0028199040 scopus 로고
    • Perceptual aspects of singing
    • J. Sundberg, "Perceptual aspects of singing," J. Voice, vol. 8, no. 2, pp. 106-22, 1994.
    • (1994) J. Voice , vol.8 , Issue.2 , pp. 106-122
    • Sundberg, J.1
  • 36
    • 0027997572 scopus 로고
    • Measurements of the vibrato rate of ten singers
    • E. Frame, "Measurements of the vibrato rate of ten singers," J. Acoust. Soc. Amer., vol. 96, no. 4, pp. 1979-1984, 1994.
    • (1994) J. Acoust. Soc. Amer. , vol.96 , Issue.4 , pp. 1979-1984
    • Frame, E.1
  • 37
    • 0030855829 scopus 로고    scopus 로고
    • Vibrato extent and intonation in professional western lyric singing
    • _, "Vibrato extent and intonation in professional western lyric singing," J. Acoust. Soc. Amer., vol. 102, no. 1, pp. 616-621, 1997.
    • (1997) J. Acoust. Soc. Amer. , vol.102 , Issue.1 , pp. 616-621
  • 39
    • 0032770743 scopus 로고    scopus 로고
    • Voice source characteristics in six premier country singers
    • J. Sundberg, T. F. Cleveland, J. Stone, R. E, and J. Iwarsson, "Voice source characteristics in six premier country singers," J. Voice, vol. 13, no. 2, pp. 168-83, 1999.
    • (1999) J. Voice , vol.13 , Issue.2 , pp. 168-183
    • Sundberg, J.1    Cleveland, T.F.2    Stone, J.3    Iwarsson, R.E.J.4
  • 40
    • 0035087414 scopus 로고    scopus 로고
    • Long-term-average spectrum characteristics of country singers during speaking and singing
    • T. F. Cleveland, J. Sundberg, and R. E. Stone, "Long-term-average spectrum characteristics of country singers during speaking and singing," J. Voice, vol. 15, no. 1, pp. 54-60, 2001.
    • (2001) J. Voice , vol.15 , Issue.1 , pp. 54-60
    • Cleveland, T.F.1    Sundberg, J.2    Stone, R.E.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.