SCOPUS Information Retrieval Platform

IEEE Transactions on Speech and Audio Processing

Volume 12, Issue 2, 2004, Pages 100-109

Singing Voice Identification Using Spectral Envelope Estimation

(2) Bartsch, Mark A. (a); Wakefield, Gregory H. (a)

(a) UNIVERSITY OF MICHIGAN (United States)

Author keywords

Music information retrieval; Singer identification; Spectral analysis; Vocal tract transfer function

Indexed keywords

COMPUTATIONAL METHODS; DATABASE SYSTEMS; INFORMATION RETRIEVAL; MATHEMATICAL MODELS; NATURAL FREQUENCIES; SPECTRUM ANALYSIS; TRANSFER FUNCTIONS;

MUSIC INFORMATION RETRIEVAL; SINGER IDENTIFICATION; SPECTRAL ENVELOPE ESTIMATION; VOCAL TRACT TRANSFER FUNCTION;

SPEECH PROCESSING;

EID: 2142696853 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2003.822637 Document Type: Article

Times cited : (40)

References (40)

1
- 0003418124
- The Hague, The Netherlands: Mouton
- G. Fant, Acoustic Theory of Speech Production. The Hague, The Netherlands: Mouton, 1960.
- (1960) Acoustic Theory of Speech Production
- Fant, G.¹

2
- 0035212533
- Discrimination functions: Can they be used to classify singing voices
- M. L. Erickson, S. Handel, and S. Perry, "Discrimination functions: Can they be used to classify singing voices," J. Voice, vol. 15, no. 4, pp. 492-502, 2001.
- (2001) J. Voice , vol.15 , Issue.4 , pp. 492-502
- Erickson, M.L.¹ Handel, S.² Perry, S.³

3
- 0010984660
- A rule of thumb: The bandwidth for timbre invariance is one octave
- S. Handel and M. L. Erickson, "A rule of thumb: The bandwidth for timbre invariance is one octave," Music Percept., vol. 19, no. 1, pp. 121-127, 2001.
- (2001) Music Percept. , vol.19 , Issue.1 , pp. 121-127
- Handel, S.¹ Erickson, M.L.²

4
- 0004088662
- Englewood Cliffs, N.J.: Prentice Hall
- I. R. Titze, Principles of Voice Production. Englewood Cliffs, N.J.: Prentice Hall, 1994.
- (1994) Principles of Voice Production
- Titze, I.R.¹

5
- 0004213896
- DeKalb: Northern Illinois Univ. Press
- J. Sundberg, The Science of the Singing Voice. DeKalb: Northern Illinois Univ. Press, 1987.
- (1987) The Science of the Singing Voice
- Sundberg, J.¹

6
- 10044291814
- Singer identification in popular music recordings using voice coding features
- Paris, France
- Y. E. Kim and B. Whitman, "Singer identification in popular music recordings using voice coding features," in Proc. ISMIR 2002: 3rd Int. Conf. Music Information Retrieval, Paris, France, 2002.
- (2002) Proc. ISMIR 2002: 3rd Int. Conf. Music Information Retrieval
- Kim, Y.E.¹ Whitman, B.²

7
- 13444308866
- Using voice segments to improve artist classification of music
- Espoo, Finland
- A. Berenzweig, D. Ellis, and S. Lawrence, "Using voice segments to improve artist classification of music," in AES 22nd Int. Conf., Espoo, Finland, 2002.
- (2002) AES 22nd Int. Conf.
- Berenzweig, A.¹ Ellis, D.² Lawrence, S.³

8
- 0037818558
- A singer identification technique for content-based classification of MP3 music objects
- McLean, VA
- C.-C. Liu and C.-S. Huang, "A singer identification technique for content-based classification of MP3 music objects," in Proc. Conf. Information and Knowledge Management, McLean, VA, 2002, pp. 438-445.
- (2002) Proc. Conf. Information and Knowledge Management , pp. 438-445
- Liu, C.-C.¹ Huang, C.-S.²

9
- 0035783543
- Artist detection in music with minnowmatch
- Falmouth, MA
- B. Whitman, G. Flake, and S. Lawrence, "Artist detection in music with minnowmatch," in Proc. 2001 IEEE Workshop on Neural Networks for Signal Processing, Falmouth, MA, 2001, pp. 559-568.
- (2001) Proc. 2001 IEEE Workshop on Neural Networks for Signal Processing , pp. 559-568
- Whitman, B.¹ Flake, G.² Lawrence, S.³

10
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, no. 4, pp. 357-366, 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

11
- 0031233424
- Speaker recognition: A tutorial
- J. P. Campbell Jr, "Speaker recognition: A tutorial," Proc. IEEE, vol. 85, pp. 1437-1462, 1997.
- (1997) Proc. IEEE , vol.85 , pp. 1437-1462
- Campbell Jr., J.P.¹

12
- 0030247355
- Robust speaker recognition: A feature based approach
- R. Mammone, X. Zhang, and R. P. Ramachandran, "Robust speaker recognition: A feature based approach," Signal Process. Mag., vol. 13, no. 5, pp. 57-71, 1996.
- (1996) Signal Process. Mag. , vol.13 , Issue.5 , pp. 57-71
- Mammone, R.¹ Zhang, X.² Ramachandran, R.P.³

13
- 0031223555
- Recent advances in speaker recognition
- S. Furui, "Recent advances in speaker recognition," Pattern Recognit. Lett., vol. 18, pp. 859-872, 1997.
- (1997) Pattern Recognit. Lett. , vol.18 , pp. 859-872
- Furui, S.¹

14
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- D. Reynolds and R. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Processing, vol. 3, pp. 72-83, 1995.
- (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 72-83
- Reynolds, D.¹ Rose, R.²

15
- 0004723933
- Musical instrument identification: A pattern recognition approach
- K. D. Martin and Y. E. Kim, "Musical instrument identification: A pattern recognition approach," J. Acoust. Soc. Amer., vol. 104, p. 1768, 1998.
- (1998) J. Acoust. Soc. Amer. , vol.104 , p. 1768
- Martin, K.D.¹ Kim, Y.E.²

16
- 2142818116
- Music content analysis through models of audition
- Bristol, U.K.
- K. Martin, E. Scheirer, and B. Vercoe, "Music content analysis through models of audition," in Proc. ACM Multimedia Workshop on Content Processing of Music for Multimedia Applications, Bristol, U.K., 1998.
- (1998) Proc. ACM Multimedia Workshop on Content Processing of Music for Multimedia Applications
- Martin, K.¹ Scheirer, E.² Vercoe, B.³

17
- 0030648077
- Construction and evaluation of a robust multifeature speech/music discriminator
- Munich, Germany
- E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. ICASSP, vol. 2, Munich, Germany, 1997, pp. 1331-1334.
- (1997) Proc. ICASSP , vol.2 , pp. 1331-1334
- Scheirer, E.¹ Slaney, M.²

18
- 0030242072
- Content-based classification, search, and retrieval of audio
- E. Wold, T. Blum, D. Keislar, and J. Wheaton, "Content-based classification, search, and retrieval of audio," IEEE Multimedia, pp. 27-36, 1996.
- (1996) IEEE Multimedia , pp. 27-36
- Wold, E.¹ Blum, T.² Keislar, D.³ Wheaton, J.⁴

19
- 0034273520
- Content-based audio classification and retrieval using the nearest feature line method
- S. Z. Li, "Content-based audio classification and retrieval using the nearest feature line method," IEEE Trans. Speech Audio Processing, vol. 8, pp. 619-625, 2000.
- (2000) IEEE Trans. Speech Audio Processing , vol.8 , pp. 619-625
- Li, S.Z.¹

20
- 0035214634
- Modal distribution analysis, synthesis, and perception of a soprano's sung vowels
- M. Mellody, F. Herseth, and G. H. Wakefield, "Modal distribution analysis, synthesis, and perception of a soprano's sung vowels," J. Voice, vol. 15, no. 4, pp. 469-482, 2001.
- (2001) J. Voice , vol.15 , Issue.4 , pp. 469-482
- Mellody, M.¹ Herseth, F.² Wakefield, G.H.³

21
- 2142654456
- Signal analysis of the singing voice: Low-order representations of singer identity
- Berlin, Germany
- M. Mellody and G. H. Wakefield, "Signal analysis of the singing voice: Low-order representations of singer identity," in Proc. Int. Computer Music Conf. 2000, Berlin, Germany, 2000.
- (2000) Proc. Int. Computer Music Conf. 2000
- Mellody, M.¹ Wakefield, G.H.²

22
- 2142661643
- Ph.D. dissertation, Univ. Michigan, Ann Arbor
- M. Mellody, "Signal Analysis of the Female Singing Voice: Features for Perceptual Singer Identity," Ph.D. dissertation, Univ. Michigan, Ann Arbor, 2001.
- (2001) Signal Analysis of the Female Singing Voice: Features for Perceptual Singer Identity
- Mellody, M.¹

23
- 0003513556
- Englewood Cliffs, NJ: Prentice-Hall
- A. V. Oppenheim and R. W. Schafer, Discrete-Time Signal Processing. Englewood Cliffs, NJ: Prentice-Hall, 1989.
- (1989) Discrete-time Signal Processing
- Oppenheim, A.V.¹ Schafer, R.W.²

24
- 0002646202
- Complex-curve fitting
- E. Levy, "Complex-curve fitting," IRE Trans. Automat. Control, vol. AC-4, pp. 37-44, 1959.
- (1959) IRE Trans. Automat. Control , vol.AC-4 , pp. 37-44
- Levy, E.¹

25
- 0004041275
- Englewood Cliffs, NJ: Prentice-Hall
- J. E. Dennis Jr. and R. Schnabel, Numerical Methods for Unconstrained Optimization and Nonlinear Equations. Englewood Cliffs, NJ: Prentice-Hall, 1983.
- (1983) Numerical Methods for Unconstrained Optimization and Nonlinear Equations
- Dennis Jr., J.E.¹ Schnabel, R.²

26
- 84863772450
- Speech analysis/synthesis based on a sinusoidal representation
- R. J. McAulay and T. F. Quatieri, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, no. 4, pp. 744-754, 1986.
- (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , Issue.4 , pp. 744-754
- McAulay, R.J.¹ Quatieri, T.F.²

27
- 0030009735
- A high-resolution time-frequency representation for musical instrument signals
- W. J. Pielemeier and G. H. Wakefield, "A high-resolution time-frequency representation for musical instrument signals," J. Acoust. Soc. Amer., vol. 99, pp. 2382-2396, 1996.
- (1996) J. Acoust. Soc. Amer. , vol.99 , pp. 2382-2396
- Pielemeier, W.J.¹ Wakefield, G.H.²

28
- 0001835850
- Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
- P. Boersma, "Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound," in Proc. Inst. Phonetic Sciences of the University of Amsterdam, vol. 17, 1993, pp. 97-110.
- (1993) Proc. Inst. Phonetic Sciences of the University of Amsterdam , vol.17 , pp. 97-110
- Boersma, P.¹

29
- 0003733873
- Englewood Cliffs, NJ: Prentice-Hall
- L. Cohen, Time-Frequency Analysis. Englewood Cliffs, NJ: Prentice-Hall, 1995.
- (1995) Time-frequency Analysis
- Cohen, L.¹

30
- 0003487601
- Oxford, U.K.: Oxford Univ. Press
- C. M. Bishop, Neural Networks for Pattern Recognition. Oxford, U.K.: Oxford Univ. Press, 1995.
- (1995) Neural Networks for Pattern Recognition
- Bishop, C.M.¹

31
- 0003946510
- New York: Springer-Verlag
- I. T. Jolliffe, Principal Component Analysis. New York: Springer-Verlag, 1986.
- (1986) Principal Component Analysis
- Jolliffe, I.T.¹

32
- 0032097263
- Boston, MA: Academic
- K. Fukunaga, Introduction to Statistical Pattern Recognition, 2nd ed. Boston, MA: Academic, 1990.
- (1990) Introduction to Statistical Pattern Recognition, 2nd Ed.
- Fukunaga, K.¹

33
- 2142808292
- Automatic Segmentation of Sung Melodies
- Dept. EECS, Univ. Michigan, Ann Arbor, Tech. Rep. 340
- N. H. Adams, "Automatic Segmentation of Sung Melodies," Communications and Signal Processing Laboratory (CSPL), Dept. EECS, Univ. Michigan, Ann Arbor, Tech. Rep. 340, 2003.
- (2003) Communications and Signal Processing Laboratory (CSPL)
- Adams, N.H.¹

34
- 2142695274
- Perceptual recognition of female singing voices
- Philadelphia, PA
- M. Mellody, G. Wakefield, and F. Herseth, "Perceptual recognition of female singing voices," in Proc. Voice Foundation's 30th Annu. Symp.: Care of the Professional Voice, Philadelphia, PA, 2001.
- (2001) Proc. Voice Foundation's 30th Annu. Symp.: Care of the Professional Voice
- Mellody, M.¹ Wakefield, G.² Herseth, F.³

35
- 0028199040
- Perceptual aspects of singing
- J. Sundberg, "Perceptual aspects of singing," J. Voice, vol. 8, no. 2, pp. 106-122, 1994.
- (1994) J. Voice , vol.8 , Issue.2 , pp. 106-122
- Sundberg, J.¹

36
- 0027997572
- Measurements of the vibrato rate of ten singers
- E. Prame, "Measurements of the vibrato rate of ten singers," J. Acoust. Soc. Amer., vol. 96, no. 4, pp. 1979-1984, 1994.
- (1994) J. Acoust. Soc. Amer. , vol.96 , Issue.4 , pp. 1979-1984
- Prame, E.¹

37
- 0030855829
- Vibrato extent and intonation in professional western lyric singing
- E. Prame, "Vibrato extent and intonation in professional western lyric singing," J. Acoust. Soc. Amer., vol. 102, no. 1, pp. 616-621, 1997.
- (1997) J. Acoust. Soc. Amer. , vol.102 , Issue.1 , pp. 616-621
- Prame, E.¹

38
- 0004696359
- Ph.D. dissertation, Univ. Tokyo, Tokyo, Japan
- Y. Meron, "High Quality Singing Synthesis Using the Selection-Based Synthesis Scheme," Ph.D. dissertation, Univ. Tokyo, Tokyo, Japan, 1999.
- (1999) High Quality Singing Synthesis Using the Selection-based Synthesis Scheme
- Meron, Y.¹

39
- 0032770743
- Voice source characteristics in six premier country singers
- J. Sundberg, T. F. Cleveland, R. E. Stone Jr., and J. Iwarsson, "Voice source characteristics in six premier country singers," J. Voice, vol. 13, no. 2, pp. 168-183, 1999.
- (1999) J. Voice , vol.13 , Issue.2 , pp. 168-183
- Sundberg, J.¹ Cleveland, T.F.² Stone, R.E.³ Iwarsson, J.⁴

40
- 0035087414
- Long-term-average spectrum characteristics of country singers during speaking and singing
- T. F. Cleveland, J. Sundberg, and R. E. Stone, "Long-term-average spectrum characteristics of country singers during speaking and singing," J. Voice, vol. 15, no. 1, pp. 54-60, 2001.
- (2001) J. Voice , vol.15 , Issue.1 , pp. 54-60
- Cleveland, T.F.¹ Sundberg, J.² Stone, R.E.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.