메뉴 건너뛰기




Volumn 24, Issue 2, 2007, Pages 67-79

Synthesis of the singing voice by performance sampling and spectral models

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC EQUIPMENT; SOUND REPRODUCTION; SPECTRUM ANALYSIS; SPEECH SYNTHESIS;

EID: 85032751318     PISSN: 10535888     EISSN: None     Source Type: Journal    
DOI: 10.1109/MSP.2007.323266     Document Type: Review
Times cited : (76)

References (43)
  • 1
    • 85032751306 scopus 로고    scopus 로고
    • "Corpus-based concatenative synthesis"
    • Mar
    • D. Schwarz, "Corpus-based concatenative synthesis," IEEE Signal Processing Mag., vol. 24, no. 2, pp. 92-104, Mar. 2007.
    • (2007) IEEE Signal Processing Mag. , vol.24 , Issue.2 , pp. 92-104
    • Schwarz, D.1
  • 2
    • 85032751560 scopus 로고    scopus 로고
    • Music synthesis with reconstructive phrase modeling
    • Mar
    • E. Lindemann, "Music synthesis with reconstructive phrase modeling," IEEE Signal Processing Mag., vol. 24, no. 2, pp. 80-91, Mar. 2007.
    • (2007) IEEE Signal Processing Mag. , vol.24 , Issue.2 , pp. 80-91
    • Lindemann, E.1
  • 3
    • 84867657501 scopus 로고    scopus 로고
    • "Unisong: A choir singing synthesizer"
    • presented at the convention paper 6933, San Francisco, CA, Oct
    • J. Bonada, A. Loscos, and M. Blaauw, "Unisong: A choir singing synthesizer," presented at the 121st AES Conv., convention paper 6933, San Francisco, CA, Oct. 2006.
    • (2006) 121st AES Conv.
    • Bonada, J.1    Loscos, A.2    Blaauw, M.3
  • 5
    • 4243109590 scopus 로고    scopus 로고
    • "Singing voice modeling as we know it today"
    • M. Kob, "Singing voice modeling as we know it today," Acta Acust. united with Acustica, vol. 90, no. 4, pp. 649-661, 2004.
    • (2004) Acta Acust. United With Acustica , vol.90 , Issue.4 , pp. 649-661
    • Kob, M.1
  • 7
    • 3343019782 scopus 로고    scopus 로고
    • "Singing voice synthesis: History, current work, and future directions"
    • Fall
    • P.R. Cook, "Singing voice synthesis: History, current work, and future directions," Computer Music J., vol. 20, no. 3, pp. 38-46, Fall 1996.
    • (1996) Computer Music J. , vol.20 , Issue.3 , pp. 38-46
    • Cook, P.R.1
  • 9
    • 0002078743 scopus 로고
    • "Physical modeling using digital waveguides"
    • J.O. Smith, "Physical modeling using digital waveguides," Computer Music J., vol. 16, no. 4, pp. 74-87, 1992.
    • (1992) Computer Music J. , vol.16 , Issue.4 , pp. 74-87
    • Smith, J.O.1
  • 10
    • 0027556414 scopus 로고
    • "SPASM: A real-time vocal tract physical model editor/controller and singer; The Companion Software Synthesis System"
    • P. Cook, "SPASM: A real-time vocal tract physical model editor/ controller and singer; The Companion Software Synthesis System," Computer Music J., vol. 17, no. 1, pp. 30-44, 1992.
    • (1992) Computer Music J. , vol.17 , Issue.1 , pp. 30-44
    • Cook, P.1
  • 11
    • 85069807228 scopus 로고    scopus 로고
    • "Acoustical simulations of the human vocal tract using the 1D and 2D digital waveguide software model"
    • in Naples, Italy, Oct
    • J. Mullen, D.M. Howard, and D.T. Murphy, "Acoustical simulations of the human vocal tract using the 1D and 2D digital waveguide software model," in Proc. 4th Int. Conf. Digital Audio Effects, Naples, Italy, pp. 311-314, Oct. 2004.
    • (2004) Proc. 4th Int. Conf. Digital Audio Effects , pp. 311-314
    • Mullen, J.1    Howard, D.M.2    Murphy, D.T.3
  • 12
    • 34047204921 scopus 로고    scopus 로고
    • "Waveguide physical modeling of vocal tract acoustics: Flexible formant bandwidth control from increased model dimensionality"
    • May
    • J. Mullen, D.M. Howard, and D.T. Murphy, "Waveguide physical modeling of vocal tract acoustics: Flexible formant bandwidth control from increased model dimensionality," IEEE Trans. Audio, Speech, Language Processing, vol. 14, no. 3, pp. 964-971, May 2006.
    • (2006) IEEE Trans. Audio, Speech, Language Processing , vol.14 , Issue.3 , pp. 964-971
    • Mullen, J.1    Howard, D.M.2    Murphy, D.T.3
  • 13
    • 0031876670 scopus 로고    scopus 로고
    • "Vocal tract area functions from magnetic resonance imaging"
    • B.H. Story, I.R. Titze, and E.A. Hoffman, "Vocal tract area functions from magnetic resonance imaging," J. Acoust. Soc. Amer., vol. 104, no. 1, pp. 471-487, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.104 , Issue.1 , pp. 471-487
    • Story, B.H.1    Titze, I.R.2    Hoffman, E.A.3
  • 14
    • 4243070149 scopus 로고    scopus 로고
    • "Using imaging and modeling techniques to understand the relation between vocal tract shape to acoustic characteristics"
    • in SMAC-03
    • B.H. Story, "Using imaging and modeling techniques to understand the relation between vocal tract shape to acoustic characteristics," in Proc. Stockholm Music Acoustics Conf., 2003, SMAC-03, pp. 435-438.
    • (2003) Proc. Stockholm Music Acoustics Conf. , pp. 435-438
    • Story, B.H.1
  • 17
    • 0023407575 scopus 로고
    • "Review of text-to-speech conversion for English"
    • D.H. Klatt, "Review of text-to-speech conversion for English," J. Acoust. Soc. Amer., vol. 82, no. 3, pp. 737-793, 1987.
    • (1987) J. Acoust. Soc. Amer. , vol.82 , Issue.3 , pp. 737-793
    • Klatt, D.H.1
  • 18
    • 0141623687 scopus 로고
    • "Synthesis of the singing voice"
    • in M.V. Mathews and J.R. Pierce, Eds., Cambridge, MA: MIT Press
    • G. Bennett and X. Rodet, "Synthesis of the singing voice," in Current Directions in Computer Music Research, M.V. Mathews and J.R. Pierce, Eds., Cambridge, MA: MIT Press, pp. 19-44, 1989.
    • (1989) Current Directions in Computer Music Research , pp. 19-44
    • Bennett, G.1    Rodet, X.2
  • 19
    • 85032771753 scopus 로고    scopus 로고
    • "Spectral approach to the modeling of the singing voice"
    • presented at the convention paper 5452, New York, Sept
    • J. Bonada, A. Loscos, P. Cano, X. Serra, and H. Kenmochi, "Spectral approach to the modeling of the singing voice," presented at the 111th AES Conv., convention paper 5452, New York, Sept. 2001.
    • (2001) 111th AES Conv.
    • Bonada, J.1    Loscos, A.2    Cano, P.3    Serra, X.4    Kenmochi, H.5
  • 20
    • 0028016265 scopus 로고
    • "Measuring and modeling vocal source-tract interaction"
    • July
    • D.G. Childers, "Measuring and modeling vocal source-tract interaction," IEEE Trans. Biomed. Eng., vol. 41, no. 7, pp. 663-671, July 1994.
    • (1994) IEEE Trans. Biomed. Eng. , vol.41 , Issue.7 , pp. 663-671
    • Childers, D.G.1
  • 21
    • 0003837108 scopus 로고
    • "A system for sound analysis-transformation-synthesis based on a deterministic plus stochastic decomposition"
    • Ph.D. dissertation, CCRMA, Dept. Music, Stanford Univ., Stanford, CA
    • X. Serra, "A system for sound analysis-transformation-synthesis based on a deterministic plus stochastic decomposition," Ph.D. dissertation, CCRMA, Dept. Music, Stanford Univ., Stanford, CA, 1989.
    • (1989)
    • Serra, X.1
  • 22
    • 0033330037 scopus 로고    scopus 로고
    • "New phase-vocoder techniques for real-time pitch-shifting, chorusing, harmonizing, and other exotic audio effects"
    • Nov
    • J. Laroche and M. Dolson, "New phase-vocoder techniques for real-time pitch-shifting, chorusing, harmonizing, and other exotic audio effects," J. Audio Eng. Soc., vol. 47, no. 11, pp. 928-936, Nov. 1999.
    • (1999) J. Audio Eng. Soc. , vol.47 , Issue.11 , pp. 928-936
    • Laroche, J.1    Dolson, M.2
  • 24
    • 84856287160 scopus 로고    scopus 로고
    • "Frequency-domain techniques for high-quality voice modification"
    • in London, UK, Sept
    • J. Laroche, "Frequency-domain techniques for high-quality voice modification," in Proc. 6th Int. Conf. Digital Audio Effects, London, UK, Sept. 2003.
    • (2003) Proc. 6th Int. Conf. Digital Audio Effects
    • Laroche, J.1
  • 25
    • 84872690111 scopus 로고    scopus 로고
    • "Waveform preserving time stretching and pitch shifting for sinusoidal models of sound"
    • in Barcelona, Spain, Nov
    • R. DiFederico, "Waveform preserving time stretching and pitch shifting for sinusoidal models of sound," in Proc. 1st Int. Conf. Digital Audio Effects, Barcelona, Spain, Nov. 1998, pp. 44-48.
    • (1998) Proc. 1st Int. Conf. Digital Audio Effects , pp. 44-48
    • DiFederico, R.1
  • 26
    • 0029375490 scopus 로고
    • "Determination of instants of significant excitation in speech using group delay function"
    • Sept
    • R. Smits and B. Yegnanarayana, "Determination of instants of significant excitation in speech using group delay function," IEEE Trans. Speech Audio Processing, vol. 3, no. 5, pp. 325-333, Sept. 1995.
    • (1995) IEEE Trans. Speech Audio Processing , vol.3 , Issue.5 , pp. 325-333
    • Smits, R.1    Yegnanarayana, B.2
  • 27
    • 0032121729 scopus 로고    scopus 로고
    • "Extraction of vocal-tract system characteristics from speech signal"
    • July
    • B. Yegnanarayana and R. Veldhuis, "Extraction of vocal-tract system characteristics from speech signal," IEEE Trans. Speech Audio Processing, vol. 6, no. 4, pp. 313-327, July 1998.
    • (1998) IEEE Trans. Speech Audio Processing , vol.6 , Issue.4 , pp. 313-327
    • Yegnanarayana, B.1    Veldhuis, R.2
  • 28
    • 84872703197 scopus 로고    scopus 로고
    • "High quality voice transformations based on modeling radiated voice pulses in frequency domain"
    • in Naples, Italy, Oct
    • J. Bonada, "High quality voice transformations based on modeling radiated voice pulses in frequency domain," in Proc. 7th Int. Conf. Digital Audio Effects, Naples, Italy, Oct. 2004, pp. 291-295.
    • (2004) Proc. 7th Int. Conf. Digital Audio Effects , pp. 291-295
    • Bonada, J.1
  • 29
    • 85088454022 scopus 로고    scopus 로고
    • "Emulating rough and growl voice in spectral domain"
    • in Naples, Italy, Oct
    • A. Loscos and J. Bonada, "Emulating rough and growl voice in spectral domain," in Proc. 7th Int. Conf. Digital Audio Effects, Naples, Italy, Oct. 2004, pp. 49-52.
    • (2004) Proc. 7th Int. Conf. Digital Audio Effects , pp. 49-52
    • Loscos, A.1    Bonada, J.2
  • 30
    • 0011167986 scopus 로고    scopus 로고
    • "Physical modeling of the singing voice"
    • Ph.D. dissertation, Institute of Technical Acoustics, Aachen Univ., Aachen, Germany
    • M. Kob, "Physical modeling of the singing voice," Ph.D. dissertation, Institute of Technical Acoustics, Aachen Univ., Aachen, Germany, 2002.
    • (2002)
    • Kob, M.1
  • 31
    • 85138834244 scopus 로고    scopus 로고
    • "A new approach to transient processing in the phase vocoder"
    • in London, UK, Sept
    • A. Röbel, "A new approach to transient processing in the phase vocoder," in Proc. 6th Int. Conf. Digital Audio Effects, London, UK, Sept. 2003, pp. 344-349.
    • (2003) Proc. 6th Int. Conf. Digital Audio Effects , pp. 344-349
    • Röbel, A.1
  • 32
    • 0021494447 scopus 로고
    • "The CHANT Project: From the synthesis of the singing voice to synthesis in general"
    • X. Rodet, Y. Potard, and J.B.B. Barrire, "The CHANT Project: From the synthesis of the singing voice to synthesis in general," Computer Music J., vol. 8, no. 3, pp. 15-31, 1984.
    • (1984) Computer Music J. , vol.8 , Issue.3 , pp. 15-31
    • Rodet, X.1    Potard, Y.2    Barrire, J.B.B.3
  • 33
    • 0017981660 scopus 로고
    • "VOSIM - A new sound synthesis system"
    • W. Kaegi and S. Tempelaars, "VOSIM - A new sound synthesis system," J. Audio Eng. Soc., vol. 26, no. 6, pp. 418-425, 1978.
    • (1978) J. Audio Eng. Soc. , vol.26 , Issue.6 , pp. 418-425
    • Kaegi, W.1    Tempelaars, S.2
  • 34
    • 0002549801 scopus 로고    scopus 로고
    • "Emotional coloring of computer-controlled music performances"
    • R. Bresin and A. Friberg, "Emotional coloring of computer-controlled music performances," Computer Music J., vol. 24, no. 4, pp. 44-63, 2000.
    • (2000) Computer Music J. , vol.24 , Issue.4 , pp. 44-63
    • Bresin, R.1    Friberg, A.2
  • 35
    • 0003033637 scopus 로고
    • "Musical performance: A synthesisby-rule approach"
    • J. Sundberg, A. Askenfelt, and L. Fryd'n, "Musical performance: A synthesisby-rule approach," Computer Music J., vol. 7, no. 1, pp. 37-43, 1983.
    • (1983) Computer Music J. , vol.7 , Issue.1 , pp. 37-43
    • Sundberg, J.1    Askenfelt, A.2    Frydn, L.3
  • 36
    • 33646192105 scopus 로고    scopus 로고
    • "Computational models of expressive music performance: The state of the art"
    • G. Widmer and W. Goebl, "Computational models of expressive music performance: The state of the art," J. New Music Res., vol. 33, no. 3, pp. 203-216, 2004.
    • (2004) J. New Music Res. , vol.33 , Issue.3 , pp. 203-216
    • Widmer, G.1    Goebl, W.2
  • 37
    • 24744452130 scopus 로고    scopus 로고
    • "PWGLSynth: A visual synthesis language for virtual instrument design and control"
    • M. Laurson, V. Norilo, and M. Kuuskankare, "PWGLSynth: A visual synthesis language for virtual instrument design and control," Computer Music J., vol. 29, no. 3, pp. 29-41, 2005.
    • (2005) Computer Music J. , vol.29 , Issue.3 , pp. 29-41
    • Laurson, M.1    Norilo, V.2    Kuuskankare, M.3
  • 38
    • 84966350572 scopus 로고    scopus 로고
    • "Perfect synthesis for all of the people all of the time"
    • in Santa Monica, CA, Sept
    • A. Black, "Perfect synthesis for all of the people all of the time," in Proc. IEEE TTS Workshop 2002, Santa Monica, CA, Sept. 2002, pp. 167-170.
    • (2002) Proc. IEEE TTS Workshop 2002 , pp. 167-170
    • Black, A.1
  • 39
    • 0009647054 scopus 로고
    • "Spanish adaptation of SAMPA and automatic phonetic transcription"
    • ESPRIT Project 6819 (SAM-A Speech Technology Assessment in Multilingual Applications)
    • J. Llisterri and J.B. Mariño, "Spanish adaptation of SAMPA and automatic phonetic transcription," ESPRIT Project 6819 (SAM-A Speech Technology Assessment in Multilingual Applications), 1993.
    • (1993)
    • Llisterri, J.1    Mariño, J.B.2
  • 40
    • 84867649971 scopus 로고    scopus 로고
    • "Improvements to a sample-concatenation based singing voice synthesizer"
    • presented at the convention paper 6900, San Francisco, CA, Oct
    • J. Bonada, A. Loscos, and M. Blaauw, "Improvements to a sample-concatenation based singing voice synthesizer," presented at the 121st AES Conv., convention paper 6900, San Francisco, CA, Oct. 2006.
    • (2006) 121st AES Conv.
    • Bonada, J.1    Loscos, A.2    Blaauw, M.3
  • 41
    • 85032777677 scopus 로고    scopus 로고
    • "Syllable and tone boundaries in singing"
    • in Stockholm, Sweden, Aug
    • J. Ross and J. Sundberg, "Syllable and tone boundaries in singing," in Proc. 4th Pan European Voice Conf., Stockholm, Sweden, Aug. 2001.
    • (2001) Proc. 4th Pan European Voice Conf.
    • Ross, J.1    Sundberg, J.2
  • 42
    • 84872744184 scopus 로고    scopus 로고
    • "Transforming singing voice expression: The sweetness effect"
    • in Naples, Italy
    • L. Fabig and J. Janer, "Transforming singing voice expression: The sweetness effect," in Proc. 7th Int. Conf. Digital Audio Effects, Naples, Italy, 2004, pp. 70-75.
    • (2004) Proc. 7th Int. Conf. Digital Audio Effects , pp. 70-75
    • Fabig, L.1    Janer, J.2
  • 43
    • 34047262103 scopus 로고    scopus 로고
    • "Sample-based singing voice synthesizer by spectral concatenation"
    • in Stockholm, Sweden
    • J. Bonada, A. Loscos, and H. Kenmochi, "Sample-based singing voice synthesizer by spectral concatenation," in Proc. Stockholm Music Acoustics Conf., Stockholm, Sweden, 2003, pp. 439-442.
    • (2003) Proc. Stockholm Music Acoustics Conf. , pp. 439-442
    • Bonada, J.1    Loscos, A.2    Kenmochi, H.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.