메뉴 건너뛰기




Volumn 18, Issue 5, 2010, Pages 974-983

Emotion conversion based on prosodic unit selection

Author keywords

Emotional speech synthesis; Intonation; Prosody; Unit selection; Voice conversion

Indexed keywords

CONVERSION METHODS; CURRENT SYSTEM; EMOTION CONVERSION; EMOTIONAL SPEECH; EMOTIONAL SPEECH SYNTHESIS; ONE STEP; SPEAKING STYLES; SUBJECTIVE TESTS; UNIT SELECTION; VOICE CONVERSION;

EID: 77953699919     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2009.2038658     Document Type: Article
Times cited : (25)

References (40)
  • 1
    • 0031643805 scopus 로고    scopus 로고
    • Speaker transformation using sentence HMM based alignments and detailed prosody modification
    • L. M. Arslan and D. Talkin, "Speaker transformation using sentence HMM based alignments and detailed prosody modification," in Proc. ICASSP, 1998, vol.1, pp. 289-292.
    • (1998) Proc. ICASSP , vol.1 , pp. 289-292
    • Arslan, L.M.1    Talkin, D.2
  • 3
    • 85010815133 scopus 로고
    • Voice transformation using PSOLA technique
    • H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," in Speech Commun., 1992, vol.1, pp. 145-148.
    • (1992) Speech Commun , vol.1 , pp. 145-148
    • Valbret, H.1    Moulines, E.2    Tubach, J.P.3
  • 4
    • 85064715894 scopus 로고
    • Speech spectrum transformation by speaker interpolation
    • N. Iwahashi and Y. Sagisaka, "Speech spectrum transformation by speaker interpolation," in Proc. ICASSP, 1994, vol.1, pp. 461-464.
    • (1994) Proc. ICASSP , vol.1 , pp. 461-464
    • Iwahashi, N.1    Sagisaka, Y.2
  • 5
    • 0029254176 scopus 로고
    • Transformation of formants for voice conversion using artificial neural networks
    • M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks," Speech Commun., vol.16, no.2, pp. 207-216, 1995.
    • (1995) Speech Commun , vol.16 , Issue.2 , pp. 207-216
    • Narendranath, M.1    Murthy, H.A.2    Rajendran, S.3    Yegnanarayana, B.4
  • 6
    • 0032026483 scopus 로고    scopus 로고
    • Continuous probabilistic transform for voice conversion
    • Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," in Proc. ICASSP, 1998, vol. 6, no. 2, pp. 131-142.
    • (1998) Proc. ICASSP , vol.6 , Issue.2 , pp. 131-142
    • Stylianou, Y.1    Cappé, O.2    Moulines, E.3
  • 7
    • 4444285698 scopus 로고    scopus 로고
    • Ph.D. dissertation, OGI School of Sci. and Eng., Beaverton, OR
    • A. Kain, "High resolution voice transformation," Ph.D. dissertation, OGI School of Sci. and Eng., Beaverton, OR, 2001.
    • (2001) High Resolution Voice Transformation
    • Kain, A.1
  • 8
    • 33646779506 scopus 로고    scopus 로고
    • Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
    • T. Toda, A. W. Black, and K. Tokuda, "Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter," in Proc. ICASSP, 2005, vol.1, pp. 9-12.
    • (2005) Proc. ICASSP , vol.1 , pp. 9-12
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 9
    • 85068458327 scopus 로고    scopus 로고
    • Weighted frequency warping for voice conversion
    • D. Erro and A. Moreno, "Weighted frequency warping for voice conversion," in Proc. Interspeech, 2007, pp. 1965-1968.
    • (2007) Proc. Interspeech , pp. 1965-1968
    • Erro, D.1    Moreno, A.2
  • 10
    • 5444243681 scopus 로고    scopus 로고
    • Speaker specific pitch contour modeling and modification
    • D. T. Chappell and J. H. L. Hansen, "Speaker specific pitch contour modeling and modification," in Proc. ICASSP, 1998, vol.2, pp. 885-888.
    • (1998) Proc. ICASSP , vol.2 , pp. 885-888
    • Chappell, D.T.1    Hansen, J.H.L.2
  • 11
    • 85009179173 scopus 로고    scopus 로고
    • Voice conversion methods for vocal tract and pitch contour modification
    • O. Turk and L. M. Arslan, "Voice conversion methods for vocal tract and pitch contour modification," in Proc. Eurospeech, 2003, pp. 2845-2848.
    • (2003) Proc. Eurospeech , pp. 2845-2848
    • Turk, O.1    Arslan, L.M.2
  • 13
    • 5444259197 scopus 로고    scopus 로고
    • On the construction of a pitch conversion system
    • T. Ceyssens, W. Verhelst, and P. Wambacq, "On the construction of a pitch conversion system," in Proc. Eusipco, 2002, vol.1, pp. 423-426.
    • (2002) Proc. Eusipco , vol.1 , pp. 423-426
    • Ceyssens, T.1    Verhelst, W.2    Wambacq, P.3
  • 14
    • 85009212516 scopus 로고    scopus 로고
    • Transforming f0 contours
    • B. Gillett and S. King, "Transforming f0 contours," in Proc. Eurospeech, 2003, pp. 101-104.
    • (2003) Proc. Eurospeech , pp. 101-104
    • Gillett, B.1    King, S.2
  • 15
    • 4544361661 scopus 로고    scopus 로고
    • Voice conversion through transformation of spectral and intonation features
    • D. Rentzos, S. Vaseghi, Q. Yan, and C. H. Ho, "Voice conversion through transformation of spectral and intonation features," in Proc. ICASSP, 2004, vol.1, pp. 21-24.
    • (2004) Proc. ICASSP , vol.1 , pp. 21-24
    • Rentzos, D.1    Vaseghi, S.2    Yan, Q.3    Ho, C.H.4
  • 16
    • 84869508926 scopus 로고    scopus 로고
    • A voice conversion method based on joint pitch and spectral envelope transformation
    • T. En-Najjary, O. Rosec, and T. Chonavel, "A voice conversion method based on joint pitch and spectral envelope transformation," in Proc. ICSLP, 2004, pp. 1225-1228.
    • (2004) Proc. ICSLP , pp. 1225-1228
    • En-Najjary, T.1    Rosec, O.2    Chonavel, T.3
  • 17
    • 34547520011 scopus 로고    scopus 로고
    • A novel method for prosody prediction in voice conversion
    • E. E. Helander and J. Nurminen, "A novel method for prosody prediction in voice conversion," in Proc. ICASSP, 2007, vol.4, pp. 509-512.
    • (2007) Proc. ICASSP , vol.4 , pp. 509-512
    • Helander, E.E.1    Nurminen, J.2
  • 18
    • 77953726259 scopus 로고    scopus 로고
    • Pitch and duration transformation with non-parallel data
    • D. Lolive, N. Barbot, and O. Boeffard, "Pitch and duration transformation with non-parallel data," Speech Prosody, pp. 111-114, 2008.
    • (2008) Speech Prosody , pp. 111-114
    • Lolive, D.1    Barbot, N.2    Boeffard, O.3
  • 19
    • 34548216761 scopus 로고    scopus 로고
    • Conversion function clustering and selection using linguistic and spectral information for emotional voice conversion
    • Sep.
    • C. C. Hsia, C. H. Wu, and J. Q. Wu, "Conversion function clustering and selection using linguistic and spectral information for emotional voice conversion," IEEE Trans. Comput., vol.56, no.9, pp. 1245-1254, Sep. 2007.
    • (2007) IEEE Trans. Comput. , vol.56 , Issue.9 , pp. 1245-1254
    • Hsia, C.C.1    Wu, C.H.2    Wu, J.Q.3
  • 20
    • 34047263010 scopus 로고    scopus 로고
    • Prosody conversion from neutral speech to emotional speech
    • Jul.
    • J. Tao, Y. Kang, and A. Li, "Prosody conversion from neutral speech to emotional speech," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.4, pp. 1145-1154, Jul. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.4 , pp. 1145-1154
    • Tao, J.1    Kang, Y.2    Li, A.3
  • 21
    • 58149203393 scopus 로고    scopus 로고
    • Data-driven emotion conversion in spoken English
    • Z. Inanoglu and S. Young, "Data-driven emotion conversion in spoken English," Speech Commun., vol.51, no.3, pp. 268-283, 2009.
    • (2009) Speech Commun , vol.51 , Issue.3 , pp. 268-283
    • Inanoglu, Z.1    Young, S.2
  • 22
    • 84946736935 scopus 로고    scopus 로고
    • A unit selection approach to F0 modeling and its application to emphasis
    • A. Raux and A. Black, "A unit selection approach to F0 modeling and its application to emphasis," in Proc. ASRU, 2003, pp. 700-705.
    • (2003) Proc. ASRU , pp. 700-705
    • Raux, A.1    Black, A.2
  • 23
    • 0029765811 scopus 로고    scopus 로고
    • Unit selection in a concatenative speech synthesis system using a large speech database
    • A. Hunt and A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Proc. ICASSP, 1996, vol.1, pp. 373-376.
    • (1996) Proc. ICASSP , vol.1 , pp. 373-376
    • Hunt, A.1    Black, A.2
  • 24
    • 0003906146 scopus 로고
    • Analysis and synthesis of German F0 contours by means of Fujisaki's model
    • B. Möbius, M. Pätzold, and W. Hess, "Analysis and synthesis of German F0 contours by means of Fujisaki's model," Speech Commun., vol.13, pp. 53-61, 1993.
    • (1993) Speech Commun , vol.13 , pp. 53-61
    • Möbius, B.1    Pätzold, M.2    Hess, W.3
  • 27
    • 33947662015 scopus 로고    scopus 로고
    • Prosody generation for speech-to-speech translation
    • P. D. Agüero, J. Adell, and A. Bonafonte, "Prosody generation for speech-to-speech translation," in Proc. ICASSP, 2006, pp. 557-560.
    • (2006) Proc. ICASSP , pp. 557-560
    • Agüero, P.D.1    Adell, J.2    Bonafonte, A.3
  • 28
    • 33745372264 scopus 로고    scopus 로고
    • A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems
    • F. Campillo and E. R. Banga, "A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems," in Speech Commun., 2005, vol.48, pp. 941-956.
    • (2005) Speech Commun , vol.48 , pp. 941-956
    • Campillo, F.1    Banga, E.R.2
  • 29
    • 77953723246 scopus 로고    scopus 로고
    • Predicting segmental durations for Basque using CARTs
    • E. Navas, I. Hernáez, and J. Sánchez, "Predicting segmental durations for Basque using CARTs," in Proc. 15th ICPhS, 2003, pp. 2083-2086.
    • (2003) Proc. 15th ICPhS , pp. 2083-2086
    • Navas, E.1    Hernáez, I.2    Sánchez, J.3
  • 31
    • 84867216755 scopus 로고    scopus 로고
    • The linear transformation of LF glottal waveforms for voice conversion
    • A. del Pozo and S. Young, "The linear transformation of LF glottal waveforms for voice conversion," in Proc. Interspeech, 2008, pp. 1457-1460.
    • (2008) Proc. Interspeech , pp. 1457-1460
    • Del Pozo, A.1    Young, S.2
  • 33
    • 34047254509 scopus 로고    scopus 로고
    • Quality-enhanced voice morphing using maximum likelihood transformations
    • Jul.
    • H. Ye and S. Young, "Quality-enhanced voice morphing using maximum likelihood transformations," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.4, pp. 1301-1312, Jul. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.4 , pp. 1301-1312
    • Ye, H.1    Young, S.2
  • 34
    • 0029799113 scopus 로고    scopus 로고
    • Spectral balance as an acoustic correlate of linguistic stress
    • A. Sluyter and V. Van Heuven, "Spectral balance as an acoustic correlate of linguistic stress," J. Acoust. Soc. Amer., vol.100, pp. 2471-2485, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.100 , pp. 2471-2485
    • Sluyter, A.1    Van Heuven, V.2
  • 35
    • 33845952706 scopus 로고    scopus 로고
    • The spectrum of glottal flow models
    • B. Doval, C. d'Alessandro, and N. Henrich, "The spectrum of glottal flow models," Acta Acustica, vol.92, pp. 1026-1046, 2006.
    • (2006) Acta Acustica , vol.92 , pp. 1026-1046
    • Doval, B.1    D'Alessandro, C.2    Henrich, N.3
  • 37
    • 0027268967 scopus 로고
    • HNS: Speech modification based on a harmonic+noise model
    • J. Laroche, Y. Stylianou, and E. Moulines, "HNS: Speech modification based on a harmonic+noise model," in Proc. ICASSP, 1993, vol.2, pp. 550-553.
    • (1993) Proc. ICASSP , vol.2 , pp. 550-553
    • Laroche, J.1    Stylianou, Y.2    Moulines, E.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.