SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 18, Issue 5, 2010, Pages 974-983

Emotion conversion based on prosodic unit selection

(4) Erro, Daniel (a); Navas, Eva (a); Hernáez, Inma (a); Saratxaga, Ibon (a)

(a) UNIVERSITY OF THE BASQUE COUNTRY UPV EHU (Spain)

Author keywords

Emotional speech synthesis; Intonation; Prosody; Unit selection; Voice conversion

Indexed keywords

CONVERSION METHODS; CURRENT SYSTEM; EMOTION CONVERSION; EMOTIONAL SPEECH; EMOTIONAL SPEECH SYNTHESIS; ONE STEP; SPEAKING STYLES; SUBJECTIVE TESTS; UNIT SELECTION; VOICE CONVERSION;

SPEECH ANALYSIS; SPEECH SYNTHESIS; WAVELET TRANSFORMS;

SPEECH PROCESSING;

EID: 77953699919 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2009.2038658 Document Type: Article

Times cited : (25)

References (40)

1
- 0031643805
- Speaker transformation using sentence HMM based alignments and detailed prosody modification
- L. M. Arslan and D. Talkin, "Speaker transformation using sentence HMM based alignments and detailed prosody modification," in Proc. ICASSP, 1998, vol.1, pp. 289-292.
- (1998) Proc. ICASSP , vol.1 , pp. 289-292
- Arslan, L.M.¹ Talkin, D.²

2
- 0023739214
- Voice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," in Proc. ICASSP, 1988, pp. 655-658.
- (1988) Proc. ICASSP , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

3
- 85010815133
- Voice transformation using PSOLA technique
- H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," in Speech Commun., 1992, vol.1, pp. 145-148.
- (1992) Speech Commun , vol.1 , pp. 145-148
- Valbret, H.¹ Moulines, E.² Tubach, J.P.³

4
- 85064715894
- Speech spectrum transformation by speaker interpolation
- N. Iwahashi and Y. Sagisaka, "Speech spectrum transformation by speaker interpolation," in Proc. ICASSP, 1994, vol.1, pp. 461-464.
- (1994) Proc. ICASSP , vol.1 , pp. 461-464
- Iwahashi, N.¹ Sagisaka, Y.²

5
- 0029254176
- Transformation of formants for voice conversion using artificial neural networks
- M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks," Speech Commun., vol.16, no.2, pp. 207-216, 1995.
- (1995) Speech Commun , vol.16 , Issue.2 , pp. 207-216
- Narendranath, M.¹ Murthy, H.A.² Rajendran, S.³ Yegnanarayana, B.⁴

6
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp. 131-142, 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

7
- 4444285698
- Ph.D. dissertation, OGI School of Sci. and Eng., Beaverton, OR
- A. Kain, "High resolution voice transformation," Ph.D. dissertation, OGI School of Sci. and Eng., Beaverton, OR, 2001.
- (2001) High Resolution Voice Transformation
- Kain, A.¹

8
- 33646779506
- Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
- T. Toda, A. W. Black, and K. Tokuda, "Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter," in Proc. ICASSP, 2005, vol.1, pp. 9-12.
- (2005) Proc. ICASSP , vol.1 , pp. 9-12
- Toda, T.¹ Black, A.W.² Tokuda, K.³

9
- 85068458327
- Weighted frequency warping for voice conversion
- D. Erro and A. Moreno, "Weighted frequency warping for voice conversion," in Proc. Interspeech, 2007, pp. 1965-1968.
- (2007) Proc. Interspeech , pp. 1965-1968
- Erro, D.¹ Moreno, A.²

10
- 5444243681
- Speaker specific pitch contour modeling and modification
- D. T. Chappell and J. H. L. Hansen, "Speaker specific pitch contour modeling and modification," in Proc. ICASSP, 1998, vol.2, pp. 885-888.
- (1998) Proc. ICASSP , vol.2 , pp. 885-888
- Chappell, D.T.¹ Hansen, J.H.L.²

11
- 85009179173
- Voice conversion methods for vocal tract and pitch contour modification
- O. Turk and L. M. Arslan, "Voice conversion methods for vocal tract and pitch contour modification," in Proc. Eurospeech, 2003, pp. 2845-2848.
- (2003) Proc. Eurospeech , pp. 2845-2848
- Turk, O.¹ Arslan, L.M.²

12
- 33947693233
- M.S. thesis, Univ. of Cambridge, Cambridge, U.K.
- Z. Inanoglu, "Transforming pitch in a voice conversion framework," M.S. thesis, Univ. of Cambridge, Cambridge, U.K., 2003.
- (2003) Transforming Pitch in A Voice Conversion Framework
- Inanoglu, Z.¹

13
- 5444259197
- On the construction of a pitch conversion system
- T. Ceyssens, W. Verhelst, and P. Wambacq, "On the construction of a pitch conversion system," in Proc. Eusipco, 2002, vol.1, pp. 423-426.
- (2002) Proc. Eusipco , vol.1 , pp. 423-426
- Ceyssens, T.¹ Verhelst, W.² Wambacq, P.³

14
- 85009212516
- Transforming f0 contours
- B. Gillett and S. King, "Transforming f0 contours," in Proc. Eurospeech, 2003, pp. 101-104.
- (2003) Proc. Eurospeech , pp. 101-104
- Gillett, B.¹ King, S.²

15
- 4544361661
- Voice conversion through transformation of spectral and intonation features
- D. Rentzos, S. Vaseghi, Q. Yan, and C. H. Ho, "Voice conversion through transformation of spectral and intonation features," in Proc. ICASSP, 2004, vol.1, pp. 21-24.
- (2004) Proc. ICASSP , vol.1 , pp. 21-24
- Rentzos, D.¹ Vaseghi, S.² Yan, Q.³ Ho, C.H.⁴

16
- 84869508926
- A voice conversion method based on joint pitch and spectral envelope transformation
- T. En-Najjary, O. Rosec, and T. Chonavel, "A voice conversion method based on joint pitch and spectral envelope transformation," in Proc. ICSLP, 2004, pp. 1225-1228.
- (2004) Proc. ICSLP , pp. 1225-1228
- En-Najjary, T.¹ Rosec, O.² Chonavel, T.³

17
- 34547520011
- A novel method for prosody prediction in voice conversion
- E. E. Helander and J. Nurminen, "A novel method for prosody prediction in voice conversion," in Proc. ICASSP, 2007, vol.4, pp. 509-512.
- (2007) Proc. ICASSP , vol.4 , pp. 509-512
- Helander, E.E.¹ Nurminen, J.²

18
- 77953726259
- Pitch and duration transformation with non-parallel data
- D. Lolive, N. Barbot, and O. Boeffard, "Pitch and duration transformation with non-parallel data," Speech Prosody, pp. 111-114, 2008.
- (2008) Speech Prosody , pp. 111-114
- Lolive, D.¹ Barbot, N.² Boeffard, O.³

19
- 34548216761
- Conversion function clustering and selection using linguistic and spectral information for emotional voice conversion
- C. C. Hsia, C. H. Wu, and J. Q. Wu, "Conversion function clustering and selection using linguistic and spectral information for emotional voice conversion," IEEE Trans. Comput., vol.56, no.9, pp. 1245-1254, Sep. 2007.
- (2007) IEEE Trans. Comput. , vol.56 , Issue.9 , pp. 1245-1254
- Hsia, C.C.¹ Wu, C.H.² Wu, J.Q.³

20
- 34047263010
- Prosody conversion from neutral speech to emotional speech
- J. Tao, Y. Kang, and A. Li, "Prosody conversion from neutral speech to emotional speech," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.4, pp. 1145-1154, Jul. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.4 , pp. 1145-1154
- Tao, J.¹ Kang, Y.² Li, A.³

21
- 58149203393
- Data-driven emotion conversion in spoken English
- Z. Inanoglu and S. Young, "Data-driven emotion conversion in spoken English," Speech Commun., vol.51, no.3, pp. 268-283, 2009.
- (2009) Speech Commun , vol.51 , Issue.3 , pp. 268-283
- Inanoglu, Z.¹ Young, S.²

22
- 84946736935
- A unit selection approach to F0 modeling and its application to emphasis
- A. Raux and A. Black, "A unit selection approach to F0 modeling and its application to emphasis," in Proc. ASRU, 2003, pp. 700-705.
- (2003) Proc. ASRU , pp. 700-705
- Raux, A.¹ Black, A.²

23
- 0029765811
- Unit selection in a concatenative speech synthesis system using a large speech database
- A. Hunt and A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Proc. ICASSP, 1996, vol.1, pp. 373-376.
- (1996) Proc. ICASSP , vol.1 , pp. 373-376
- Hunt, A.¹ Black, A.²

24
- 0003906146
- Analysis and synthesis of German F0 contours by means of Fujisaki's model
- B. Möbius, M. Pätzold, and W. Hess, "Analysis and synthesis of German F0 contours by means of Fujisaki's model," Speech Commun., vol.13, pp. 53-61, 1993.
- (1993) Speech Commun , vol.13 , pp. 53-61
- Möbius, B.¹ Pätzold, M.² Hess, W.³

25
- 4544351403
- Ph.D. dissertation, Universidad de Valladolid, Valladolid, Spain
- D. Escudero, "Modelado estadístico de entonación con funciones de bézier: Aplicaciones a la conversión texto-voz en Español," Ph.D. dissertation, Universidad de Valladolid, Valladolid, Spain, 2002.
- (2002) Modelado Estadístico de Entonación Con Funciones de Bézier: Aplicaciones A la Conversión Texto-voz en Español
- Escudero, D.¹

26
- 77953722239
- Ph.D. dissertation, Univ. of the Basque Country, Bilbao, Spain
- E. Navas, "Standard Basque prosodic modeling for text to speech conversion," Ph.D. dissertation, Univ. of the Basque Country, Bilbao, Spain, 2003.
- (2003) Standard Basque Prosodic Modeling for Text to Speech Conversion
- Navas, E.¹

27
- 33947662015
- Prosody generation for speech-to-speech translation
- P. D. Agüero, J. Adell, and A. Bonafonte, "Prosody generation for speech-to-speech translation," in Proc. ICASSP, 2006, pp. 557-560.
- (2006) Proc. ICASSP , pp. 557-560
- Agüero, P.D.¹ Adell, J.² Bonafonte, A.³

28
- 33745372264
- A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems
- F. Campillo and E. R. Banga, "A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems," in Speech Commun., 2005, vol.48, pp. 941-956.
- (2005) Speech Commun , vol.48 , pp. 941-956
- Campillo, F.¹ Banga, E.R.²

29
- 77953723246
- Predicting segmental durations for Basque using CARTs
- E. Navas, I. Hernáez, and J. Sánchez, "Predicting segmental durations for Basque using CARTs," in Proc. 15th ICPhS, 2003, pp. 2083-2086.
- (2003) Proc. 15th ICPhS , pp. 2083-2086
- Navas, E.¹ Hernáez, I.² Sánchez, J.³

30
- 77949917061
- On the limitations of voice conversion techniques in emotion identification tasks
- R. Barra, J. M. Montero, J. Macías-Guarasa, J. Gutiérrez-Arriola, J. Ferreiros, and J. M. Pardo, "On the limitations of voice conversion techniques in emotion identification tasks," in Proc. Interspeech, 2007, pp. 2233-2236.
- (2007) Proc. Interspeech , pp. 2233-2236
- Barra, R.¹ Montero, J.M.² Macías-Guarasa, J.³ Gutiérrez-Arriola, J.⁴ Ferreiros, J.⁵ Pardo, J.M.⁶

31
- 84867216755
- The linear transformation of LF glottal waveforms for voice conversion
- A. del Pozo and S. Young, "The linear transformation of LF glottal waveforms for voice conversion," in Proc. Interspeech, 2008, pp. 1457-1460.
- (2008) Proc. Interspeech , pp. 1457-1460
- Del Pozo, A.¹ Young, S.²

32
- 33846972308
- Residual prediction
- D. Sündermann, H. Höge, A. Bonafonte, and H. Duxans, "Residual prediction," in Proc. 5th IEEE ISSPIT, 2005, pp. 512-516.
- (2005) Proc. 5th IEEE ISSPIT , pp. 512-516
- Sündermann, D.¹ Höge, H.² Bonafonte, A.³ Duxans, H.⁴

33
- 34047254509
- Quality-enhanced voice morphing using maximum likelihood transformations
- H. Ye and S. Young, "Quality-enhanced voice morphing using maximum likelihood transformations," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.4, pp. 1301-1312, Jul. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.4 , pp. 1301-1312
- Ye, H.¹ Young, S.²

34
- 0029799113
- Spectral balance as an acoustic correlate of linguistic stress
- A. Sluyter and V. Van Heuven, "Spectral balance as an acoustic correlate of linguistic stress," J. Acoust. Soc. Amer., vol.100, pp. 2471-2485, 1996.
- (1996) J. Acoust. Soc. Amer. , vol.100 , pp. 2471-2485
- Sluyter, A.¹ Van Heuven, V.²

35
- 33845952706
- The spectrum of glottal flow models
- B. Doval, C. d'Alessandro, and N. Henrich, "The spectrum of glottal flow models," Acta Acustica, vol.92, pp. 1026-1046, 2006.
- (2006) Acta Acustica , vol.92 , pp. 1026-1046
- Doval, B.¹ D'Alessandro, C.² Henrich, N.³

36
- 51449124416
- Flexible harmonic/stochastic speech synthesis
- D. Erro, A. Moreno, and A. Bonafonte, "Flexible harmonic/stochastic speech synthesis," in Proc. 6th ISCA Workshop Speech Synth., 2007, pp. 194-199.
- (2007) Proc. 6th ISCA Workshop Speech Synth. , pp. 194-199
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

37
- 0027268967
- HNS: Speech modification based on a harmonic+noise model
- J. Laroche, Y. Stylianou, and E. Moulines, "HNS: Speech modification based on a harmonic+noise model," in Proc. ICASSP, 1993, vol.2, pp. 550-553.
- (1993) Proc. ICASSP , vol.2 , pp. 550-553
- Laroche, J.¹ Stylianou, Y.² Moulines, E.³

38
- 34547503468
- Evaluation of pitch detection algorithms under real conditions
- I. Luengo, I. Saratxaga, E. Navas, I. Hernáez, J. Sanchez, and I. Sainz, "Evaluation of pitch detection algorithms under real conditions," in Proc. ICASSP, 2007, pp. 1057-1060.
- (2007) Proc. ICASSP , pp. 1057-1060
- Luengo, I.¹ Saratxaga, I.² Navas, E.³ Hernáez, I.⁴ Sanchez, J.⁵ Sainz, I.⁶

39
- 84885404320
- Subjective evaluation of an emotional speech database for Basque
- I. Sainz, I. Saratxaga, E. Navas, I. Hernáez, J. Sanchez, I. Luengo, and I. Odriozola, "Subjective evaluation of an emotional speech database for Basque," in Proc. 6th LREC, 2008, Paper 437.
- (2008) Proc. 6th LREC
- Sainz, I.¹ Saratxaga, I.² Navas, E.³ Hernáez, I.⁴ Sanchez, J.⁵ Luengo, I.⁶ Odriozola, I.⁷

40
- 77953720117
- The AHOLAB blizzard challenge 2008 entry
- I. Sainz, E. Navas, and I. Hernáez, "The AHOLAB blizzard challenge 2008 entry," in Proc. Blizzard Challenge Workshop, 2008.
- (2008) Proc. Blizzard Challenge Workshop
- Sainz, I.¹ Navas, E.² Hernáez, I.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.