SCOPUS Information Search Platform

Speech Communication

Volume 49, Issue 3, 2007, Pages 213-229

Applying data mining techniques to corpus based prosodic modeling

Authors (2): Escudero Mancebo, David (a); Cardeñoso Payo, Valentín (a)

(a) University of Valladolid (Spain)

Author keywords

Data mining; F0 Contours; Intonation modeling; Prosody; Text to speech

Indexed keywords

COMPUTER SIMULATION; PARAMETER ESTIMATION; PATTERN RECOGNITION; PROBLEM SOLVING; SPEECH SYNTHESIS;

INTONATION MODELING; INTONATION PATTERNS; PROSODIC FEATURES; VISUAL REPRESENTATION;

DATA MINING;

EID: 33947159552 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2007.01.008 Document Type: Article

Times cited: 17

References (51)

1
- 20444446355
- Conversational computers
- Aaron A., Eide E., and Pitrelli J.F. Conversational computers. Sci. Amer. (2005) 64-70
- (2005) Sci. Amer. , pp. 64-70
- Aaron, A.¹ Eide, E.² Pitrelli, J.F.³

2
- 33947112081
- Alarcos, E., 2002. Gramática de la Lengua Española. Real Academia Española.

3
- 0003724033
- Cambridge University Press
- Allen J., Hunnicutt M.S., and Klatt D. From Text to Speech: The MITalk System (1987), Cambridge University Press
- (1987) From Text to Speech: The MITalk System
- Allen, J.¹ Hunnicutt, M.S.² Klatt, D.³

4
- 0003494616
- Morgan Kaufmann Publishers, Inc.
- Bartels R.H., Beatty J.C., and Barsky B.A. An Introduction to Splines for Use in Computer Graphics and Geometric Modeling (1986), Morgan Kaufmann Publishers, Inc.
- (1986) An Introduction to Splines for Use in Computer Graphics and Geometric Modeling
- Bartels, R.H.¹ Beatty, J.C.² Barsky, B.A.³

5
- 33947137051
- Beckman, M.E., Campos, M.D., McGregory, J.T., Morgan, T.A., 2000. Intonation across Spanish, in the tones and break indices framework. Technical Report. Available from: , University of Ohio.

6
- 0035283534
- Developments and paradigms in intonation research
- Botinis A., Granstrom B., and Moebius B. Developments and paradigms in intonation research. Speech Commun. 33 (2001) 263-296
- (2001) Speech Commun. , vol.33 , pp. 263-296
- Botinis, A.¹ Granstrom, B.² Moebius, B.³

7
- 33947185459
- Bulyko, I., Ostendorf, M., Price, P., 1999. On the relative importance of different prosodic factors for improving speech synthesis. In: Proceedings of ICPhS'99. pp. 81-84.

8
- 9444260412
- What do people hear? a study of the perception of non-verbal affective information in conversational speech
- Campbell N., and Erickson D. What do people hear? a study of the perception of non-verbal affective information in conversational speech. J. Phonetic Soc. Jpn. 8 1 (2004) 9-28
- (2004) J. Phonetic Soc. Jpn. , vol.8 , Issue.1 , pp. 9-28
- Campbell, N.¹ Erickson, D.²

9
- 33745372264
- A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems
- Campillo Díaz F., and Rodríguez Banga E. A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems. Speech Commun. 48 8 (2006) 941-956
- (2006) Speech Commun. , vol.48 , Issue.8 , pp. 941-956
- Campillo Díaz, F.¹ Rodríguez Banga, E.²

10
- 4544224865
- Cardeñoso, V., Escudero, D., 2004. A strategy to solve data scarcity problems in corpus based intonation modelling. In: Proceedings of ICASSP 2004, vol. 1. pp. 665-668.

11
- 0029342671
- Automatic pitch contour stylization using a model of tonal perception
- d'Alessandro C., and Mertens P. Automatic pitch contour stylization using a model of tonal perception. Comput. Speech Lang. 9 (1995) 257-288
- (1995) Comput. Speech Lang. , vol.9 , pp. 257-288
- d'Alessandro, C.¹ Mertens, P.²

12
- 0141702290
- Eide, E., Aaron, A., Bakis, R., Cohen, P., Donovan, R., Hamza, W., Mathes, T., Picheny, M., Polkosky, M., Smith, M., Viswanathan, M., 2003. Recent improvements to the IBM trainable speech synthesis system. In: Proceedings of ICASSP 2003, vol. 1. pp. 708-711.

13
- 3242855988
- Prosodic processing in a text-to-speech synthesis system using a database and learning procedures
- Bailly C., Benoit C., and Sawallis T. (Eds), VCH, Elsevier Science Publishers
- Emerard F., Montamet L., and Cozannet A. Prosodic processing in a text-to-speech synthesis system using a database and learning procedures. In: Bailly C., Benoit C., and Sawallis T. (Eds). Talking Machines: Theories, Models, and Designs (1992), VCH, Elsevier Science Publishers 225-254
- (1992) Talking Machines: Theories, Models, and Designs , pp. 225-254
- Emerard, F.¹ Montamet, L.² Cozannet, A.³

14
- 33947095703
- Escudero, D., 2002. Modelado estadístico de entonación con funciones de bézier: Aplicaciones a la conversión texto voz. Ph.D. thesis, Dpto. de Informática, Universidad de Valladolid, España.

15
- 0036293858
- Escudero, D., Bonafonte, V.C.A., 2002. Corpus based extraction of quantitative prosodic parameters of stress groups in Spanish. In: Proceedings of ICASSP 2002, vol. 1. pp. 481-484.

16
- 85009174520
- Escudero, D., Cardeñoso, V., 2003. Experimental evaluation of the relevance of prosodic features in Spanish using machine learning techniques. In: Proceedings of EUROSPEECH-2003. pp. 2309-2312.

17
- 85009096921
- Escudero, D., Cardeñoso, V., 2004. A proposal to quantitatively select the right intonation unit in data-driven intonation modeling. In: Proceedings of INTERSPEECH-2004. pp. 745-748.

18
- 33745197106
- Escudero, D., Cardeñoso, V., 2005. Optimized selection of intonation dictionaries in corpus based intonation modelling. In: Proceedings of INTERSPEECH-2005. pp. 3261-3264.

19
- 85009241571
- Escudero, D., González, C., Cardeñoso, V., May 2002. Quantitative evaluation of relevant prosodic factors for text-to-speech synthesis in Spanish. In: Proceedings of ICSLP-2002. pp. 1165-1168.

20
- 33947100155
- Face, T., 2001. Intonation marking of contrastive focus in Madrid Spanish. Ph.D. thesis, The Ohio State University, Columbus OH, USA.

21
- 0004205287
- Cambridge University Press
- Farin G. Curves and Surfaces for CAGD. fourth ed. (1996), Cambridge University Press
- (1996) Curves and Surfaces for CAGD. fourth ed.
- Farin, G.¹

22
- 33947186976
- Ferrer, A., 2001. Sintesi de la Parla per Concatenació Basada en la Selecció. Ph.D. thesis, Dpto. de Teoría del Senyal i Comunicacions, Universidad Politécnica de Cataluña, España.

23
- 85011187169
- Analysis of voice fundamental frequency contours for declarative sentences of Japanese
- Fujisaki H., and Hirose K. Analysis of voice fundamental frequency contours for declarative sentences of Japanese. J. Acoust. Soc. Jpn. 5 4 (1984) 233-242
- (1984) J. Acoust. Soc. Jpn. , vol.5 , Issue.4 , pp. 233-242
- Fujisaki, H.¹ Hirose, K.²

24
- 33947126708
- Garrido, J.M., 1996. Modelling Spanish intonation for text-to-speech applications. Ph.D. thesis, Facultat de Lletres, Universitat de Barcelona, España.

25
- 85143189895
- Gutierrez, J.M., Montero, J.M., Saiz, D., Pardo, J.M., 2001. New rule-based and data-driven strategy to incorporate Fujisaki's F0 model to a text-to-speech system in Castillian Spanish. In: Proceedings of ICASSP 2001, vol. 2. pp. 821-824.

26
- 0003788784
- Cambridge University Press
- Hart J., Collier R., and Cohen A. A Perceptual Study of Intonation. An Experimental Approach to Speech Melody (1990), Cambridge University Press
- (1990) A Perceptual Study of Intonation. An Experimental Approach to Speech Melody
- Hart, J.¹ Collier, R.² Cohen, A.³

27
- 0031940566
- Measuring the perceptual similarity of pitch contours
- Hermes D.J. Measuring the perceptual similarity of pitch contours. J. Speech, Lang. Hear. Res. 41 (1994) 73-82
- (1994) J. Speech, Lang. Hear. Res. , vol.41 , pp. 73-82
- Hermes, D.J.¹

28
- 33947105227
- Holm, B., 2003. Sfc: Un modèle de superposition de contours multiparamétriques pour la génération automatique de la prosodie - apprentissage automatique et application à l'énonciation de formules mathématiques. Ph.D. thesis, Institut National Polytechnique de Grenoble, Grenoble, France.

29
- 84893405732
- Data clustering: a review
- Jain A., Murty M., and Flynn P.J. Data clustering: a review. ACM Comput. Surv. 31 3 (1999) 264-323
- (1999) ACM Comput. Surv. , vol.31 , Issue.3 , pp. 264-323
- Jain, A.¹ Murty, M.² Flynn, P.J.³

30
- 0037290439
- Prosody modeling with soft templates
- Kochanski G., and Shih C. Prosody modeling with soft templates. Speech Commun. 39 (2003) 311-352
- (2003) Speech Commun. , vol.39 , pp. 311-352
- Kochanski, G.¹ Shih, C.²

31
- 0035058410
- Tree-based modeling of intonation
- Lee S., and Oh Y.-H. Tree-based modeling of intonation. Comput. Speech Lang. 15 (2001) 75-98
- (2001) Comput. Speech Lang. , vol.15 , pp. 75-98
- Lee, S.¹ Oh, Y.-H.²

32
- 33947122343
- Lobanov, B.M., 1987. The phonemophon text-to-speech system. In: Proc. of the XI International Congress of Phonetic Sciences, pp. 61-64.

33
- 0030359784
- López, E., Rodríguez, J.M., 1996. Statistical methods in data-driven modeling of Spanish prosody for text to speech. In: Proceedings of ICSLP-96. pp. 1377-1380.

34
- 33947141143
- Navarro-Tomás, T., 1944. Manual de Entonación Española. Madrid, Guadarrama.

35
- 33947140666
- Peña, D., 1999. Estadística. Modelos y Métodos. Alianza, Madrid.

36
- 33947119857
- Pierrehumbert, J.B., 1980. The phonology and phonetics of English intonation. Ph.D. thesis, MIT.

37
- 0020778447
- Curve-fitting with piecewise parametric cubics
- Plass M., and Stone M. Curve-fitting with piecewise parametric cubics. Comput. Graph. (1983) 229-239
- (1983) Comput. Graph. , pp. 229-239
- Plass, M.¹ Stone, M.²

38
- 33947168467
- Quilis, A., 1993. Tratado de Fonología y Fonética. Editorial Gredos.

39
- 33646821329
- Sakai, S., 2005. Additive modeling of English F0 contours for speech synthesis. In: Proceedings of ICASSP 2005. vol. 1. pp. 277-280.

40
- 0037513422
- Data-driven generation of F0 contours using a superpositional model
- Sakurai A., Hirose K., and Minematsu N. Data-driven generation of F0 contours using a superpositional model. Speech Commun. 40 (2003) 535-549
- (2003) Speech Commun. , vol.40 , pp. 535-549
- Sakurai, A.¹ Hirose, K.² Minematsu, N.³

41
- 0001569732
- A qualitative model of F0 generation and alignment
- Kluwer Academic Publisher
- Santen J.P.H.V., and Möbius B. A qualitative model of F0 generation and alignment. Intonation: Analysis, Modelling and Technology (2000), Kluwer Academic Publisher 269-288
- (2000) Intonation: Analysis, Modelling and Technology , pp. 269-288
- Santen, J.P.H.V.¹ Möbius, B.²

42
- 33947173407
- Silverman, K., Beckman, M., Pitrelli, J., Ostendorf, M., Wightman, C., Price, P., Pierrehumbert, J., Hirschberg, J., 1992. ToBI: A standard for labelling English prosody. In: Proceedings of ICSLP-1992. pp. 867-870.

43
- 33947159872
- Sosa, J.M., 1999. La Entonación del Español. Cátedra.

44
- 0003314260
- An approach to text-to-speech synthesis
- Elsevier, Amsterdam (Chapter 17)
- Sproat R., and Olive J. An approach to text-to-speech synthesis. Speech Coding and Synthesis (1995), Elsevier, Amsterdam 611-633 (Chapter 17)
- (1995) Speech Coding and Synthesis , pp. 611-633
- Sproat, R.¹ Olive, J.²

45
- 0034008810
- Analysis and synthesis of intonation using the Tilt model
- Taylor P. Analysis and synthesis of intonation using the Tilt model. J. Acoust. Soc. Amer. 107 3 (2000) 1697-1714
- (2000) J. Acoust. Soc. Amer. , vol.107 , Issue.3 , pp. 1697-1714
- Taylor, P.¹

46
- 0028529843
- The Rise/Fall/Connection model of intonation
- Taylor P., and Black A. The Rise/Fall/Connection model of intonation. Speech Commun. 15 (1995) 169-186
- (1995) Speech Commun. , vol.15 , pp. 169-186
- Taylor, P.¹ Black, A.²

47
- 0033708106
- Tokuda, K., Yoshimura, T., Masuko, T., Kobayashi, T., Kitamura, T., 2000. Speech parameter generation algorithms for HMM-based speech synthesis. In: Proceedings of ICASSP 2000, vol. 3. pp. 1315-1318.

48
- 0001957999
- F0 generation with a database of natural F0 patterns and with a NN
- Bailly C., Benoit C., and Sawallis T. (Eds), Elsevier Science Publishers
- Traber C. F0 generation with a database of natural F0 patterns and with a NN. In: Bailly C., Benoit C., and Sawallis T. (Eds). Talking Machines: Theories, Models, and Designs (1992), Elsevier Science Publishers 287-304
- (1992) Talking Machines: Theories, Models, and Designs , pp. 287-304
- Traber, C.¹

49
- 33947153485
- Vallejo, J.A., 1998. Mejora de la frecuencia fundamental en la conversión de texto a voz. Ph.D. thesis, E.T.S.I de Telecomunicaciones, Universidad Politécnica de Madrid, España.

50
- 0032296808
- A stochastic model of intonation for text-to-speech synthesis
- Veronis J., Di Cristo P., Courtois F., and Chaumette C. A stochastic model of intonation for text-to-speech synthesis. Speech Commun. 26 4 (1998) 233-244
- (1998) Speech Commun. , vol.26 , Issue.4 , pp. 233-244
- Veronis, J.¹ Di Cristo, P.² Courtois, F.³ Chaumette, C.⁴

51
- 0038797944
- Wiley
- Webb A. Statistical Pattern Recognition. second ed. (2002), Wiley
- (2002) Statistical Pattern Recognition. second ed.
- Webb, A.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.