



Volume 49, Issue 3, 2007, Pages 213-229

Applying data mining techniques to corpus based prosodic modeling

Author keywords

Data mining; F0 Contours; Intonation modeling; Prosody; Text to speech

Indexed keywords

COMPUTER SIMULATION; PARAMETER ESTIMATION; PATTERN RECOGNITION; PROBLEM SOLVING; SPEECH SYNTHESIS

EID: 33947159552     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2007.01.008     Document Type: Article
Times cited: 17

References (51)
  • 2
    • 33947112081
    • Alarcos, E., 2002. Gramática de la Lengua Española. Real Academia Española.
  • 5
    • 33947137051
    • Beckman, M.E., Campos, M.D., McGregory, J.T., Morgan, T.A., 2000. Intonation across Spanish, in the tones and break indices framework. Technical Report. Available from: , University of Ohio.
  • 6
    • 0035283534
    • Developments and paradigms in intonation research
    • Botinis A., Granstrom B., and Moebius B. Developments and paradigms in intonation research. Speech Commun. 33 (2001) 263-296
    • (2001) Speech Commun. , vol.33 , pp. 263-296
    • Botinis, A.1    Granstrom, B.2    Moebius, B.3
  • 7
    • 33947185459
    • Bulyko, I., Ostendorf, M., Price, P., 1999. On the relative importance of different prosodic factors for improving speech synthesis. In: Proceedings of ICPhS'99. pp. 81-84.
  • 8
    • 9444260412
    • What do people hear? a study of the perception of non-verbal affective information in conversational speech
    • Campbell N., and Erickson D. What do people hear? a study of the perception of non-verbal affective information in conversational speech. J. Phonetic Soc. Jpn. 8 1 (2004) 9-28
    • (2004) J. Phonetic Soc. Jpn. , vol.8 , Issue.1 , pp. 9-28
    • Campbell, N.1    Erickson, D.2
  • 9
    • 33745372264
    • A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems
    • Campillo Díaz F., and Rodríguez Banga E. A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems. Speech Commun. 48 8 (2006) 941-956
    • (2006) Speech Commun. , vol.48 , Issue.8 , pp. 941-956
    • Campillo Díaz, F.1    Rodríguez Banga, E.2
  • 10
    • 4544224865
    • Cardeñoso, V., Escudero, D., 2004. A strategy to solve data scarcity problems in corpus based intonation modelling. In: Proceedings of ICASSP 2004, vol. 1. pp. 665-668.
  • 11
    • 0029342671
    • Automatic pitch contour stylization using a model of tonal perception
    • d'Alessandro C., and Mertens P. Automatic pitch contour stylization using a model of tonal perception. Comput. Speech Lang. 9 (1995) 257-288
    • (1995) Comput. Speech Lang. , vol.9 , pp. 257-288
    • d'Alessandro, C.1    Mertens, P.2
  • 12
    • 0141702290
    • Eide, E., Aaron, A., Bakis, R., Cohen, P., Donovan, R., Hamza, W., Mathes, T., Picheny, M., Polkosky, M., Smith, M., Viswanathan, M., 2003. Recent improvements to the IBM trainable speech synthesis system. In: Proceedings of ICASSP 2003, vol. 1. pp. 708-711.
  • 13
    • 3242855988
    • Prosodic processing in a text-to-speech synthesis system using a database and learning procedures
    • Bailly C., Benoit C., and Sawallis T. (Eds), VCH, Elsevier Science Publishers
    • Emerard F., Montamet L., and Cozannet A. Prosodic processing in a text-to-speech synthesis system using a database and learning procedures. In: Bailly C., Benoit C., and Sawallis T. (Eds). Talking Machines: Theories, Models, and Designs (1992), VCH, Elsevier Science Publishers 225-254
    • (1992) Talking Machines: Theories, Models, and Designs , pp. 225-254
    • Emerard, F.1    Montamet, L.2    Cozannet, A.3
  • 14
    • 33947095703
    • Escudero, D., 2002. Modelado estadístico de entonación con funciones de bézier: Aplicaciones a la conversión texto voz. Ph.D. thesis, Dpto. de Informática, Universidad de Valladolid, España.
  • 15
    • 0036293858
    • Escudero, D., Bonafonte, V.C.A., 2002. Corpus based extraction of quantitative prosodic parameters of stress groups in Spanish. In: Proceedings of ICASSP 2002, vol. 1. pp. 481-484.
  • 16
    • 85009174520
    • Escudero, D., Cardeñoso, V., 2003. Experimental evaluation of the relevance of prosodic features in Spanish using machine learning techniques. In: Proceedings of EUROSPEECH-2003. pp. 2309-2312.
  • 17
    • 85009096921
    • Escudero, D., Cardeñoso, V., 2004. A proposal to quantitatively select the right intonation unit in data-driven intonation modeling. In: Proceedings of INTERSPEECH-2004. pp. 745-748.
  • 18
    • 33745197106
    • Escudero, D., Cardeñoso, V., 2005. Optimized selection of intonation dictionaries in corpus based intonation modelling. In: Proceedings of INTERSPEECH-2005. pp. 3261-3264.
  • 19
    • 85009241571
    • Escudero, D., González, C., Cardeñoso, V., Mayo 2002. Quantitative evaluation of relevant prosodic factors for text-to-speech synthesis in Spanish. In: Proceedings of ICSLP-2002. pp. 1165-1168.
  • 20
    • 33947100155
    • Face, T., 2001. Intonation marking of contrastive focus in Madrid Spanish. Ph.D. thesis, The Ohio State University, Columbus OH, USA.
  • 22
    • 33947186976
    • Ferrer, A., 2001. Sintesi de la Parla per Concatenació Basada en la Selecció. Ph.D. thesis, Dpto. de Teoría del Senyal i Comunicacions, Universidad Politécnica de Cataluña, España.
  • 23
    • 85011187169
    • Analysis of voice fundamental frequency contours for declarative sentences of Japanese
    • Fujisaki H., and Hirose K. Analysis of voice fundamental frequency contours for declarative sentences of Japanese. J. Acoust. Soc. Jpn. 5 4 (1984) 233-242
    • (1984) J. Acoust. Soc. Jpn. , vol.5 , Issue.4 , pp. 233-242
    • Fujisaki, H.1    Hirose, K.2
  • 24
    • 33947126708
    • Garrido, J.M., 1996. Modelling Spanish intonation for text-to-speech applications. Ph.D. thesis, Facultat de Lletres, Universitat de Barcelona, España.
  • 25
    • 85143189895
    • Gutierrez, J.M., Montero, J.M., Saiz, D., Pardo, J.M., 2001. New rule-based and data-driven strategy to incorporate Fujisaki's F0 model to a text-to-speech system in Castillian Spanish. In: Proceedings of ICASSP 2001, vol. 2. pp. 821-824.
  • 27
    • 0031940566
    • Measuring the perceptual similarity of pitch contours
    • Hermes D.J. Measuring the perceptual similarity of pitch contours. J. Speech, Lang. Hear. Res. 41 (1994) 73-82
    • (1994) J. Speech, Lang. Hear. Res. , vol.41 , pp. 73-82
    • Hermes, D.J.1
  • 28
    • 33947105227
    • Holm, B., 2003. Sfc: Un modèle de superposition de contours multiparamétriques pour la génération automatique de la prosodie - apprentissage automatique et application à l'énonciation de formules mathématiques. Ph.D. thesis, Institut National Polytechnique de Grenoble, Grenoble, France.
  • 30
    • 0037290439
    • Prosody modeling with soft templates
    • Kochanski G., and Shih C. Prosody modeling with soft templates. Speech Comm. 39 (2003) 311-352
    • (2003) Speech Comm. , vol.39 , pp. 311-352
    • Kochanski, G.1    Shih, C.2
  • 31
    • 0035058410
    • Tree-based modeling of intonation
    • Lee S., and Oh Y.-H. Tree-based modeling of intonation. Comput. Speech Lang. 15 (2001) 75-98
    • (2001) Comput. Speech Lang. , vol.15 , pp. 75-98
    • Lee, S.1    Oh, Y.-H.2
  • 32
    • 33947122343
    • Lobanov, B.M., 1987. The phonemophon text-to-speech system. In: Proc. of the XI International Congress of Phonetic Sciences, pp. 61-64.
  • 33
    • 0030359784
    • López, E., Rodríguez, J.M., 1996. Statistical methods in data-driven modeling of Spanish prosody for text to speech. In: Proceedings of ICSLP-96. pp. 1377-1380.
  • 34
    • 33947141143
    • Navarro-Tomás, T., 1944. Manual de Entonación Española. Madrid, Guadarrama.
  • 35
    • 33947140666
    • Peña, D., 1999. Estadística. Modelos y Métodos. Alianza, Madrid.
  • 36
    • 33947119857
    • Pierrehumbert, J.B., 1980. The phonology and phonetics of English intonation. Ph.D. thesis, MIT.
  • 37
    • 0020778447
    • Curve-fitting with piecewise parametric cubics
    • Plass M., and Stone M. Curve-fitting with piecewise parametric cubics. Comput. Graph. (1983) 229-239
    • (1983) Comput. Graph. , pp. 229-239
    • Plass, M.1    Stone, M.2
  • 38
    • 33947168467
    • Quilis, A., 1993. Tratado de Fonología y Fonética. Editorial Gredos.
  • 39
    • 33646821329
    • Sakai, S., 2005. Additive modeling of English F0 contours for speech synthesis. In: Proceedings of ICASSP 2005. vol. 1. pp. 277-280.
  • 40
    • 0037513422
    • Data-driven generation of F0 contours using a superpositional model
    • Sakurai A., Hirose K., and Minematsu N. Data-driven generation of F0 contours using a superpositional model. Speech Commun. 40 (2003) 535-549
    • (2003) Speech Commun. , vol.40 , pp. 535-549
    • Sakurai, A.1    Hirose, K.2    Minematsu, N.3
  • 42
    • 33947173407
    • Silverman, K., Beckman, M., Pitrelli, J., Ostendorf, M., Wightman, C., Price, P., Pierrehumbert, J., Hirschberg, J., 1992. ToBI: A standard for labelling English prosody. In: Proceedings of ICSLP-1992. pp. 867-870.
  • 43
    • 33947159872
    • Sosa, J.M., 1999. La Entonación del Español. Cátedra.
  • 44
    • 0003314260
    • An approach to text-to-speech synthesis
    • Elsevier, Amsterdam (Chapter 17)
    • Sproat R., and Olive J. An approach to text-to-speech synthesis. Speech Coding and Synthesis (1995), Elsevier, Amsterdam 611-633 (Chapter 17)
    • (1995) Speech Coding and Synthesis , pp. 611-633
    • Sproat, R.1    Olive, J.2
  • 45
    • 0034008810
    • Analysis and synthesis of intonation using the Tilt model
    • Taylor P. Analysis and synthesis of intonation using the Tilt model. J. Acoust. Soc. Amer. 107 3 (2000) 1697-1714
    • (2000) J. Acoust. Soc. Amer. , vol.107 , Issue.3 , pp. 1697-1714
    • Taylor, P.1
  • 46
    • 0028529843
    • The Rise/Fall/Connection model of intonation
    • Taylor P., and Black A. The Rise/Fall/Connection model of intonation. Speech Comm. 15 (1995) 169-186
    • (1995) Speech Comm. , vol.15 , pp. 169-186
    • Taylor, P.1    Black, A.2
  • 47
    • 0033708106
    • Tokuda, K., Yoshimura, T., Masuko, T., Kobayashi, T., Kitamura, T., 2000. Speech parameter generation algorithms for HMM-based speech synthesis. In: Proceedings of ICASSP 2000, vol. 3. pp. 1315-1318.
  • 48
    • 0001957999
    • F0 generation with a database of natural F0 patterns and with a NN
    • Bailly C., Benoit C., and Sawallis T. (Eds), Elsevier Science Publishers
    • Traber C. F0 generation with a database of natural F0 patterns and with a NN. In: Bailly C., Benoit C., and Sawallis T. (Eds). Talking Machines: Theories, Models, and Designs (1992), Elsevier Science Publishers 287-304
    • (1992) Talking Machines: Theories, Models, and Designs , pp. 287-304
    • Traber, C.1
  • 49
    • 33947153485
    • Vallejo, J.A., 1998. Mejora de la frecuencia fundamental en la conversión de texto a voz. Ph.D. thesis, E.T.S.I de Telecomunicaciones, Universidad Politécnica de Madrid, España.
  • 50
    • 0032296808
    • A stochastic model of intonation for text-to-speech synthesis
    • Veronis J., Di Cristo P., Courtois F., and Chaumette C. A stochastic model of intonation for text-to-speech synthesis. Speech Comm. 26 4 (1998) 233-244
    • (1998) Speech Comm. , vol.26 , Issue.4 , pp. 233-244
    • Veronis, J.1    Di Cristo, P.2    Courtois, F.3    Chaumette, C.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.