메뉴 건너뛰기




Volumn 46, Issue 3-4, 2005, Pages 348-364

SFC: A trainable prosodic model

Author keywords

Automatic generation of prosody; Intonation; Prosodic modelling

Indexed keywords

AUTOMATION; CONSTRAINT THEORY; DATA PROCESSING; DECODING; LINGUISTICS; PARAMETER ESTIMATION; SPEECH PROCESSING;

EID: 21844440585     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2005.04.008     Document Type: Conference Paper
Times cited : (67)

References (64)
  • 2
    • 0037795514 scopus 로고
    • Developing a structured lexicon for synthesis of prosody
    • G. Bailly C. Benoît Elsevier
    • V. Aubergé Developing a structured lexicon for synthesis of prosody G. Bailly C. Benoît Talking Machines: Theories, Models and Designs 1992 Elsevier 307 321
    • (1992) Talking Machines: Theories, Models and Designs , pp. 307-321
    • Aubergé, V.1
  • 3
    • 21844448276 scopus 로고
    • Prosody modeling with a dynamic lexicon of intonative forms: Application for text-to-speech synthesis
    • Aubergé, V., 1993. Prosody modeling with a dynamic lexicon of intonative forms: Application for text-to-speech synthesis. Working Papers of Lund University, Vol. 41, pp. 62-66.
    • (1993) Working Papers of Lund University , vol.41 , pp. 62-66
    • Aubergé, V.1
  • 4
    • 84930562270 scopus 로고
    • A computational grammar of discourse-neutral prosodic phrasing in English
    • J. Bachenko, and E. Fitzpatrick A computational grammar of discourse-neutral prosodic phrasing in English Comput. Linguist. 16 1990 155 167
    • (1990) Comput. Linguist. , vol.16 , pp. 155-167
    • Bachenko, J.1    Fitzpatrick, E.2
  • 5
    • 0024683803 scopus 로고
    • Integration of rhythmic and syntactic constraints in a model of generation of French prosody
    • G. Bailly Integration of rhythmic and syntactic constraints in a model of generation of French prosody Speech Comm. 8 1989 137 146
    • (1989) Speech Comm. , vol.8 , pp. 137-146
    • Bailly, G.1
  • 7
    • 84937386831 scopus 로고    scopus 로고
    • Learning the hidden structure of speech: From communicative functions to prosody
    • G. Bailly, and B. Holm Learning the hidden structure of speech: From communicative functions to prosody Cadernos Estudos Linguist. 43 2002 37 54
    • (2002) Cadernos Estudos Linguist. , vol.43 , pp. 37-54
    • Bailly, G.1    Holm, B.2
  • 8
    • 7744233509 scopus 로고    scopus 로고
    • From shallow to deep parsing using constraint satisfaction
    • Taipei, Taiwan
    • Balfourier, J.-M., Blache, P., van Rullen, T., 2002. From shallow to deep parsing using constraint satisfaction. In: Coling, Taipei, Taiwan, pp. 36-42.
    • (2002) Coling , pp. 36-42
    • Balfourier, J.-M.1    Blache, P.2    Van Rullen, T.3
  • 9
    • 0028531866 scopus 로고
    • Characterisation of rhythmic patterns for text-to-speech synthesis
    • P. Barbosa, and G. Bailly Characterisation of rhythmic patterns for text-to-speech synthesis Speech Comm. 15 1994 127 137
    • (1994) Speech Comm. , vol.15 , pp. 127-137
    • Barbosa, P.1    Bailly, G.2
  • 10
    • 0003762887 scopus 로고    scopus 로고
    • Generation of pauses within the z-score model
    • J.P.H.V. Santen R.W. Sproat J.P. Olive J. Hirschberg Springer-Verlag New York
    • P. Barbosa, and G. Bailly Generation of pauses within the z-score model J.P.H.V. Santen R.W. Sproat J.P. Olive J. Hirschberg Progress in Speech Synthesis 1997 Springer-Verlag New York 365 381
    • (1997) Progress in Speech Synthesis , pp. 365-381
    • Barbosa, P.1    Bailly, G.2
  • 11
    • 0023404428 scopus 로고
    • A model of segmental duration for speech synthesis in French
    • K. Bartkova, and C. Sorin A model of segmental duration for speech synthesis in French Speech Comm. 6 1987 245 260
    • (1987) Speech Comm. , vol.6 , pp. 245-260
    • Bartkova, K.1    Sorin, C.2
  • 13
    • 0003603976 scopus 로고    scopus 로고
    • Praat, a system for doing phonetics by computer, version 3.4
    • Institute of Phonetic Sciences of the
    • Boersma, P., Weenink, D., 1996. Praat, a System for doing Phonetics by Computer, version 3.4. Institute of Phonetic Sciences of the University of Amsterdam, Report 132, 182 p.
    • (1996) University of Amsterdam, Report , vol.132 , pp. 182
    • Boersma, P.1    Weenink, D.2
  • 15
    • 85050250463 scopus 로고    scopus 로고
    • La prosodie de la focalisation en français: Faits perceptifs et morphogénétiques
    • Nancy-France
    • Brichet, C., Aubergé, V., 2004. La prosodie de la focalisation en français: faits perceptifs et morphogénétiques. In: Journées d'Etudes sur la Parole, Nancy-France.
    • (2004) Journées d'Etudes sur la Parole
    • Brichet, C.1    Aubergé, V.2
  • 17
    • 0003602965 scopus 로고
    • PhD Thesis, University of Sussex, Brighton, UK
    • Campbell, N., 1992. Multi-level timing in speech. PhD Thesis, University of Sussex, Brighton, UK, 300 p.
    • (1992) Multi-level Timing in Speech , pp. 300
    • Campbell, N.1
  • 19
    • 0004826167 scopus 로고
    • Prosody in situations of communication: Salience and segmentation
    • Aix-en-Provence, France
    • Cutler, A., Norris, D., 1991. Prosody in situations of communication: Salience and segmentation. In: Proc. Internat. Congress of Phonetic Sciences, Aix-en-Provence, France, pp. 264-270.
    • (1991) Proc. Internat. Congress of Phonetic Sciences , pp. 264-270
    • Cutler, A.1    Norris, D.2
  • 21
    • 84860832249 scopus 로고    scopus 로고
    • Using decision trees within the tilt intonation model to predict F0 contours
    • Budapest, Hungary
    • Dusterhoff, K.E., Black, A.W., Taylor, P., 1999. Using decision trees within the tilt intonation model to predict F0 contours. In: EuroSpeech, Budapest, Hungary, pp. 1627-1630.
    • (1999) EuroSpeech , pp. 1627-1630
    • Dusterhoff, K.E.1    Black, A.W.2    Taylor, P.3
  • 24
    • 0343879056 scopus 로고
    • A generative model for the prosody of connected speech in Japanese
    • Fujisaki, H., Sudo, H., 1971. A generative model for the prosody of connected speech in Japanese. Annual Report of Engineering Research Institute, Vol. 30, pp. 75-80.
    • (1971) Annual Report of Engineering Research Institute , vol.30 , pp. 75-80
    • Fujisaki, H.1    Sudo, H.2
  • 26
    • 0001200928 scopus 로고
    • Performance structures: A psycholinguistic and linguistic appraisal
    • J.-P. Gee, and F. Grosjean Performance structures: A psycholinguistic and linguistic appraisal Cognitive Psychol. 15 1983 411 458
    • (1983) Cognitive Psychol. , vol.15 , pp. 411-458
    • Gee, J.-P.1    Grosjean, F.2
  • 27
    • 0038957802 scopus 로고    scopus 로고
    • Discreteness and gradience in intonational contrasts
    • C. Gussenhoven Discreteness and gradience in intonational contrasts Lang. Speech 42 1999 283 305
    • (1999) Lang. Speech , vol.42 , pp. 283-305
    • Gussenhoven, C.1
  • 28
    • 33646648519 scopus 로고    scopus 로고
    • The phonology and phonetics of speech prosody: Between acoustics and interpretation
    • Nara, Japan
    • Hirst, D.J., 2003. The phonology and phonetics of speech prosody: Between acoustics and interpretation. In: Internat. Conf. on Speech Prosody, Nara, Japan, pp. 163-169.
    • (2003) Internat. Conf. on Speech Prosody , pp. 163-169
    • Hirst, D.J.1
  • 29
    • 0004611134 scopus 로고
    • Coding the F0 of a continuous text in French: An experimental approach
    • Aix-en-Provence, France
    • Hirst, D., Nicolas, P., Espesser, R., 1991. Coding the F0 of a continuous text in French: An experimental approach. In: Proc. Internat. Congress of Phonetic Sciences, Aix-en-Provence, France, pp. 234-237.
    • (1991) Proc. Internat. Congress of Phonetic Sciences , pp. 234-237
    • Hirst, D.1    Nicolas, P.2    Espesser, R.3
  • 30
    • 0142241561 scopus 로고    scopus 로고
    • Levels of representation and levels of analysis for the description of intonation systems
    • M. Horne Kluwer Academic Publishers Dordrecht-the Netherlands
    • D.J. Hirst, A. Di Cristo, and R. Espesser Levels of representation and levels of analysis for the description of intonation systems M. Horne Prosody: Theory and Experiment 2000 Kluwer Academic Publishers Dordrecht-the Netherlands 51 87
    • (2000) Prosody: Theory and Experiment , pp. 51-87
    • Hirst, D.J.1    Di Cristo, A.2    Espesser, R.3
  • 32
    • 85009083816 scopus 로고    scopus 로고
    • Generating prosody by superposing multi-parametric overlapping contours
    • Beijing, China
    • Holm, B., Bailly, G., 2000. Generating prosody by superposing multi-parametric overlapping contours. In: Proc. Internat. Conf. on Speech and Language Processing, Beijing, China, pp. 203-206.
    • (2000) Proc. Internat. Conf. on Speech and Language Processing , pp. 203-206
    • Holm, B.1    Bailly, G.2
  • 33
    • 21844457336 scopus 로고    scopus 로고
    • Learning the hidden structure of intonation: Implementing various functions of prosody
    • Aix-en-Provence, France
    • Holm, B., Bailly, G., 2002. Learning the hidden structure of intonation: Implementing various functions of prosody. In: Speech Prosody, Aix-en-Provence, France, pp. 399-402.
    • (2002) Speech Prosody , pp. 399-402
    • Holm, B.1    Bailly, G.2
  • 34
  • 35
    • 0001931109 scopus 로고
    • Synthesis by rule of segmental durations in English sentences
    • B. Lindblom S. Ohlman Academic Press London
    • D.H. Klatt Synthesis by rule of segmental durations in English sentences B. Lindblom S. Ohlman Frontiers of Speech Communication Research 1979 Academic Press London 287 300
    • (1979) Frontiers of Speech Communication Research , pp. 287-300
    • Klatt, D.H.1
  • 36
    • 0022796218 scopus 로고
    • Synthesis of natural sounding pitch contours in isolated utterances using hidden Markov models
    • A. Ljolje, and F. Fallside Synthesis of natural sounding pitch contours in isolated utterances using hidden Markov models TrASSP 34 1986 1074 1080
    • (1986) TrASSP , vol.34 , pp. 1074-1080
    • Ljolje, A.1    Fallside, F.2
  • 37
    • 0019619658 scopus 로고
    • Acoustic determinants of Perceptual center (p-center) location
    • S.M. Marcus Acoustic determinants of Perceptual center (p-center) location Percept. Psychophys. 30 3 1981 247 256
    • (1981) Percept. Psychophys. , vol.30 , Issue.3 , pp. 247-256
    • Marcus, S.M.1
  • 38
    • 21844451830 scopus 로고    scopus 로고
    • Prosodic and intonational domains in speech synthesis
    • J.P.H. van Santen R.W. Sproat J.P. Olive J. Hirschberg Springer-Verlag New York
    • E.C. Marsi, P.-A.J.M. Coppen, C.H.M. Gussenhoven, and T.C.M. Rietveld Prosodic and intonational domains in speech synthesis J.P.H. van Santen R.W. Sproat J.P. Olive J. Hirschberg Progress in Speech Synthesis 1997 Springer-Verlag New York 477 493
    • (1997) Progress in Speech Synthesis , pp. 477-493
    • Marsi, E.C.1    Coppen, P.-A.J.M.2    Gussenhoven, C.H.M.3    Rietveld, T.C.M.4
  • 40
    • 21844473945 scopus 로고
    • Extracting microprosodic information from diphones-a simple way to model segmental effects on prosody for synthetic speech
    • Banff, Canada
    • Monaghan, A.I.C., 1992. Extracting microprosodic information from diphones-a simple way to model segmental effects on prosody for synthetic speech. In: Internat. Conf. on Speech and Language Processing, Banff, Canada, pp. 1159-1162.
    • (1992) Internat. Conf. on Speech and Language Processing , pp. 1159-1162
    • Monaghan, A.I.C.1
  • 42
    • 0035283592 scopus 로고    scopus 로고
    • Generating prosodic attitudes in French: Data, model and evaluation
    • Y. Morlec, G. Bailly, and V. Aubergé Generating prosodic attitudes in French: Data, model and evaluation Speech Comm. 33 4 2001 357 371
    • (2001) Speech Comm. , vol.33 , Issue.4 , pp. 357-371
    • Morlec, Y.1    Bailly, G.2    Aubergé, V.3
  • 45
    • 0038473022 scopus 로고
    • A study of French vowel and consonant durations
    • D. O'Shaughnessy A study of French vowel and consonant durations J. Phonetics 9 1981 385 406
    • (1981) J. Phonetics , vol.9 , pp. 385-406
    • O'Shaughnessy, D.1
  • 46
    • 21344458605 scopus 로고    scopus 로고
    • Prosodic breaks and attachment decisions in sentence parsing
    • J. Pynte, and B. Prieur Prosodic breaks and attachment decisions in sentence parsing Lang. Cognitive Process. 11 1 1996 165 191
    • (1996) Lang. Cognitive Process. , vol.11 , Issue.1 , pp. 165-191
    • Pynte, J.1    Prieur, B.2
  • 47
    • 21844467937 scopus 로고    scopus 로고
    • Automatic generation of prosody: Comparing two superpositional systems
    • Nara, Japan
    • Raidt, S., Bailly, G., Holm, B., Mixdorff, H., 2004. Automatic generation of prosody: Comparing two superpositional systems. In: Internat. Conf. on Speech Prosody, Nara, Japan, pp. 417-420.
    • (2004) Internat. Conf. on Speech Prosody , pp. 417-420
    • Raidt, S.1    Bailly, G.2    Holm, B.3    Mixdorff, H.4
  • 48
    • 0002069313 scopus 로고
    • Tree-based modelling of segmental durations
    • G. Bailly C. Benoît Elsevier
    • M. Riley Tree-based modelling of segmental durations G. Bailly C. Benoît Talking Machines: Theories, Models and Designs 1992 Elsevier 265 274
    • (1992) Talking Machines: Theories, Models and Designs , pp. 265-274
    • Riley, M.1
  • 49
    • 0032665603 scopus 로고    scopus 로고
    • A dynamical system model for generating fundamental frequency for speech synthesis
    • K.N. Ross, and M. Ostendorf A dynamical system model for generating fundamental frequency for speech synthesis IEEE Trans. Speech Audio Process. 7 3 1999 295 309
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.3 , pp. 295-309
    • Ross, K.N.1    Ostendorf, M.2
  • 51
    • 21844467673 scopus 로고    scopus 로고
    • Recursive patterns in phonological phrases
    • Nara, Japan
    • Schreuder, M., Gilbers, D., 2004. Recursive patterns in phonological phrases. In: Internat. Conf. on Speech Prosody, Nara, Japan, pp. 341-344.
    • (2004) Internat. Conf. on Speech Prosody , pp. 341-344
    • Schreuder, M.1    Gilbers, D.2
  • 56
    • 85009080611 scopus 로고    scopus 로고
    • Inter-transcriber reliability of ToBI prosodic labeling
    • Beijing, China
    • Syrdal, A.K., McGory, J., 2000. Inter-transcriber reliability of ToBI prosodic labeling. In: Internat. Conf. on Spoken Language Processing, Beijing, China, pp. 235-238.
    • (2000) Internat. Conf. on Spoken Language Processing , pp. 235-238
    • Syrdal, A.K.1    McGory, J.2
  • 57
    • 0011133869 scopus 로고    scopus 로고
    • Speech synthesis by phonological structure matching
    • Budapest, Hungary
    • Taylor, P., Black, A.W., 1999. Speech synthesis by phonological structure matching. In: EuroSpeech, Budapest, Hungary, pp. 1531-1534.
    • (1999) EuroSpeech , pp. 1531-1534
    • Taylor, P.1    Black, A.W.2
  • 58
    • 84945895231 scopus 로고    scopus 로고
    • Prosodic data-driven modelling of narrative style in Festival TTS
    • Pittsburgh, USA
    • Tesser, F., Cosi, P., Drioli, C., Tisato, G., 2004. Prosodic data-driven modelling of narrative style in Festival TTS. In: Workshop on Speech Synthesis, Pittsburgh, USA, pp. 185-190.
    • (2004) Workshop on Speech Synthesis , pp. 185-190
    • Tesser, F.1    Cosi, P.2    Drioli, C.3    Tisato, G.4
  • 59
    • 33646636894 scopus 로고    scopus 로고
    • Identification and automatic generation of prosodic contours for a text-to-speech synthesis system in French
    • Rhodes, Greece
    • Tournemire, S.D., 1997. Identification and automatic generation of prosodic contours for a text-to-speech synthesis system in French. In: Proc. Eur. Conf. on Speech Communication and Technology, Rhodes, Greece, pp. 191-194.
    • (1997) Proc. Eur. Conf. on Speech Communication and Technology , pp. 191-194
    • Tournemire, S.D.1
  • 60
    • 0001957999 scopus 로고
    • F0 generation with a database of natural F0 patterns and with a neural network
    • G. Bailly C. Benoît Elsevier
    • C. Traber F0 generation with a database of natural F0 patterns and with a neural network G. Bailly C. Benoît Talking Machines: Theories, Models and Designs 1992 Elsevier 287 304
    • (1992) Talking Machines: Theories, Models and Designs , pp. 287-304
    • Traber, C.1
  • 62
    • 6344264628 scopus 로고
    • Deriving text-to-speech durations from natural speech
    • G. Bailly C. Benoît Elsevier
    • J.P.H. van Santen Deriving text-to-speech durations from natural speech G. Bailly C. Benoît Talking Machines: Theories, Models and Designs 1992 Elsevier 275 285
    • (1992) Talking Machines: Theories, Models and Designs , pp. 275-285
    • Van Santen, J.P.H.1
  • 63
    • 21844436737 scopus 로고    scopus 로고
    • Quantitative modeling of pitch accent alignment
    • Aix-en-Provence, France
    • van Santen, J.P.H., 2002. Quantitative modeling of pitch accent alignment. In: Internat. Conf. on Speech Prosody, Aix-en-Provence, France, pp. 107-112.
    • (2002) Internat. Conf. on Speech Prosody , pp. 107-112
    • Van Santen, J.P.H.1
  • 64
    • 85009069303 scopus 로고    scopus 로고
    • Perceptually based automatic prosody labeling and prosodically enriched unit selection improve concatenative text-to-speech synthesis
    • Beijing, China
    • Wightman, C.W., Syrdal, A.K., Stemmer, G., Conkie A., Beutnagel, M., 2000. Perceptually based automatic prosody labeling and prosodically enriched unit selection improve concatenative text-to-speech synthesis. In: Internat. Conf. on Spoken Language Processing, Beijing, China, pp. 71-74.
    • (2000) Internat. Conf. on Spoken Language Processing , pp. 71-74
    • Wightman, C.W.1    Syrdal, A.K.2    Stemmer, G.3    Conkie, A.4    Beutnagel, M.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.