메뉴 건너뛰기




Volumn 14, Issue 3, 2000, Pages 177-210

ProSynth: An integrated prosodic approach to device-independent, natural-sounding speech synthesis

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; ACOUSTIC WAVE REFLECTION; COMPUTATIONAL LINGUISTICS; MATHEMATICAL MODELS; SPEECH ANALYSIS; SPEECH PROCESSING;

EID: 0034224125     PISSN: 08852308     EISSN: None     Source Type: Journal    
DOI: 10.1006/csla.2000.0141     Document Type: Article
Times cited : (32)

References (68)
  • 1
    • 0010936375 scopus 로고
    • Syllable quantity and enclitics in English
    • D. Abercrombie, D. B. Fry, P. A. D. MacCarthy, N. C. Scott, & J. L. Trim. London: Longman Green
    • Abercrombie D. Syllable quantity and enclitics in English. Abercrombie D., Fry D. B., MacCarthy P. A. D., Scott N. C., Trim J. L. In Honour of Daniel Jones. 1964;216-222 Longman Green, London.
    • (1964) Honour of Daniel Jones , pp. 216-222
    • Abercrombie, D.1
  • 5
    • 0039280602 scopus 로고
    • Alignment and composition of tonal accents: Comments on Silverman & Pierrehumbert's paper
    • J. Kingston, & M. Beckman. Cambridge: Cambridge University Press
    • Bruce G. Alignment and composition of tonal accents: comments on Silverman & Pierrehumbert's paper. Kingston J., Beckman M. Papers in Laboratory Phonology I. 1990;107-114 Cambridge University Press, Cambridge.
    • (1990) Papers in Laboratory Phonology I , pp. 107-114
    • Bruce, G.1
  • 6
    • 0010968444 scopus 로고
    • The phonetic interpretation of headed phonological structures containing overlapping constituents
    • Coleman J. S. The phonetic interpretation of headed phonological structures containing overlapping constituents. Phonology. 9:1992;1-44.
    • (1992) Phonology , vol.9 , pp. 1-44
    • Coleman, J.S.1
  • 7
    • 33846598677 scopus 로고
    • Comprehension of synthetic speech produced by rule: A review and theoretical interpretation
    • Duffy S. A., Pisoni D. B. Comprehension of synthetic speech produced by rule: A review and theoretical interpretation. Language and Speech. 35:1992;351-389.
    • (1992) Language and Speech , vol.35 , pp. 351-389
    • Duffy, S.A.1    Pisoni, D.B.2
  • 9
    • 0001727387 scopus 로고
    • Exploiting lawful variability in the speech waveform
    • J. S. Perkell, & D. H. Klatt. Hillsdale, NJ: Erlbaum
    • Elman J. L., McClelland J. L. Exploiting lawful variability in the speech waveform. Perkell J. S., Klatt D. H. Invariance and Variability in Speech Processes. 1986;360-385 Erlbaum, Hillsdale, NJ.
    • (1986) Invariance and Variability in Speech Processes , pp. 360-385
    • Elman, J.L.1    McClelland, J.L.2
  • 13
    • 0344147456 scopus 로고    scopus 로고
    • Effects on word recognition of syllable-onset cues to syllable-coda voicing
    • J. K. Local, R. A. Ogden, & R. A. M. Temple. Cambridge, U.K: Cambridge University Press
    • Hawkins S., Nguyen N. Effects on word recognition of syllable-onset cues to syllable-coda voicing. Local J. K., Ogden R. A., Temple R. A. M. Papers in Laboratory Phonology VI. in press;Cambridge University Press, Cambridge, U.K.
    • Papers in Laboratory Phonology VI
    • Hawkins, S.1    Nguyen, N.2
  • 16
    • 0342336018 scopus 로고    scopus 로고
    • Synthesizing systematic variation at boundaries between vowels and obstruents
    • J. J. Ohala, Y. Hasegawa, M. Ohala, D. Granville, & A. C. Bailey. Berkeley: University of California
    • Heid S., Hawkins S. Synthesizing systematic variation at boundaries between vowels and obstruents. Ohala J. J., Hasegawa Y., Ohala M., Granville D., Bailey A. C. Proceedings of the XIVth International Congress of Phonetic Sciences. 1999;511-514 University of California, Berkeley.
    • (1999) Proceedings of the XIVth International Congress of Phonetic Sciences , pp. 511-514
    • Heid, S.1    Hawkins, S.2
  • 18
    • 0342336011 scopus 로고    scopus 로고
    • Intonation modelling in ProSynth: An integrated prosodic approach to speech synthesis
    • J. J. Ohala, Y. Hasegawa, M. Ohala, D. Granville, & A. C. Bailey. Berkeley, CA: University of California
    • House J., Dankovičová J., Huckvale M. Intonation modelling in ProSynth: an integrated prosodic approach to speech synthesis. Ohala J. J., Hasegawa Y., Ohala M., Granville D., Bailey A. C. Proceedings of the XIVth International Congress of Phonetic Sciences. 1999;2343-2346 University of California, Berkeley, CA.
    • (1999) Proceedings of the XIVth International Congress of Phonetic Sciences , pp. 2343-2346
    • House, J.1    Dankovičová, J.2    Huckvale, M.3
  • 19
    • 0342770970 scopus 로고
    • An integrated phonological-phonetic model for text-to-speech synthesis
    • K. Elenius, & P. Branderud. Sweden: KTH and Stockholm University
    • House J., Hawkins S. An integrated phonological-phonetic model for text-to-speech synthesis. Elenius K., Branderud P. Proceedings of the XIIIth International Congress of Phonetic Sciences. 1995;326-329 KTH and Stockholm University, Sweden.
    • (1995) Proceedings of the XIIIth International Congress of Phonetic Sciences , pp. 326-329
    • House, J.1    Hawkins, S.2
  • 20
    • 85007273570 scopus 로고    scopus 로고
    • University College London
    • J. House, A. Wichmann, 1996, University College London, 99, 117.
    • (1996) , pp. 99117
    • House, J.1    Wichmann, A.2
  • 21
    • 0344147462 scopus 로고    scopus 로고
    • Domain-initial strengthening in four languages
    • J. K. Local, R. A. Ogden, & R. A. M. Temple. Cambridge, U.K: Cambridge University Press
    • Keating P., Cho T., Fougéron C., Hsu C.-S. Domain-initial strengthening in four languages. Local J. K., Ogden R. A., Temple R. A. M. Papers in Laboratory Phonology VI. in press;Cambridge University Press, Cambridge, U.K.
    • Papers in Laboratory Phonology VI
    • Keating, P.1    Cho, T.2    Fougéron, C.3    Hsu, C.-S.4
  • 22
  • 23
    • 0001931109 scopus 로고
    • Synthesis by rule of segmental durations in English sentences
    • B. Lindblom, & S. Öhman. New York: Academic Press
    • Klatt D. Synthesis by rule of segmental durations in English sentences. Lindblom B., Öhman S. Frontiers of Speech Communication Research. 1979;287-299 Academic Press, New York.
    • (1979) Frontiers of Speech Communication Research , pp. 287-299
    • Klatt, D.1
  • 25
    • 0037993904 scopus 로고
    • Modelling assimilation in a non-segmental rule-free phonology
    • G. J. Docherty, & D. R. Ladd. Cambridge: Cambridge University Press
    • Local J. K. Modelling assimilation in a non-segmental rule-free phonology. Docherty G. J., Ladd D. R. Papers in Laboratory Phonology II: Gesture, Segment, Prosody. 1992;190-223 Cambridge University Press, Cambridge.
    • (1992) Papers in Laboratory Phonology II: Gesture, Segment, Prosody , pp. 190-223
    • Local, J.K.1
  • 26
    • 85007161540 scopus 로고
    • Local
    • Local, 1993.
    • (1993)
  • 28
    • 0242306945 scopus 로고
    • Making sense of dynamic, non-segmental phonetics
    • K. Elenius, & P. Branderud. Sweden: KTH and Stockholm University
    • Local J. K. Making sense of dynamic, non-segmental phonetics. Elenius K., Branderud P. Proceedings of the XIIIth International Congress of Phonetic Sciences. 1995;2-9 KTH and Stockholm University, Sweden.
    • (1995) Proceedings of the XIIIth International Congress of Phonetic Sciences , pp. 2-9
    • Local, J.K.1
  • 29
    • 0242338499 scopus 로고    scopus 로고
    • A model of timing for non-segmental phonological structure
    • J. P. H. van Santen, R. W. Sproat, J. P. Olive, & J. Hirschberg. New York: Springer
    • Local J. K., Ogden R. A model of timing for non-segmental phonological structure. van Santen J. P. H., Sproat R. W., Olive J. P., Hirschberg J. Progress in Speech Synthesis. 1997;109-122 Springer, New York.
    • (1997) Progress in Speech Synthesis , pp. 109-122
    • Local, J.K.1    Ogden, R.2
  • 31
    • 84937297387 scopus 로고
    • Speakers nasalize /o /after /n /, but still hear /o /
    • Manuel S. Y. Speakers nasalize /o /after /n /, but still hear /o /. Journal of Phonetics. 23:1995;453-476.
    • (1995) Journal of Phonetics , vol.23 , pp. 453-476
    • Manuel, S.Y.1
  • 33
    • 0000943762 scopus 로고
    • Contexual variation of the vowel voice source as a function of adjacent consonants
    • Ní Chasaide A., Gobl C. Contexual variation of the vowel voice source as a function of adjacent consonants. Language and Speech. 36:1993;303-330.
    • (1993) Language and Speech , vol.36 , pp. 303-330
    • Ní Chasaide, A.1    Gobl, C.2
  • 34
    • 85050839044 scopus 로고
    • Parametric interpretation in YorkTalk
    • Ogden R. Parametric interpretation in YorkTalk. York Papers in Linguistics. 16:1992;81-99.
    • (1992) York Papers in Linguistics , vol.16 , pp. 81-99
    • Ogden, R.1
  • 35
    • 0343641239 scopus 로고    scopus 로고
    • A syllable level feature in Finnish
    • H. van der Hulst, & N. Ritter. Berlin: Mouton de Gruyter
    • Ogden R. A syllable level feature in Finnish. van der Hulst H., Ritter N. The Syllable: Views and Facts. 1999;651-672 Mouton de Gruyter, Berlin.
    • (1999) The Syllable: Views and Facts , pp. 651-672
    • Ogden, R.1
  • 36
    • 0342770969 scopus 로고    scopus 로고
    • Temporal interpretation in ProSynth, a prosodic speech synthesis system
    • J. J. Ohala, Y. Hasegawa, M. Ohala, D. Granville, & A. C. Bailey. Berkeley: University of California
    • Ogden R., Local J., Carter P. Temporal interpretation in ProSynth, a prosodic speech synthesis system. Ohala J. J., Hasegawa Y., Ohala M., Granville D., Bailey A. C. Proceedings of the XIVth International Congress of Phonetic Sciences. 1999;1059-1062 University of California, Berkeley.
    • (1999) Proceedings of the XIVth International Congress of Phonetic Sciences , pp. 1059-1062
    • Ogden, R.1    Local, J.2    Carter, P.3
  • 37
    • 0001540328 scopus 로고
    • Phonological and phonetic representation
    • Pierrehumbert J. P. Phonological and phonetic representation. Journal of Phonetics. 18:1990;375-394.
    • (1990) Journal of Phonetics , vol.18 , pp. 375-394
    • Pierrehumbert, J.P.1
  • 39
    • 0012433827 scopus 로고    scopus 로고
    • Perception of synthetis speech
    • J. P. H. van Santen, R. W. Sproat, J. P. Olive, & J. Hirschberg. New York: Springer
    • Pisoni D. B. Perception of synthetis speech. van Santen J. P. H., Sproat R. W., Olive J. P., Hirschberg J. Progress in Speech Synthesis. 1997;541-560 Springer, New York.
    • (1997) Progress in Speech Synthesis , pp. 541-560
    • Pisoni, D.B.1
  • 41
    • 0041110189 scopus 로고
    • Assessment of text-to-speech synthesis stystems
    • A. Fourcin, G. Harland, W. Barry, & V. Hazan. Chichester: Ellis Horwood
    • Pols L. C. W. Assessment of text-to-speech synthesis stystems. Fourcin A., Harland G., Barry W., Hazan V. Speech Input and Output Assessment: Multilingual Methods and Standards. 1989;53-81 Ellis Horwood, Chichester.
    • (1989) Speech Input and Output Assessment: Multilingual Methods and Standards , pp. 53-81
    • Pols, L.C.W.1
  • 44
    • 0032178685 scopus 로고    scopus 로고
    • Multimodal perceptual organization of speech: Evidence from tone analogs of spoken utterances
    • Remez R. E., Fellowes J. M., Pisoni D. B., Goh W. D., Rubin P. E. Multimodal perceptual organization of speech: evidence from tone analogs of spoken utterances. Speech Communication. 26:1998;65-73.
    • (1998) Speech Communication , vol.26 , pp. 65-73
    • Remez, R.E.1    Fellowes, J.M.2    Pisoni, D.B.3    Goh, W.D.4    Rubin, P.E.5
  • 46
    • 0019508260 scopus 로고
    • Speech perception without traditional speech cues
    • Remez R. E., Rubin P. E., Pisoni D. B., Carrel T. D. Speech perception without traditional speech cues. Science. 212:1981;947-950.
    • (1981) Science , vol.212 , pp. 947-950
    • Remez, R.E.1    Rubin, P.E.2    Pisoni, D.B.3    Carrel, T.D.4
  • 47
    • 0020161263 scopus 로고
    • Phonetic trading relations and context effects: New experimental evidence for a speech mode of perception
    • Repp B. Phonetic trading relations and context effects: new experimental evidence for a speech mode of perception. Psychological Bulletin. 92:1982;81-110.
    • (1982) Psychological Bulletin , vol.92 , pp. 81-110
    • Repp, B.1
  • 48
    • 84937287450 scopus 로고
    • Aligning pitch targets in speech synthesis: Effects of syllable structure
    • Rietveld A., Gussenhoven C. Aligning pitch targets in speech synthesis: effects of syllable structure. Journal of Phonetics. 23:1995;375-385.
    • (1995) Journal of Phonetics , vol.23 , pp. 375-385
    • Rietveld, A.1    Gussenhoven, C.2
  • 49
    • 0037795511 scopus 로고
    • Frequency selectivity and perception of speech
    • B. C. J. Moore. Harcourt Brace Jovanovich, London: Academic Press
    • Rosen S. M., Fourcin A. J. Frequency selectivity and perception of speech. Moore B. C. J. Frequency Selectivity in Hearing. 1986;373-487 Academic Press, Harcourt Brace Jovanovich, London.
    • (1986) Frequency Selectivity in Hearing , pp. 373-487
    • Rosen, S.M.1    Fourcin, A.J.2
  • 50
    • 0000279926 scopus 로고
    • Auditory, articulatory, and learning explanations of categorical perception in speech
    • S. Harnad. Cambridge: Cambridge University Press
    • Rosen S., Howell P. Auditory, articulatory, and learning explanations of categorical perception in speech. Harnad S. Categorial Perception: The Groundwork of Cognition. 1987;113-160 Cambridge University Press, Cambridge.
    • (1987) Categorial Perception: The Groundwork of Cognition , pp. 113-160
    • Rosen, S.1    Howell, P.2
  • 53
    • 0010135126 scopus 로고
    • The timing of prenuclear high accents in English
    • J. Kingston, & M. Beckman. Cambridge: Cambridge University Press
    • Silverman K., Pierrehumbert J. The timing of prenuclear high accents in English. Kingston J., Beckman M. Papers in Laboratory Phonology I. 1990;72-106 Cambridge University Press, Cambridge.
    • (1990) Papers in Laboratory Phonology I , pp. 72-106
    • Silverman, K.1    Pierrehumbert, J.2
  • 54
    • 0342770895 scopus 로고
    • Casual speech rules and what the phonology of connected speech might really be like
    • Simpson A. Casual speech rules and what the phonology of connected speech might really be like. Linguistics. 30:1992;535-548.
    • (1992) Linguistics , vol.30 , pp. 535-548
    • Simpson, A.1
  • 55
    • 0032181249 scopus 로고    scopus 로고
    • PURR - A method for prosody evaluation and investigation
    • Sonntag G. P., Portele T. PURR - A method for prosody evaluation and investigation. Computer Speech and Language. 12:1998;437-451.
    • (1998) Computer Speech and Language , vol.12 , pp. 437-451
    • Sonntag, G.P.1    Portele, T.2
  • 56
    • 0004161686 scopus 로고    scopus 로고
    • Multilingual text-to-speech synthesis
    • Boston: Kluwer Academic Publishers
    • (R. Sproat, ed.). Multilingual text-to-speech synthesis. The Bell Labs Approach. 1997;Kluwer Academic Publishers, Boston.
    • (1997) The Bell Labs Approach
    • Sproat, R.1
  • 57
    • 0024516066 scopus 로고
    • Dynamic specification of coarticulated vowels spoken in sentence context
    • Strange W. Dynamic specification of coarticulated vowels spoken in sentence context. Journal of the Acoustical Society of America. 85:1989;2135-2153.
    • (1989) Journal of the Acoustical Society of America , vol.85 , pp. 2135-2153
    • Strange, W.1
  • 60
    • 0028405296 scopus 로고
    • Assignment of segmental duration in text-to-speech synthesis
    • van Santen J. Assignment of segmental duration in text-to-speech synthesis. Computer Speech and Language. 8:1994;95-128.
    • (1994) Computer Speech and Language , vol.8 , pp. 95-128
    • Van Santen, J.1
  • 65
    • 0023196450 scopus 로고
    • Continuous uptake of acoustic cues in spoken word recognition
    • Warren P., Marslen-Wilson W. Continuous uptake of acoustic cues in spoken word recognition. Perception and Psychophysics. 41:1987;262-275.
    • (1987) Perception and Psychophysics , vol.41 , pp. 262-275
    • Warren, P.1    Marslen-Wilson, W.2
  • 66
    • 0038331837 scopus 로고    scopus 로고
    • The extent of coarticulation of English liquids: An acoustic and articulatory study
    • J. J. Ohala, Y. Hasegawa, M. Ohala, D. Granville, & A. C. Bailey. Berkeley, CA: University of California
    • West P. The extent of coarticulation of English liquids: An acoustic and articulatory study. Ohala J. J., Hasegawa Y., Ohala M., Granville D., Bailey A. C. Proceedings of the XIVth International Congress of Phonetic Sciences. 1999;1901-1904 University of California, Berkeley, CA.
    • (1999) Proceedings of the XIVth International Congress of Phonetic Sciences , pp. 1901-1904
    • West, P.1
  • 67
    • 0343641115 scopus 로고    scopus 로고
    • Discourse constraints on peak timing in English: Experimental evidence
    • J. J. Ohala, Y. Hasegawa, M. Ohala, D. Granville, & A. C. Bailey. Berkeley: University of California
    • Wichmann A., House J. Discourse constraints on peak timing in English: experimental evidence. Ohala J. J., Hasegawa Y., Ohala M., Granville D., Bailey A. C. Proceedings of the XIVth International Congress of Phonetic Sciences. 1999;1765-1768 University of California, Berkeley.
    • (1999) Proceedings of the XIVth International Congress of Phonetic Sciences , pp. 1765-1768
    • Wichmann, A.1    House, J.2
  • 68
    • 0000235349 scopus 로고
    • An acoustic and electropalatographic study of lexical and postlexical palatalisation in American English
    • B. Connell, & A. Arvaniti. Cambridge: Cambridge University Press
    • Zsiga E. C. An acoustic and electropalatographic study of lexical and postlexical palatalisation in American English. Connell B., Arvaniti A. Phonology and Phonetic Evidence: Papers in Laboratory Phonology IV. 1995;282-302 Cambridge University Press, Cambridge.
    • (1995) Phonology and Phonetic Evidence: Papers in Laboratory Phonology IV , pp. 282-302
    • Zsiga, E.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.