Volume 50, Issue 4, 2008, Pages 301-311

Bayesian networks for phone duration prediction

Author keywords

Bayesian networks; Classification and regression trees; Duration modelling; Sums of products; Text to speech

Indexed keywords

COMPUTATIONAL LINGUISTICS; MATHEMATICAL MODELS; REGRESSION ANALYSIS; TELECOMMUNICATION LINKS; TIME DOMAIN ANALYSIS; TOPOLOGY; TREES (MATHEMATICS);

EID: 40249100308     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2007.10.002     Document Type: Article
Times cited : (30)

References (58)
  • 2
    • 0028531866
    • Characterisation of rhythmic patterns for text-to-speech synthesis
    • Barbosa P., and Bailly G. Characterisation of rhythmic patterns for text-to-speech synthesis. Speech Comm. 15 (1994) 127-137
    • (1994) Speech Comm. , vol.15 , pp. 127-137
    • Barbosa, P.1    Bailly, G.2
  • 3
    • 0037324538
    • Effects of disfluencies, predictability, and utterance position on word form variation in English conversation
    • Bell A., Jurafsky D., Fosler-Lussier E., Girand C., Gregory M., and Gildea D. Effects of disfluencies, predictability, and utterance position on word form variation in English conversation. J. Acoust. Soc. Amer. 113 2 (2003) 1001-1024
    • (2003) J. Acoust. Soc. Amer. , vol.113 , Issue.2 , pp. 1001-1024
    • Bell, A.1    Jurafsky, D.2    Fosler-Lussier, E.3    Girand, C.4    Gregory, M.5    Gildea, D.6
  • 5
    • 40249085106
    • Black, A., Caley, R., King, S., Taylor, P., 2003. Edinburgh Speech Tools Library: system documentation. Technical Report 1.2.0 edition, The Centre for Speech Technology Research, University of Edinburgh, UK.
  • 6
    • 40249083535
    • Boutilier, C., Friedman, N., Goldszmidt, M., Koller, D., 1996. Context specific independence in Bayesian networks. In: Proc. 12th Conf. on Uncertainty in Artificial Intelligence (UAI), Portland, OR, USA.
  • 8
    • 40249089271
    • Campbell, N., 1992. Prosodic encoding of English speech. In: Proc. 2nd Internat. Conf. on Spoken Language Processing, Banff, Canada.
  • 9
    • 84930566044
    • Segment durations in a syllable frame
    • Campbell W., and Isard S. Segment durations in a syllable frame. J. Phonet. 19 (1991) 37-47
    • (1991) J. Phonet. , vol.19 , pp. 37-47
    • Campbell, W.1    Isard, S.2
  • 10
    • 40249112544
    • Clark, R., Richmond, K., King, S. 2004. Festival 2 - build your own general purpose unit selection speech synthesiser. In: Proc. 5th ISCA Workshop on Speech Synthesis, Pittsburgh, USA.
  • 13
    • 40249105294
    • Cooper, A., 1991. Laryngeal and oral gestures in English /p, t, k/. In: Proc. 12th Internat. Congress of Phonetic Sciences, Aix-en-Provence, France, Vol. 2, pp. 50-53.
  • 14
    • 34249832377
    • A Bayesian method for the induction of probabilistic networks from data
    • Cooper G., and Herskovits E. A Bayesian method for the induction of probabilistic networks from data. Machine Learn. 9 (1992) 309-347
    • (1992) Machine Learn. , vol.9 , pp. 309-347
    • Cooper, G.1    Herskovits, E.2
  • 15
    • 0023950375
    • Segmental durations in connected-speech signals: syllabic stress
    • Crystal T., and House A. Segmental durations in connected-speech signals: syllabic stress. J. Acoust. Soc. Amer. 83 4 (1988) 1574-1585
    • (1988) J. Acoust. Soc. Amer. , vol.83 , Issue.4 , pp. 1574-1585
    • Crystal, T.1    House, A.2
  • 16
    • 0023921973
    • Segmental durations in connected-speech signals: current results
    • Crystal T., and House A. Segmental durations in connected-speech signals: current results. J. Acoust. Soc. Amer. 83 4 (1988) 1553-1573
    • (1988) J. Acoust. Soc. Amer. , vol.83 , Issue.4 , pp. 1553-1573
    • Crystal, T.1    House, A.2
  • 17
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • Dempster A., Laird N., and Rubin D. Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Statist. Soc. B 39 (1977) 1-38
    • (1977) J. Roy. Statist. Soc. , vol.B 39 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 18
    • 40249102488
    • Dusterhoff, K.E., Black, A.W., Taylor, P.A., 1999. Using decision trees within the Tilt intonation model to predict F0 contours. In: CD-ROM Proc. Eurospeech 99, Budapest, Hungary.
  • 19
    • 0031009252
    • Articulatory strengthening at edges of prosodic domains
    • Fougeron C., and Keating P. Articulatory strengthening at edges of prosodic domains. J. Acoust. Soc. Amer. 101 6 (1997) 3728-3740
    • (1997) J. Acoust. Soc. Amer. , vol.101 , Issue.6 , pp. 3728-3740
    • Fougeron, C.1    Keating, P.2
  • 20
    • 40249099628
    • Friedman, N., Goldszmidt, M., 1996. Learning Bayesian networks with local structure. In: Proc. 12th Conf. on Uncertainty in Artificial Intelligence (UAI), Portland, OR, USA.
  • 21
    • 40249093913
    • Goubanova, O., 2005. Bayesian networks for predicting duration of phones. Ph.D. thesis, University of Edinburgh.
  • 22
    • 33745193267
    • Goubanova, O., King, S., 2005. Predicting consonant duration with Bayesian belief networks. In: Proc. of the Interspeech 2005, Lisbon, Portugal, Vol. 4, pp. 1941-1944.
  • 23
    • 24344464454
    • Frequency and predictability effects on the duration of content words in conversation
    • Gregory M., Bell A., Jurafsky D., and Raymond W. Frequency and predictability effects on the duration of content words in conversation. J. Acoust. Soc. Amer. 110 5 (2001) 2738
    • (2001) J. Acoust. Soc. Amer. , vol.110 , Issue.5 , pp. 2738
    • Gregory, M.1    Bell, A.2    Jurafsky, D.3    Raymond, W.4
  • 24
    • 0038947592
    • Abbreviation of consonants in English pre- and post-vocalic clusters
    • Haggard D. Abbreviation of consonants in English pre- and post-vocalic clusters. J. Phonet. 1 1 (1973) 9-24
    • (1973) J. Phonet. , vol.1 , Issue.1 , pp. 9-24
    • Haggard, D.1
  • 25
    • 40249110845
    • Heckerman, D., 1995. A tutorial on learning with Bayesian networks. Technical Report MSR-TR-95-06, Microsoft Research, Microsoft Corporation, Redmond, USA.
  • 26
    • 40249099627
    • Hiller, S.E.J.R., Laver, J., 1990. Spell project speech stimuli:. Technical Report Status report 1.1, The Centre for Speech Technology Research, University of Edinburgh, UK.
  • 27
    • 40249120622
    • Kaiki, N., Takeda, K., Sagisaka, Y., 1990. Statistical analysis for segmental duration rules in Japanese. In: Proc. 1st Internat. Conf. on Spoken Language Processing, Kobe, Japan, pp. 17-20.
  • 28
    • 0015676852
    • Interaction between two factors that influence vowel duration
    • Klatt D. Interaction between two factors that influence vowel duration. J. Acoust. Soc. Amer. 54 4 (1973) 1102-1104
    • (1973) J. Acoust. Soc. Amer. , vol.54 , Issue.4 , pp. 1102-1104
    • Klatt, D.1
  • 29
    • 0016255283
    • The duration of [s] in English words
    • Klatt D. The duration of [s] in English words. J. Speech Hear. Res. 17 (1974) 51-63
    • (1974) J. Speech Hear. Res. , vol.17 , pp. 51-63
    • Klatt, D.1
  • 30
    • 0001576302
    • Vowel lengthening is syntactically determined in connected speech
    • Klatt D. Vowel lengthening is syntactically determined in connected speech. J. Phonet. 59 3 (1975) 129-140
    • (1975) J. Phonet. , vol.59 , Issue.3 , pp. 129-140
    • Klatt, D.1
  • 31
    • 0016952322
    • Linguistic uses of segmental duration of English: acoustic and perceptual evidence
    • Klatt D. Linguistic uses of segmental duration of English: acoustic and perceptual evidence. J. Acoust. Soc. Amer. 59 5 (1976) 1209-1211
    • (1976) J. Acoust. Soc. Amer. , vol.59 , Issue.5 , pp. 1209-1211
    • Klatt, D.1
  • 33
    • 85009064374
    • Krishna, N., Talukdar, P., Bali, K., Ramakrishnan, A., 2004. Duration modelling for Hindi text-to-speech synthesis system. In: CD-ROM Proc. Internat. Conf. on Spoken Language Processing 2004, Denver, USA.
  • 34
    • 0028482006
    • Learning Bayesian belief networks: an approach based on the MDL principle
    • Lam W., and Bacchus F. Learning Bayesian belief networks: an approach based on the MDL principle. Comput. Intell. 10 (1994) 269-293
    • (1994) Comput. Intell. , vol.10 , pp. 269-293
    • Lam, W.1    Bacchus, F.2
  • 36
    • 0000084818
    • The timing of utterances and linguistic boundaries
    • Lehiste I. The timing of utterances and linguistic boundaries. J. Acoust. Soc. Amer. 51 6B (1972) 2018-2024
    • (1972) J. Acoust. Soc. Amer. , vol.51 , Issue.6 B , pp. 2018-2024
    • Lehiste, I.1
  • 37
    • 0015714211
    • Rhythmic units and syntactic units in production and perception
    • Lehiste I. Rhythmic units and syntactic units in production and perception. J. Acoust. Soc. Amer. 54 5 (1973) 1228-1234
    • (1973) J. Acoust. Soc. Amer. , vol.54 , Issue.5 , pp. 1228-1234
    • Lehiste, I.1
  • 38
    • 40249114199
    • Lindblom, D., Rapp, K., 1973. Some temporal regularities of spoken Swedish. In: PILUS, Sweden, Vol. 21, pp. 1-59.
  • 39
    • 33745218730
    • Mayo, C., Clark, R., King, S., 2005. Multidimensional scaling of listener responses to synthetic speech. In: Proc. Interspeech 2005, Lisbon, Portugal, Vol. 4, pp. 1725-1728.
  • 40
    • 40249083533
    • Nooteboom, S., 1972. Production and perception of vowel duration. Ph.D. thesis, University of Utrecht.
  • 41
    • 0015716065
    • The effect of position in utterance on speech segment duration in English
    • Oller O. The effect of position in utterance on speech segment duration in English. J. Acoust. Soc. Amer. 54 5 (1973) 1235-1247
    • (1973) J. Acoust. Soc. Amer. , vol.54 , Issue.5 , pp. 1235-1247
    • Oller, O.1
  • 43
    • 84953657625
    • Duration of syllable nuclei in English
    • Peterson G., and Lehiste I. Duration of syllable nuclei in English. J. Acoust. Soc. Amer. 32 (1960) 693-703
    • (1960) J. Acoust. Soc. Amer. , vol.32 , pp. 693-703
    • Peterson, G.1    Lehiste, I.2
  • 44
    • 0019436690
    • Linguistic timing factors in combination
    • Port R. Linguistic timing factors in combination. J. Acoust. Soc. Amer. 69 1 (1981) 262-273
    • (1981) J. Acoust. Soc. Amer. , vol.69 , Issue.1 , pp. 262-273
    • Port, R.1
  • 45
    • 40249114200
    • Riley, M., 1992. Tree-based modelling for speech synthesis. In: Bailly, G., Benoit, C., Sawallis, T. (Eds.), Talking Machines: Theories, Models and Designs. Elsevier, Amsterdam, Netherlands, pp. 265-273.
  • 46
    • 0033950840
    • Suprasegmental and segmental timing models in Mandarin Chinese and American English
    • Shih C., and van Santen J. Suprasegmental and segmental timing models in Mandarin Chinese and American English. J. Acoust. Soc. Amer. 107 2 (2000) 1012-1026
    • (2000) J. Acoust. Soc. Amer. , vol.107 , Issue.2 , pp. 1012-1026
    • Shih, C.1    van Santen, J.2
  • 47
    • 0002972214
    • Effects of focus distribution, pitch accent and lexical stress on the temporal organization of syllables in Dutch
    • Sluijter A., and van Heuven V. Effects of focus distribution, pitch accent and lexical stress on the temporal organization of syllables in Dutch. Phonetica 52 (1995) 71-89
    • (1995) Phonetica , vol.52 , pp. 71-89
    • Sluijter, A.1    van Heuven, V.2
  • 48
    • 44949128323
    • Strom, V., Clark, R., King, S., 2006. Expressive prosody for unit-selection speech synthesis. In: Proc. Interspeech 2006, Pittsburgh, USA.
  • 49
    • 84966348891
    • Tokuda, K., Zen, H., Black, A., 2002. An HMM-based speech synthesis system applied to English. In: Proc. 2002 IEEE Speech Synthesis Workshop, Santa Monica, USA.
  • 50
    • 0000885146
    • Word-boundary-related duration patterns in English
    • Turk A., and Shattuck-Hufnagel S. Word-boundary-related duration patterns in English. J. Phonet. 28 4 (2000) 397-440
    • (2000) J. Phonet. , vol.28 , Issue.4 , pp. 397-440
    • Turk, A.1    Shattuck-Hufnagel, S.2
  • 51
    • 0000949982
    • Structural influences on accentual lengthening in English
    • Turk A., and White L. Structural influences on accentual lengthening in English. J. Phonet. 27 2 (1999) 171-206
    • (1999) J. Phonet. , vol.27 , Issue.2 , pp. 171-206
    • Turk, A.1    White, L.2
  • 52
    • 40249096897
    • Another consistency in phoneme duration
    • Umeda N. Another consistency in phoneme duration. J. Acoust. Soc. Amer. 58 S1 (1975) 62
    • (1975) J. Acoust. Soc. Amer. , vol.58 , Issue.SUPPL.1 , pp. 62
    • Umeda, N.1
  • 53
    • 0016542927
    • Vowel duration in American English
    • Umeda N. Vowel duration in American English. J. Acoust. Soc. Amer. 58 2 (1975) 435-445
    • (1975) J. Acoust. Soc. Amer. , vol.58 , Issue.2 , pp. 435-445
    • Umeda, N.1
  • 54
    • 0017354413
    • Consonant duration in American English
    • Umeda N. Consonant duration in American English. J. Acoust. Soc. Amer. 61 3 (1977) 847-858
    • (1977) J. Acoust. Soc. Amer. , vol.61 , Issue.3 , pp. 847-858
    • Umeda, N.1
  • 55
    • 0026963758
    • Contextual effects on vowel durations
    • van Santen J.P.H. Contextual effects on vowel durations. Speech Comm. 11 (1992) 513-546
    • (1992) Speech Comm. , vol.11 , pp. 513-546
    • van Santen, J.P.H.1
  • 56
    • 0028405296
    • Assignment of segmental duration in text-to-speech synthesis
    • van Santen J.H. Assignment of segmental duration in text-to-speech synthesis. Comput. Speech Lang. 8 (1994) 95-128
    • (1994) Comput. Speech Lang. , vol.8 , pp. 95-128
    • van Santen, J.H.1
  • 57
    • 40249115750
    • van Son, R., van Santen, J., 1997. Strong interaction between factors influencing consonant duration. In: Proc. of the Interspeech'97, Rhodes, Greece, pp. 319-322.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.