메뉴 건너뛰기




Volumn , Issue , 2008, Pages 429-436

Rule-Based Speech Synthesis

Author keywords

Formant Synthesis; Glottal Opening; Speech Synthesis; Vocal Tract; Voice Source

Indexed keywords


EID: 78649342858     PISSN: 25228692     EISSN: 25228706     Source Type: Book Series    
DOI: 10.1007/978-3-540-49127-9_20     Document Type: Chapter
Times cited : (3)

References (79)
  • 7
    • 0017012286 scopus 로고
    • Structure of a phonological rule component for a synthesis-by-rule program
    • D. K. Klatt: Structure of a phonological rule component for a synthesis-by-rule program, IEEE Trans. ASSP-24 (1976)
    • (1976) IEEE Trans. ASSP-24
    • Klatt, D.K.1
  • 9
    • 85067593976 scopus 로고
    • A text-to-speech system based entirely on rules
    • Philadelphia
    • R. Carlson, B. Granström: A text-to-speech system based entirely on rules, Proc. ICASSP 76, Philadelphia (1976)
    • (1976) Proc. ICASSP 76
    • Carlson, R.1    Granström, B.2
  • 10
    • 0025229803 scopus 로고
    • Speech synthesis from text
    • Y. Sagisaka: Speech synthesis from text, IEEE Commun. Mag. 28(1), 35–41 (1990)
    • (1990) IEEE Commun. Mag. , vol.28 , Issue.1 , pp. 35-41
    • Sagisaka, Y.1
  • 12
    • 0041070092 scopus 로고    scopus 로고
    • Speech synthesis
    • Hardcastle WJ and Laver J., Blackwell, Oxford,) pp
    • R. Carlson, B. Granström: Speech synthesis. In: Hardcastle WJ and Laver J. The Handbook of Phonetic Science (Blackwell, Oxford 1997) pp. 768–788
    • (1997) The Handbook of Phonetic Science , pp. 768-788
    • Carlson, R.1    Granström, B.2
  • 13
    • 0023407575 scopus 로고
    • Review of text-to-speech conversion for English
    • D.K. Klatt: Review of text-to-speech conversion for English, J. Acoust. Soc. Am. 82(3), 737–793 (1987)
    • (1987) J. Acoust. Soc. Am. , vol.82 , Issue.3 , pp. 737-793
    • Klatt, D.K.1
  • 15
    • 84886459827 scopus 로고
    • Speech Communication Research
    • G. Fant: Speech Communication Research, Ing. Vetenskaps Akad. Stockholm 24, 331–337 (1953)
    • (1953) Ing. Vetenskaps Akad. Stockholm , vol.24 , pp. 331-337
    • Fant, G.1
  • 16
    • 0018986665 scopus 로고
    • Software for a cascade/parallel formant synthesizer
    • D.K. Klatt: Software for a cascade/parallel formant synthesizer, J. Acoust. Soc. Am. 67, 971 (1980)
    • (1980) J. Acoust. Soc. Am. , vol.67 , pp. 971
    • Klatt, D.K.1
  • 17
    • 0020905802 scopus 로고
    • Formant synthesizers, cascade or parallel
    • J. Holmes: Formant synthesizers, cascade or parallel, Speech Commun. 2, 251–273 (1983)
    • (1983) Speech Commun , vol.2 , pp. 251-273
    • Holmes, J.1
  • 18
    • 84912906590 scopus 로고
    • Constraints among parameters simplify control of Klatt formant synthesizer
    • K. Stevens, C. Bickley: Constraints among parameters simplify control of Klatt formant synthesizer, J. Phonetics 19(1) (1991)
    • (1991) J. Phonetics , vol.19 , Issue.1
    • Stevens, K.1    Bickley, C.2
  • 19
    • 0026372714 scopus 로고
    • Experiments with voice modelling in speech synthesis
    • R. Carlson, B. Granström, I. Karlsson: Experiments with voice modelling in speech synthesis, Speech Commun. 10, 481–489 (1991)
    • (1991) Speech Commun , vol.10 , pp. 481-489
    • Carlson, R.1    Granström, B.2    Karlsson, I.3
  • 20
    • 0141588508 scopus 로고
    • The Klattalk text-to-speech conversion system
    • D. Klatt: The Klattalk text-to-speech conversion system, Proc. ICASSP 82, 1589–1592 (1982)
    • (1982) Proc. ICASSP 82 , pp. 1589-1592
    • Klatt, D.1
  • 22
    • 30244566635 scopus 로고
    • The OVE III speech synthesizer
    • J. Liljencrants: The OVE III speech synthesizer, IEEE Trans.Audio Electroac. 16(1), 137–140 (1968)
    • (1968) IEEE Trans.Audio Electroac. , vol.16 , Issue.1 , pp. 137-140
    • Liljencrants, J.1
  • 23
    • 85009069019 scopus 로고
    • A multi-language text-to-speech module, Proc
    • R. Carlson, B. Granström, S. Hunnicutt: A multi-language text-to-speech module, Proc. ICASSP 82 82(3), 1604–1607 (1982)
    • (1982) ICASSP 82 , vol.82 , Issue.3 , pp. 1604-1607
    • Carlson, R.1    Granström, B.2    Hunnicutt, S.3
  • 25
    • 0036711819 scopus 로고    scopus 로고
    • A quasiarticulatory approach to controlling acoustic source parameters in a Klatt-type formant synthesizer using HLsyn
    • H.M. Hanson, K.N. Stevens: A quasiarticulatory approach to controlling acoustic source parameters in a Klatt-type formant synthesizer using HLsyn, J. Acoust. Soc. Am. 112, 1158–1182 (2002)
    • (2002) J. Acoust. Soc. Am. , vol.112 , pp. 1158-1182
    • Hanson, H.M.1    Stevens, K.N.2
  • 28
    • 0342336018 scopus 로고    scopus 로고
    • Synthesizing systematic variation at boundaries between vowels and obstruents
    • Vol., ed. by J.J. Ohala, Y. Hasegawa, M. Ohala, D. Granville, A.C. Bailey, University of California, Berkeley,) pp
    • S. Heid, S. Hawkins: Synthesizing systematic variation at boundaries between vowels and obstruents. In: Proceedings of the XIVth International Congress of Phonetic Sciences, Vol. 1, ed. by J.J. Ohala, Y. Hasegawa, M. Ohala, D. Granville, A.C. Bailey (University of California, Berkeley 1999) pp. 511–514
    • (1999) Proceedings of the Xivth International Congress of Phonetic Sciences , vol.1 , pp. 511-514
    • Heid, S.1    Hawkins, S.2
  • 30
    • 84927454548 scopus 로고
    • Acoustic analysis of voice source dynamics
    • T. V. Ananthapadmanabha: Acoustic analysis of voice source dynamics, STL-QPSR 2(3) 1-24 (1984)
    • (1984) STL-QPSR , vol.2 , Issue.3 , pp. 1-24
    • Ananthapadmanabha, T.V.1
  • 32
    • 0015699693 scopus 로고
    • Influence of the glottal waveform on the naturalness of speech from a parallel formant synthesizer
    • J. Holmes: Influence of the glottal waveform on the naturalness of speech from a parallel formant synthesizer, IEEE Trans. Audio Electroac. AU-21, 298–305 (1973)
    • (1973) IEEE Trans. Audio Electroac. AU , vol.21 , pp. 298-305
    • Holmes, J.1
  • 33
    • 0025321354 scopus 로고
    • Analysis, synthesis, and perception of voice quality variations among female and male talkers
    • D.K. Klatt, L. Klatt: Analysis, synthesis, and perception of voice quality variations among female and male talkers, J. Acoust. Soc. Am. 87, 820–857 (1990)
    • (1990) J. Acoust. Soc. Am. , vol.87 , pp. 820-857
    • Klatt, D.K.1    Klatt, L.2
  • 34
    • 84885529652 scopus 로고
    • Effect of glottal pulse shape on the quality of natural vowels
    • A.E. Rosenberg: Effect of glottal pulse shape on the quality of natural vowels, J. Acoust. Soc. Am. 53, 1632–1645 (1971)
    • (1971) J. Acoust. Soc. Am. , vol.53 , pp. 1632-1645
    • Rosenberg, A.E.1
  • 36
    • 5844348624 scopus 로고
    • A Phonetically Oriented Programming Language for Rule Description of Speech
    • ed. by G. FantAlmqvist Wiksell, Uppsala
    • R. Carlson, B. Granström: A Phonetically Oriented Programming Language for Rule Description of Speech. In: Speech Communication, Vol. 2, ed. by G. Fant (Almqvist Wiksell, Uppsala 1975) pp. 245–253
    • (1975) Speech Communication , vol.2 , pp. 245-253
    • Carlson, R.1    Granström, B.2
  • 38
    • 0026941709 scopus 로고
    • Acoustic characteristics of voice quality
    • C. Gobl, A. Ní Chasaide: Acoustic characteristics of voice quality, Speech Commun. 11, 481–490 (1992)
    • (1992) Speech Commun , vol.11 , pp. 481-490
    • Gobl, C.1    Ní Chasaide, A.2
  • 39
    • 0037380186 scopus 로고    scopus 로고
    • The role of voice quality in communicating emotion, mood and attitude
    • C. Gobl, A. Ní Chasaide: The role of voice quality in communicating emotion, mood and attitude, Speech Commun. 40, 189–212 (2003)
    • (2003) Speech Commun , vol.40 , pp. 189-212
    • Gobl, C.1    Ní Chasaide, A.2
  • 40
    • 0026940696 scopus 로고
    • Modelling speaking styles in female speech synthesis
    • I. Karlsson: Modelling speaking styles in female speech synthesis, Speech Commun. 11, 491–497 (1992)
    • (1992) Speech Commun , vol.11 , pp. 491-497
    • Karlsson, I.1
  • 42
    • 84928450970 scopus 로고
    • Effects of the vocal tract constriction on the glottal source: Experimental and modelling studies
    • C. Bickley, K. Stevens: Effects of the vocal tract constriction on the glottal source: Experimental and modelling studies, J. Phon. 14, 373–382 (1986)
    • (1986) J. Phon , vol.14 , pp. 373-382
    • Bickley, C.1    Stevens, K.2
  • 43
    • 0015142423 scopus 로고
    • Airflow and turbulence noise for fricative and stop consonants: Static considerations
    • K.N. Stevens: Airflow and turbulence noise for fricative and stop consonants: Static considerations, J. Acoust. Soc. Am. 50(4), 1180–1192 (1971)
    • (1971) J. Acoust. Soc. Am. , vol.50 , Issue.4 , pp. 1180-1192
    • Stevens, K.N.1
  • 46
    • 0027574102 scopus 로고
    • Speech Maker: A flexible and general framework for text-to-speech synthesis, and its application to Dutch
    • H.C. van Leeuwen, E. te Lindert: Speech Maker: A flexible and general framework for text-to-speech synthesis, and its application to Dutch, Comput. Speech Lang. 7(2), 149–168 (1993)
    • (1993) Comput. Speech Lang. , vol.7 , Issue.2 , pp. 149-168
    • van Leeuwen, H.C.1    Te Lindert, E.2
  • 48
    • 84885525179 scopus 로고
    • Speech synthesizer control by smoothed step functions
    • J. Liljencrants: Speech synthesizer control by smoothed step functions, STL-QPSR 1969(4), 43–50 (1969)
    • (1969) STL-QPSR , vol.1969 , Issue.4 , pp. 43-50
    • Liljencrants, J.1
  • 49
    • 84955042643 scopus 로고
    • Synthesis of stop consonants in initial position
    • D.H. Klatt: Synthesis of stop consonants in initial position, J. Acoust. Soc. Am. Suppl. 147, S93 (1970)
    • (1970) J. Acoust. Soc. Am. Suppl. , vol.147 , pp. S93
    • Klatt, D.H.1
  • 50
    • 0016943755 scopus 로고
    • Linguistic rules for text-to-speech synthesis
    • N. Umeda: Linguistic rules for text-to-speech synthesis, Proc. IEEE 64(4), 443–451 (1976)
    • (1976) Proc. IEEE , vol.64 , Issue.4 , pp. 443-451
    • Umeda, N.1
  • 51
    • 0001931109 scopus 로고
    • Synthesis by rule of segmental durations in English sentences
    • ed. by B. Lindblom, S. öhman (Academic, New York
    • D.K. Klatt: Synthesis by rule of segmental durations in English sentences. In: Frontiers in Speech Communication Research, ed. by B. Lindblom, S. öhman (Academic, New York 1979)
    • (1979) Frontiers in Speech Communication Research
    • Klatt, D.K.1
  • 52
    • 84930563909 scopus 로고
    • Formant trajectories as audible gestures: An alternative for speech synthesis
    • G. Bailly, R. Laboissière, J. L. Schwartz: Formant trajectories as audible gestures: an alternative for speech synthesis, J. Phon. 19(1), 9-23 (1991)
    • (1991) J. Phon. , vol.19 , Issue.1 , pp. 9-23
    • Bailly, G.1    Laboissière, R.2    Schwartz, J.L.3
  • 53
    • 0000665734 scopus 로고
    • Explaining phonetic variation: A sketch of the H and H theory
    • ed. by Hardcastle, MarchalKluwer Academic, Dordrecht
    • B. Lindblom: Explaining phonetic variation: A sketch of the H and H theory. In: Speech Production Modeling, ed. by Hardcastle, Marchal (Kluwer Academic, Dordrecht 1990)
    • (1990) Speech Production Modeling
    • Lindblom, B.1
  • 55
    • 85075930205 scopus 로고
    • Effects of stress and vowel context on velar stops in British English, ICSLP 92
    • A. Slater, S. Hawkins: Effects of stress and vowel context on velar stops in British English, ICSLP 92 (Proc. 1992 Int. Conf. Spoken Language Processing) 1, 57–60 (1992)
    • (1992) Proc. 1992 Int. Conf. Spoken Language Processing , vol.1 , pp. 57-60
    • Slater, A.1    Hawkins, S.2
  • 58
    • 84930566519 scopus 로고
    • Streams, phones, and transitions: Toward a new phonological and phonetic model of formant timing
    • S. R. Hertz: Streams, phones, and transitions: toward a new phonological and phonetic model of formant timing, J. Phon. 19(1) (1991)
    • (1991) J. Phon. , vol.19 , Issue.1
    • Hertz, S.R.1
  • 59
    • 0022150238 scopus 로고
    • The Delta rule development system for speech synthesis from text
    • S.R. Hertz, J. Kadin, K.J. Karplus: The Delta rule development system for speech synthesis from text, Proc. IEEE 73(11), 1589–1601 (1985)
    • (1985) Proc. IEEE , vol.73 , Issue.11 , pp. 1589-1601
    • Hertz, S.R.1    Kadin, J.2    Karplus, K.J.3
  • 61
    • 85075935290 scopus 로고
    • Yet another rule compiler for text-to-speech conversion?
    • pp
    • K. Ceder, B. Lyberg: Yet another rule compiler for text-to-speech conversion? Proc. ICSLP92, Banff, Canada, pp. 1151-1154 (1992)
    • (1992) Proc. ICSLP92, Banff, Canada , pp. 1151-1154
    • Ceder, K.1    Lyberg, B.2
  • 62
    • 0026382594 scopus 로고
    • Speechmaker, text-to-speech synthesis based on a multilevel, synchronized data structure
    • H. C. van Leeuwen, E. te Lindert: Speechmaker, text-to-speech synthesis based on a multilevel, synchronized data structure, Proc. ICASSP-91 (1991)
    • (1991) Proc. ICASSP-91
    • van Leeuwen, H.C.1    Te Lindert, E.2
  • 63
    • 24144469759 scopus 로고    scopus 로고
    • Data-driven multimodal synthesis
    • R. Carlson, B. Granström: Data-driven multimodal synthesis, Issues Speech Commun. 47(1-2), 182–193 (2005)
    • (2005) Issues Speech Commun , vol.47 , Issue.1-2 , pp. 182-193
    • Carlson, R.1    Granström, B.2
  • 65
    • 85075933509 scopus 로고
    • Segmentation techniques in speech synthesis
    • G. Peterson, W. Wang, E. Sivertsen: Segmentation techniques in speech synthesis, J. Acoust. Soc. Am. 32, 639–703 (1958)
    • (1958) J. Acoust. Soc. Am , vol.32 , pp. 639-703
    • Peterson, G.1    Wang, W.2    Sivertsen, E.3
  • 66
    • 0345093720 scopus 로고
    • Terminal Analog Synthesis of Continuous Speech Using the Diphone Method of Segment Assembly
    • N.R. Dixon, H.D. Maxey: Terminal Analog Synthesis of Continuous Speech Using the Diphone Method of Segment Assembly, IEEE Trans. Audio Electroac. AU-16, 40–50 (1968)
    • (1968) IEEE Trans. Audio Electroac. AU , vol.16 , pp. 40-50
    • Dixon, N.R.1    Maxey, H.D.2
  • 67
    • 85068112784 scopus 로고
    • Rule synthesis of speech from dyadic units
    • J.P. Olive: Rule synthesis of speech from dyadic units, Proc. ICASSP 77, 568–570 (1977)
    • (1977) Proc. ICASSP , vol.77 , pp. 568-570
    • Olive, J.P.1
  • 68
    • 4544357742 scopus 로고    scopus 로고
    • Formant diphone parameter extraction utilising a labeled single speaker database
    • R. H. Mannell: Formant diphone parameter extraction utilising a labeled single speaker database. In: Proc. ICSLP-98 (1998)
    • (1998) Proc. ICSLP-98
    • Mannell, R.H.1
  • 69
    • 85009156064 scopus 로고    scopus 로고
    • A data-driven approach to source-formant type text-to-speech system
    • H. Mori, T. Ohtsuka, H. Kasuya: A data-driven approach to source-formant type text-to-speech system, ICSLP 2002, 2365–2368 (2002)
    • (2002) ICSLP , vol.2002 , pp. 2365-2368
    • Mori, H.1    Ohtsuka, T.2    Kasuya, H.3
  • 70
    • 84966440972 scopus 로고    scopus 로고
    • Integration of Rule-Based Formant Synthesis and Waveform Concatenation: A Hybrid Approach to Text-to-Speech Synthesis
    • Santa Monica
    • S. Hertz: Integration of Rule-Based Formant Synthesis and Waveform Concatenation: A Hybrid Approach to Text-to-Speech Synthesis, In: Proc. IEEE 2002 Workshop on Speech Synthesis, 11-13, Santa Monica (2002)
    • (2002) Proc. IEEE 2002 Workshop on Speech Synthesis , pp. 11-13
    • Hertz, S.1
  • 71
    • 33645585669 scopus 로고
    • Looking at Speech
    • D. Talkin: Looking at Speech. In: Speech Technology, 74-77 (1989)
    • (1989) Speech Technology , pp. 74-77
    • Talkin, D.1
  • 72
    • 85135264071 scopus 로고    scopus 로고
    • Formant analysis and synthesis using hidden Markov models
    • A. Acero: Formant analysis and synthesis using hidden Markov models, Proc. Eurospeech 99, 1047– 1050 (1999)
    • (1999) Proc. Eurospeech 99 , pp. 1047-1050
    • Acero, A.1
  • 73
    • 27644433406 scopus 로고    scopus 로고
    • Formant tracking using context-dependent phonemic information
    • M. Lee, J. van Santen, B. Möbius, J. Olive: Formant tracking using context-dependent phonemic information, IEEE TSAP 13(5), 741–750 (2005)
    • (2005) IEEE TSAP , vol.13 , Issue.5 , pp. 741-750
    • Lee, M.1    van Santen, J.2    Möbius, B.3    Olive, J.4
  • 74
    • 84928220995 scopus 로고
    • The use of a synthesis-by-rule system in a study of deaf speech
    • A.-M. Öster: The use of a synthesis-by-rule system in a study of deaf speech, STL-QPSR 1/ 1985, 95–107 (1985)
    • (1985) STL-QPSR , vol.1 , Issue.1985 , pp. 95-107
    • Öster, A.-M.1
  • 75
    • 85075956501 scopus 로고
    • Speech synthesis for hearing impaired persons-in research, training and communication
    • B. Granström, A.-M. Öster: Speech synthesis for hearing impaired persons-in research, training and communication, STL/QPSR 2-3/ 94, 93–111 (1994)
    • (1994) STL/QPSR , vol.2-3 , Issue.94 , pp. 93-111
    • Granström, B.1    Öster, A.-M.2
  • 77
    • 0012700837 scopus 로고
    • A communication system for the disabled with emotional synthetic speech produced by rule
    • I. Murray, J. Arnott, N. Alm, A. Newell: A communication system for the disabled with emotional synthetic speech produced by rule, Procs. Eurospeech 91(1), 311–314 (1991)
    • (1991) Procs. Eurospeech , vol.91 , Issue.1 , pp. 311-314
    • Murray, I.1    Arnott, J.2    Alm, N.3    Newell, A.4
  • 79
    • 0002515370 scopus 로고
    • The generation of affect in synthesized speech
    • J. Cahn: The generation of affect in synthesized speech, J. Am. Voice I/O Soc. 8 (1990)
    • (1990) J. Am. Voice I/O Soc , vol.8
    • Cahn, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.