메뉴 건너뛰기




Volumn 6, Issue 1 SPEC., 2003, Pages 57-71

Rare events and closed domains: Two delicate concepts in speech synthesis

Author keywords

Linguistic analysis; Low frequency events; Prosody; Speech synthesis; Unit selection

Indexed keywords

DATABASE SYSTEMS; FREQUENCIES; LINGUISTICS; MATHEMATICAL TECHNIQUES; PROBLEM SOLVING; VOCABULARY CONTROL;

EID: 0037236894     PISSN: 13812416     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1021052023237     Document Type: Article
Times cited : (33)

References (67)
  • 1
    • 0012586273 scopus 로고
    • On frequency, transparency and productivity
    • G. Booij and J. van Marle (Eds.), Dordrecht: Kluwer
    • Baayen, H. (1993). On frequency, transparency and productivity. In G. Booij and J. van Marle (Eds.), Yearbook of Morphology 1992. Dordrecht: Kluwer, pp. 181-208.
    • (1993) Yearbook of Morphology 1992 , pp. 181-208
    • Baayen, H.1
  • 3
    • 84936527095 scopus 로고
    • Productivity and English derivation: A corpus based study
    • Baayen, H. and Lieber, R. (1991). Productivity and English derivation: A corpus based study. Linguistics, 29:801-843.
    • (1991) Linguistics , vol.29 , pp. 801-843
    • Baayen, H.1    Lieber, R.2
  • 4
    • 0002703873 scopus 로고
    • Trainable grammars for speech recognition
    • In D. Klatt and J. Wolf (Eds.)
    • Baker, J.K. (1979). Trainable grammars for speech recognition. In D. Klatt and J. Wolf (Eds.), Speech Communication Papers for ASA'79, pp. 547-550.
    • (1979) Speech Communication Papers for ASA'79 , pp. 547-550
    • Baker, J.K.1
  • 5
    • 0001862769 scopus 로고
    • An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes
    • Baum, L.E. (1972). An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes. Inequalities, 3:1-8.
    • (1972) Inequalities , vol.3 , pp. 1-8
    • Baum, L.E.1
  • 6
    • 0000353178 scopus 로고
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • Baum, L., Petrie, T., Soules, G., and Weiss, N. (1970). A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Annals of Mathematical Statistics, 41(1):164-171.
    • (1970) Annals of Mathematical Statistics , vol.41 , Issue.1 , pp. 164-171
    • Baum, L.1    Petrie, T.2    Soules, G.3    Weiss, N.4
  • 11
    • 0012613283 scopus 로고    scopus 로고
    • Non-uniform unit selection and the similarity metric within BT's Laureate TTS system
    • Jenolan Caves, Australia
    • Breen, A.P. and Jackson, P. (1998). Non-uniform unit selection and the similarity metric within BT's Laureate TTS system. Proceedings of the Third International Workshop on Speech Synthesis. Jenolan Caves, Australia, pp. 373-376.
    • (1998) Proceedings of the Third International Workshop on Speech Synthesis , pp. 373-376
    • Breen, A.P.1    Jackson, P.2
  • 12
    • 0001717383 scopus 로고
    • Syllable-based segmental duration
    • In G. Bailly, C. Benoît, and T.R. Sawallis (Eds.), Amsterdam: Elsevier
    • Campbell, W.N. (1992). Syllable-based segmental duration. In G. Bailly, C. Benoît, and T.R. Sawallis (Eds.), Talking Machines: Theories, Models, and Designs. Amsterdam: Elsevier, pp. 211-224.
    • (1992) Talking Machines: Theories, Models, and Designs , pp. 211-224
    • Campbell, W.N.1
  • 15
    • 0032651722 scopus 로고    scopus 로고
    • A hidden Markov-model-based trainable speech synthesizer
    • Donovan, R.E. and Woodland, P.C. (1999). A hidden Markov-model-based trainable speech synthesizer. Computer Speech and Language, 13:223-241.
    • (1999) Computer Speech and Language , vol.13 , pp. 223-241
    • Donovan, R.E.1    Woodland, P.C.2
  • 16
    • 0012583475 scopus 로고    scopus 로고
    • Measuring morphological productivity: Is automatic preprocessing sufficient?
    • In P. Rayson, A. Wilson, T. McEnery, A. Hardie, and S. Khoja (Eds.), Lancaster, UK
    • Evert, S. and Lüdeling, A. (2001). Measuring morphological productivity: Is automatic preprocessing sufficient? In P. Rayson, A. Wilson, T. McEnery, A. Hardie, and S. Khoja (Eds.), Proceedings of the Corpus Linguistics 2001 Conference. Lancaster, UK, pp. 167-175.
    • (2001) Proceedings of the Corpus Linguistics 2001 Conference , pp. 167-175
    • Evert, S.1    Lüdeling, A.2
  • 17
    • 0000803388 scopus 로고
    • The population frequencies of species and the estimation of population parameters
    • Good, I.J. (1953). The population frequencies of species and the estimation of population parameters. Biometrika, 40(3/4):237-264.
    • (1953) Biometrika , vol.40 , Issue.3-4 , pp. 237-264
    • Good, I.J.1
  • 18
    • 85009122462 scopus 로고    scopus 로고
    • A nonlinear unit selection strategy for concatenative speech synthesis based on syllable level features
    • Sydney, Australia
    • Holzapfel, M. and Campbell, N. (1998). A nonlinear unit selection strategy for concatenative speech synthesis based on syllable level features. Proceedings of the International Conference on Spoken Language Processing. Sydney, Australia, vol. 6, pp. 2755-2758.
    • (1998) Proceedings of the International Conference on Spoken Language Processing , vol.6 , pp. 2755-2758
    • Holzapfel, M.1    Campbell, N.2
  • 23
    • 0029386592 scopus 로고
    • Speech segment network approach for an optimal synthesis unit set
    • Iwahashi, N. and Sagisaka, Y. (1995). Speech segment network approach for an optimal synthesis unit set. Computer Speech and Language, 9:335-352.
    • (1995) Computer Speech and Language , vol.9 , pp. 335-352
    • Iwahashi, N.1    Sagisaka, Y.2
  • 26
    • 0023312404 scopus 로고
    • Estimation of probabilities from sparse data for the language model component of a speech recognizen
    • Katz, S.M. (1987). Estimation of probabilities from sparse data for the language model component of a speech recognizen. IEEE Transactions on Acoustics, Speech and Signal Processing, 33(6):400-401.
    • (1987) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.33 , Issue.6 , pp. 400-401
    • Katz, S.M.1
  • 27
    • 0001829989 scopus 로고
    • Tech. Report MS-R8804. Department of Mathematical Statistics, CWI. Amsterdam: Center for Mathematics and Computer Science
    • Khmaladze, E. (1987). The statistical analysis of large number of rare events (Tech. Report MS-R8804). Department of Mathematical Statistics, CWI. Amsterdam: Center for Mathematics and Computer Science.
    • (1987) The Statistical Analysis of Large Number of Rare Events
    • Khmaladze, E.1
  • 29
    • 0015676852 scopus 로고
    • Interaction between two factors that influence vowel duration
    • Klatt, D.H. (1973). Interaction between two factors that influence vowel duration. Journal of the Acoustical Society of America, 54(4):1102-1104.
    • (1973) Journal of the Acoustical Society of America , vol.54 , Issue.4 , pp. 1102-1104
    • Klatt, D.H.1
  • 31
    • 0000178296 scopus 로고    scopus 로고
    • Producing spoken language: A blueprint of the speaker
    • In C.M. Brown and P. Hagoort (Eds.), Oxford, UK: Oxford University Press
    • Levelt, W.J.M. (1999). Producing spoken language: A blueprint of the speaker. In C.M. Brown and P. Hagoort (Eds.), The Neurocognition of Language. Oxford, UK: Oxford University Press, pp. 83-122.
    • (1999) The Neurocognition of Language , pp. 83-122
    • Levelt, W.J.M.1
  • 32
    • 0028417520 scopus 로고
    • Do speakers have access to a mental syllabary?
    • Levelt, W.J.M. and Wheeldon, L. (1994). Do speakers have access to a mental syllabary? Cognition, 50:239-269.
    • (1994) Cognition , vol.50 , pp. 239-269
    • Levelt, W.J.M.1    Wheeldon, L.2
  • 34
    • 84868306093 scopus 로고    scopus 로고
    • On measuring morphological productivity
    • Ilmenau, Germany
    • Lüdeling, A., Evert, S., and Heid, U. (2000). On measuring morphological productivity. Proceedings of KONVENS 2000. Ilmenau, Germany, pp. 57-61.
    • (2000) Proceedings of KONVENS 2000 , pp. 57-61
    • Lüdeling, A.1    Evert, S.2    Heid, U.3
  • 39
    • 0342626572 scopus 로고    scopus 로고
    • The Bell Labs German text-to-speech system
    • Möbius, B. (1999). The Bell Labs German text-to-speech system. Computer Speech and Language, 13:319-358.
    • (1999) Computer Speech and Language , vol.13 , pp. 319-358
    • Möbius, B.1
  • 40
    • 0012530287 scopus 로고    scopus 로고
    • Arbeitspapiere des Instituts für Maschinelle Sprachverarbeitung (Univ. Stuttgart), AIMS
    • Möbius, B. (2001). German and multilingual speech synthesis. Arbeitspapiere des Instituts für Maschinelle Sprachverarbeitung (Univ. Stuttgart), AIMS, 7(4):1-300.
    • (2001) German and Multilingual Speech Synthesis , vol.7 , Issue.4 , pp. 1-300
    • Möbius, B.1
  • 43
    • 0028499480 scopus 로고
    • Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering
    • Nakajima, S. (1994). Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering. Speech Communication, 14:313-324.
    • (1994) Speech Communication , vol.14 , pp. 313-324
    • Nakajima, S.1
  • 46
    • 0012581186 scopus 로고    scopus 로고
    • Arbeitspapiere des Instituts für Maschinelle Sprachverarbeitung (Univ. Stuttgart), AIMS
    • Prescher, D. (2002). EM-basierte maschinelle Lernverfahren für natürliche Sprachen. Arbeitspapiere des Instituts für Maschinelle Sprachverarbeitung (Univ. Stuttgart), AIMS, 8(2): 1-366.
    • (2002) EM-Basierte Maschinelle Lernverfahren für Natürliche Sprachen , vol.8 , Issue.2 , pp. 1-366
    • Prescher, D.1
  • 48
    • 0002069313 scopus 로고
    • Tree-based modeling for speech synthesis
    • In G. Bailly, C. Benoït, and T.R. Sawallis (Eds.), Amsterdam: Elsevier
    • Riley, M.D. (1992). Tree-based modeling for speech synthesis. In G. Bailly, C. Benoït, and T.R. Sawallis (Eds.), Talking Machines: Theories, Models, and Designs. Amsterdam: Elsevier, pp. 265-273.
    • (1992) Talking Machines: Theories, Models, and Designs , pp. 265-273
    • Riley, M.D.1
  • 52
    • 0012532378 scopus 로고
    • Produktiviteit als morfologisch fenomeen
    • Schultink, H. (1961). Produktiviteit als morfologisch fenomeen. Forum der Letteren, 2:110-125.
    • (1961) Forum der Letteren , vol.2 , pp. 110-125
    • Schultink, H.1
  • 53
    • 0003154521 scopus 로고    scopus 로고
    • Duration study for the Bell Laboratories Mandarin text-to-speech system
    • In J. van Santen, R.W. Sproat, J.P. Olive, and J. Hirschberg (Eds.), New York: Springer
    • Shih, C. and Ao, B. (1997). Duration study for the Bell Laboratories Mandarin text-to-speech system. In J. van Santen, R.W. Sproat, J.P. Olive, and J. Hirschberg (Eds.), Progress in Speech Synthesis. New York: Springer, pp. 383-399.
    • (1997) Progress in Speech Synthesis , pp. 383-399
    • Shih, C.1    Ao, B.2
  • 55
    • 0342725926 scopus 로고    scopus 로고
    • A Japanese text-to-speech system based on multi-form units with consideration of frequency distribution in Japanese
    • Budapest, Hungary
    • Tanaka, K., Mizuno, H., Abe, M., and Nakajima, S. (1999). A Japanese text-to-speech system based on multi-form units with consideration of frequency distribution in Japanese. Proceedings of the European Conference on Speech Communication and Technology. Budapest, Hungary, vol. 2, pp. 839-842.
    • (1999) Proceedings of the European Conference on Speech Communication and Technology , vol.2 , pp. 839-842
    • Tanaka, K.1    Mizuno, H.2    Abe, M.3    Nakajima, S.4
  • 56
    • 38249000711 scopus 로고
    • Exploring N-way tables with sums-of-products models
    • van Santen, J.P.H. (1993a). Exploring N-way tables with sums-of-products models. Journal of Mathematical Psychology, 37(3):327-371.
    • (1993) Journal of Mathematical Psychology , vol.37 , Issue.3 , pp. 327-371
    • Van Santen, J.P.H.1
  • 58
    • 0028405296 scopus 로고
    • Assignment of segmental duration in text-to-speech synthesis
    • van Santen, J.P.H. (1994). Assignment of segmental duration in text-to-speech synthesis. Computer Speech and Language, 8:95-128.
    • (1994) Computer Speech and Language , vol.8 , pp. 95-128
    • Van Santen, J.P.H.1
  • 59
    • 0012532379 scopus 로고
    • Computation of timing in text-to-speech synthesis
    • In W.B. Kleijn and K.K. Paliwal (Eds.), Amsterdam: Elsevier
    • van Santen, J.P.H. (1995). Computation of timing in text-to-speech synthesis. In W.B. Kleijn and K.K. Paliwal (Eds.), Speech Coding and Synthesis. Amsterdam: Elsevier, pp. 663-684.
    • (1995) Speech Coding and Synthesis , pp. 663-684
    • Van Santen, J.P.H.1
  • 61
    • 0001569732 scopus 로고    scopus 로고
    • A quantitative model of F0 generation and alignment
    • In A. Botinis (Ed.), Dordrecht: Kluwer
    • van Santen, J.P.H. and Möbius, B. (2000). A quantitative model of F0 generation and alignment. In A. Botinis (Ed.), Intonation-Analysis, Modelling and Technology. Dordrecht: Kluwer, pp. 269-288.
    • (2000) Intonation - Analysis, Modelling and Technology , pp. 269-288
    • Van Santen, J.P.H.1    Möbius, B.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.