메뉴 건너뛰기




Volumn 19, Issue 2, 2005, Pages 129-146

On-line experimental methods to evaluate text-to-speech (TTS) synthesis: Effects of voice gender and signal quality on intelligibility, naturalness and preference

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC WAVE INTERFERENCE; AUTOMATION; COMPUTATIONAL LINGUISTICS; SIGNAL PROCESSING; SPEECH; SPEECH COMMUNICATION;

EID: 10844288683     PISSN: 08852308     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.csl.2004.03.003     Document Type: Article
Times cited : (37)

References (32)
  • 1
    • 0003443729 scopus 로고
    • The Celex lexical database (Release 2)
    • [CD-ROM]. University of Pennsylvania [Distributor], Philadelphia, PA
    • Baayen, R.H., Piepenbrock, R., Gulikers, L., 1995. The Celex lexical database (Release 2) [CD-ROM]. Linguistic Data Consortium, University of Pennsylvania [Distributor], Philadelphia, PA.
    • (1995) Linguistic Data Consortium
    • Baayen, R.H.1    Piepenbrock, R.2    Gulikers, L.3
  • 2
    • 0037232539 scopus 로고    scopus 로고
    • Close shadowing natural versus synthetic speech
    • Bailly, G., 2002. Close shadowing natural versus synthetic speech. Int. J. Speech Technol. 6, 11-19.
    • (2002) Int. J. Speech Technol. , vol.6 , pp. 11-19
    • Bailly, G.1
  • 4
    • 0030166343 scopus 로고    scopus 로고
    • The SUS test: A method for the assessment of text-to-speech synthesis intelligibility using Semantically Unpredictable Sentences
    • Benoit, C., Grice, M., Hazan, V., 1996. The SUS test: A method for the assessment of text-to-speech synthesis intelligibility using Semantically Unpredictable Sentences. Speech Commun. 18, 381-392.
    • (1996) Speech Commun. , vol.18 , pp. 381-392
    • Benoit, C.1    Grice, M.2    Hazan, V.3
  • 5
    • 0025158381 scopus 로고
    • Auditory search using vowel sounds
    • Charleston, D.E., Boyer, R.W., 1990. Auditory search using vowel sounds. Perc. Motor Skills 70, 1289-1290.
    • (1990) Perc. Motor Skills , vol.70 , pp. 1289-1290
    • Charleston, D.E.1    Boyer, R.W.2
  • 6
    • 0037685576 scopus 로고    scopus 로고
    • Concepts of ecological validity: Their differing implications for comparative cognitive research
    • Cole, M., Engestroem, Y. (Eds.). Cambridge University Press, New York
    • Cole, M., Hood, L., McDermott, R.P., 1997. Concepts of ecological validity: their differing implications for comparative cognitive research. In: Cole, M., Engestroem, Y. (Eds.), Mind, Culture and Activity: Seminal Papers from the Laboratory of Comparative Human Cognition. Cambridge University Press, New York, pp. 49-56.
    • (1997) Mind, Culture and Activity: Seminal Papers from the Laboratory of Comparative Human Cognition , pp. 49-56
    • Cole, M.1    Hood, L.2    McDermott, R.P.3
  • 8
    • 0002014444 scopus 로고
    • Phoneme-monitoring reaction time as a function of preceding intonation contour
    • Cutler, A., 1976. Phoneme-monitoring reaction time as a function of preceding intonation contour. Perc. Psychophys. 20, 55-60.
    • (1976) Perc. Psychophys , vol.20 , pp. 55-60
    • Cutler, A.1
  • 9
    • 0014090556 scopus 로고
    • Auditory search for syllables embedded within meaningful sentences
    • Davis, J., 1967. Auditory search for syllables embedded within meaningful sentences. JASA 41, 1277-1282.
    • (1967) JASA , vol.41 , pp. 1277-1282
    • Davis, J.1
  • 10
    • 0032070590 scopus 로고    scopus 로고
    • Cognitive factors in the evaluation of synthetic speech
    • Delogu, C., Conte, S., Sementina, C., 1998. Cognitive factors in the evaluation of synthetic speech. Speech Commun. 24, 153-168.
    • (1998) Speech Commun. , vol.24 , pp. 153-168
    • Delogu, C.1    Conte, S.2    Sementina, C.3
  • 11
    • 0037377313 scopus 로고    scopus 로고
    • To mix or not to mix synthetic speech and human speech? Contrasting impact on judge-rated task performance versus self-rated performance and attitudinal responses
    • Gong, L., Lai, J., 2003. To mix or not to mix synthetic speech and human speech? Contrasting impact on judge-rated task performance versus self-rated performance and attitudinal responses. Int. J. Speech Technol. 6, 123-131.
    • (2003) Int. J. Speech Technol. , vol.6 , pp. 123-131
    • Gong, L.1    Lai, J.2
  • 12
    • 0002232642 scopus 로고
    • Perception of synthetic speech produced automatically by rule: Intelligibility of eight text-to-speech systems
    • Greene, B.G., Logan, J.S., Pisoni, D.B., 1986. Perception of synthetic speech produced automatically by rule: intelligibility of eight text-to-speech systems. Behav. Res. Methods, Instruments, Computers 18, 100-107.
    • (1986) Behav. Res. Methods, Instruments, Computers , vol.18 , pp. 100-107
    • Greene, B.G.1    Logan, J.S.2    Pisoni, D.B.3
  • 13
    • 84873287838 scopus 로고    scopus 로고
    • Perception of band-specific speech quality distortions: Detection and pairwise comparison
    • Hansen, M., Kollmeier, B., 1998. Perception of band-specific speech quality distortions: detection and pairwise comparison. Acustica 86, 24.
    • (1998) Acustica , vol.86 , pp. 24
    • Hansen, M.1    Kollmeier, B.2
  • 14
    • 0031821878 scopus 로고    scopus 로고
    • DECTalk and MacinTalk speech synthesisers: Intelligibility differences for three listener groups
    • Hustad, K.C., Kent, R.D., Beukelman, D., 1998. DECTalk and MacinTalk speech synthesisers: intelligibility differences for three listener groups. J. Speech, Lang. Hearing Res. 41, 744-752.
    • (1998) J. Speech, Lang. Hearing Res. , vol.41 , pp. 744-752
    • Hustad, K.C.1    Kent, R.D.2    Beukelman, D.3
  • 15
  • 16
    • 0017325947 scopus 로고
    • Development of a test of speech intelligibility in noise using sentence materials with controlled word predictability
    • Kalikow, D.N., Stevens, K.N., Elliott, L.L., 1977. Development of a test of speech intelligibility in noise using sentence materials with controlled word predictability. JASA 61, 1337-1351.
    • (1977) JASA , vol.61 , pp. 1337-1351
    • Kalikow, D.N.1    Stevens, K.N.2    Elliott, L.L.3
  • 17
    • 0027186268 scopus 로고
    • Segmental intelligibility and speech interference thresholds of high quality synthetic speech in presence of noise
    • Koul, R.K., Allen, G.D., 1993. Segmental intelligibility and speech interference thresholds of high quality synthetic speech in presence of noise. J. Speech Hearing Res. 36, 790-798.
    • (1993) J. Speech Hearing Res. , vol.36 , pp. 790-798
    • Koul, R.K.1    Allen, G.D.2
  • 18
    • 0015438571 scopus 로고
    • Search time as a function of context letter frequency
    • Latimer, C.R., 1972. Search time as a function of context letter frequency. Perception 1, 57-71.
    • (1972) Perception , vol.1 , pp. 57-71
    • Latimer, C.R.1
  • 19
    • 0037236894 scopus 로고    scopus 로고
    • Rare events and closed domains: Two delicate concepts in speech synthesis
    • Möbius, B., 2003. Rare events and closed domains: two delicate concepts in speech synthesis. Int. J. Speech Technol. 6, 57-71.
    • (2003) Int. J. Speech Technol. , vol.6 , pp. 57-71
    • Möbius, B.1
  • 20
    • 0037236540 scopus 로고    scopus 로고
    • A metrical model of prosody for multilingual TTS
    • Monaghan, A.I.C., 2003. A metrical model of prosody for multilingual TTS. Int. J. Speech Technol. 6, 73-81.
    • (2003) Int. J. Speech Technol. , vol.6 , pp. 73-81
    • Monaghan, A.I.C.1
  • 21
    • 85006544659 scopus 로고    scopus 로고
    • Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency-attraction
    • Nass, C., Lee, K., 2001. Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency-attraction. J. Exp. Psychol.: App. 7, 171-181.
    • (2001) J. Exp. Psychol.: App. , vol.7 , pp. 171-181
    • Nass, C.1    Lee, K.2
  • 22
    • 0037376648 scopus 로고    scopus 로고
    • Speech-based disclosure systems: Effects of modality, gender of prompt, and gender of user
    • Nass, C., Robles, E., Heenan, C., Bienstock, H., Treinen, M., 2003. Speech-based disclosure systems: effects of modality, gender of prompt, and gender of user. Int. J. Speech Technol. 6, 113-121.
    • (2003) Int. J. Speech Technol. , vol.6 , pp. 113-121
    • Nass, C.1    Robles, E.2    Heenan, C.3    Bienstock, H.4    Treinen, M.5
  • 25
    • 0000896983 scopus 로고
    • Perception and comprehension of speech
    • Syrdal, A.K., Bennet, R., Greenspan, S. (Eds.). CRC Press, Boca Raton, FL
    • Ralston, J.V., Pisoni, D.B., Mullennix, J.W., 1995. Perception and comprehension of speech. In: Syrdal, A.K., Bennet, R., Greenspan, S. (Eds.), Applied Speech Technology. CRC Press, Boca Raton, FL, pp. 233-288.
    • (1995) Applied Speech Technology , pp. 233-288
    • Ralston, J.V.1    Pisoni, D.B.2    Mullennix, J.W.3
  • 26
    • 0141435237 scopus 로고    scopus 로고
    • The future of cognitive psychology?
    • Solso, R.L. (Ed.). MIT Press, Cambridge, MA
    • Roediger, H.L., 1997. The future of cognitive psychology? In: Solso, R.L. (Ed.), Mind and Brain Sciences in the 21st Century. MIT Press, Cambridge, MA, pp. 175-198.
    • (1997) Mind and Brain Sciences in the 21st Century , pp. 175-198
    • Roediger, H.L.1
  • 27
  • 29
    • 0037381121 scopus 로고    scopus 로고
    • Segmental intelligibility of four currently used text-to-speech synthesis methods
    • Venkatagiri, H.S., 2003. Segmental intelligibility of four currently used text-to-speech synthesis methods. J. Acoust. Soc. Am. 113, 2095-2104.
    • (2003) J. Acoust. Soc. Am. , vol.113 , pp. 2095-2104
    • Venkatagiri, H.S.1
  • 31
    • 0020588285 scopus 로고
    • Evaluating processed speech using the Diagnostic Rhyme Test
    • Voiers, W.D., 1983. Evaluating processed speech using the Diagnostic Rhyme Test. Speech Technol. (Jan/Feb), 30-39.
    • (1983) Speech Technol. , Issue.JAN-FEB , pp. 30-39
    • Voiers, W.D.1
  • 32
    • 0038489847 scopus 로고
    • Orders for the presentation of pairs in the method of paired comparison
    • Wherry, R.J., 1938. Orders for the presentation of pairs in the method of paired comparison. J. Exp. Psychol. 23, 651-660.
    • (1938) J. Exp. Psychol. , vol.23 , pp. 651-660
    • Wherry, R.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.