메뉴 건너뛰기




Volumn 113, Issue 4 I, 2003, Pages 2095-2104

Segmental intelligibility of four currently used text-to-speech synthesis methods

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC NOISE; ERROR ANALYSIS; SPEECH; SPEECH INTELLIGIBILITY;

EID: 0037381121     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.1558356     Document Type: Article
Times cited : (18)

References (25)
  • 3
    • 0015112070 scopus 로고
    • Speech analysis and synthesis by linear prediction of the speech wave
    • Atal, B. S., and Hanauer, S. L. (1971). "Speech analysis and synthesis by linear prediction of the speech wave," J. Acoust. Soc. Am. 50, 637-655.
    • (1971) J. Acoust. Soc. Am. , vol.50 , pp. 637-655
    • Atal, B.S.1    Hanauer, S.L.2
  • 5
    • 0027839344 scopus 로고
    • MBR-PSOLA: Text-to-speech synthesis based on an MBE re-synthesis of the segments database
    • Dutoit, T., and Leich, H. (1993). "MBR-PSOLA: Text-to-speech synthesis based on an MBE re-synthesis of the segments database," Speech Commun. 13, 435-440.
    • (1993) Speech Commun. , vol.13 , pp. 435-440
    • Dutoit, T.1    Leich, H.2
  • 6
    • 0028455481 scopus 로고
    • Invariants, specifiers, cues: An investigation of locus equations as information for place of articulation
    • Fowler, C. A. (1994). "Invariants, specifiers, cues: An investigation of locus equations as information for place of articulation," Percept. Psychophys. 55, 597-610.
    • (1994) Percept. Psychophys. , vol.55 , pp. 597-610
    • Fowler, C.A.1
  • 7
    • 84961430477 scopus 로고
    • Synthetic speech intelligibility under several experimental conditions
    • Fucci, D., Reynolds, M. E., Bettagere, R., and Gonzales, M. D. (1995). "Synthetic speech intelligibility under several experimental conditions," Augment. Alt. Commun. 11, 113-117.
    • (1995) Augment. Alt. Commun. , vol.11 , pp. 113-117
    • Fucci, D.1    Reynolds, M.E.2    Bettagere, R.3    Gonzales, M.D.4
  • 8
    • 0002232642 scopus 로고
    • Perception of synthetic speech produced automatically by rule: Intelligibility of eight text-to-speech systems
    • Green, B. G., Logan, J. S., and Pisoni, D. B. (1986). "Perception of synthetic speech produced automatically by rule: Intelligibility of eight text-to-speech systems," Behav. Res. Methods Instrum. Comput. 18, 100-107.
    • (1986) Behav. Res. Methods Instrum. Comput. , vol.18 , pp. 100-107
    • Green, B.G.1    Logan, J.S.2    Pisoni, D.B.3
  • 9
    • 0001562208 scopus 로고
    • Articulation-testing methods: Consonantal differentiation with a closed-response set
    • House, A. S., Williams, C. E., Hecker, M. H. L., and Kryter, K. D. (1965). "Articulation-testing methods: Consonantal differentiation with a closed-response set," J. Acoust. Soc. Am. 37, 158-166.
    • (1965) J. Acoust. Soc. Am. , vol.37 , pp. 158-166
    • House, A.S.1    Williams, C.E.2    Hecker, M.H.L.3    Kryter, K.D.4
  • 10
    • 0242638092 scopus 로고    scopus 로고
    • Speech perception
    • H. Pashler & S. Yantis, edited by H. Pashler and S. Yantis (Wiley, New York)
    • Jusczyk, P. W., and Luce, P. A. (2002). "Speech perception," in H. Pashler & S. Yantis, Stevens' Handbook of Experimental Psychology, 3rd ed., edited by H. Pashler and S. Yantis (Wiley, New York), Vol. 1, pp. 493-536.
    • (2002) Stevens' Handbook of Experimental Psychology, 3rd Ed. , vol.1 , pp. 493-536
    • Jusczyk, P.W.1    Luce, P.A.2
  • 11
    • 0017325947 scopus 로고
    • Development of a test of speech intelligibility in noise using sentence materials with controlled word predictability
    • Kalikow, D. N., Stevens, K. N., and Elliott, L. L. (1977). "Development of a test of speech intelligibility in noise using sentence materials with controlled word predictability," J. Acoust. Soc. Am. 61, 1337-1351.
    • (1977) J. Acoust. Soc. Am. , vol.61 , pp. 1337-1351
    • Kalikow, D.N.1    Stevens, K.N.2    Elliott, L.L.3
  • 12
    • 0023407575 scopus 로고
    • Review of text-to-speech conversion of English
    • Klatt, D. H. (1987). "Review of text-to-speech conversion of English," J. Acoust. Soc. Am. 82, 737-793.
    • (1987) J. Acoust. Soc. Am. , vol.82 , pp. 737-793
    • Klatt, D.H.1
  • 13
    • 0027186268 scopus 로고
    • Segmental intelligibility and speech interference thresholds of high-quality synthetic speech in the presence of noise
    • Koul, R. K., and Allen, G. D. (1993). "Segmental intelligibility and speech interference thresholds of high-quality synthetic speech in the presence of noise," J. Speech Hear. Res. 36, 790-798.
    • (1993) J. Speech Hear. Res. , vol.36 , pp. 790-798
    • Koul, R.K.1    Allen, G.D.2
  • 14
    • 0023830160 scopus 로고
    • Babble and random-noise masking of speech in high and low context cue conditions
    • Lewis, H., Benignus, V. A., Muller, K. E., Malott, C. M., and Barton, C. N. (1988). "Babble and random-noise masking of speech in high and low context cue conditions," J. Speech Hear. Res. 31, 108-114.
    • (1988) J. Speech Hear. Res. , vol.31 , pp. 108-114
    • Lewis, H.1    Benignus, V.A.2    Muller, K.E.3    Malott, C.M.4    Barton, C.N.5
  • 15
    • 0024570198 scopus 로고
    • A specialization for speech perception
    • Liberman, A. M., and Mattingly, I. G. (1989). "A specialization for speech perception," Science 243, 489-494.
    • (1989) Science , vol.243 , pp. 489-494
    • Liberman, A.M.1    Mattingly, I.G.2
  • 16
    • 0024344665 scopus 로고
    • Segmental intelligibility of synthetic speech produced by rule
    • Logan, J. S., Greene, B. G., and Pisoni, D. B. (1989). "Segmental intelligibility of synthetic speech produced by rule," J. Acoust. Soc. Am. 86, 566-581.
    • (1989) J. Acoust. Soc. Am. , vol.86 , pp. 566-581
    • Logan, J.S.1    Greene, B.G.2    Pisoni, D.B.3
  • 17
    • 0025543906 scopus 로고
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • Moulines, E., and Charpentier, F. (1990). "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun. 9, 453-467.
    • (1990) Speech Commun. , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 18
    • 0001705913 scopus 로고
    • Trainability of listening comprehension of speech discourse
    • Orr, D. B., Friedman, H. L., and Williams, J. C. C. (1965). "Trainability of listening comprehension of speech discourse," J. Educ. Psychol. 56, 148-156.
    • (1965) J. Educ. Psychol. , vol.56 , pp. 148-156
    • Orr, D.B.1    Friedman, H.L.2    Williams, J.C.C.3
  • 20
    • 0242722456 scopus 로고
    • Some comparisons of intelligibility of synthetic and natural speech at different speech-to-noise ratios
    • Pisoni, D. B., and Koen, E. (1982). "Some comparisons of intelligibility of synthetic and natural speech at different speech-to-noise ratios," J. Acoust. Soc. Am. Suppl. 1 71, S94.
    • (1982) J. Acoust. Soc. Am. Suppl. 1 , vol.71
    • Pisoni, D.B.1    Koen, E.2
  • 21
    • 0022148789 scopus 로고
    • Perception of synthetic speech generated by rule
    • Pisoni, D. B., Nusbaum, H. C., and Greene, B. G. (1985). "Perception of synthetic speech generated by rule," Proc. IEEE 73, 1665-1676.
    • (1985) Proc. IEEE , vol.73 , pp. 1665-1676
    • Pisoni, D.B.1    Nusbaum, H.C.2    Greene, B.G.3
  • 22
    • 0019508260 scopus 로고
    • Speech perception without traditional cues
    • Remez, R. E., Rubin, P. E., Pisoni, D. B., and Carrell, T. D. (1981). "Speech perception without traditional cues," Science 212, 947-950.
    • (1981) Science , vol.212 , pp. 947-950
    • Remez, R.E.1    Rubin, P.E.2    Pisoni, D.B.3    Carrell, T.D.4
  • 24
    • 0033709105 scopus 로고    scopus 로고
    • On the Implementation of the Harmonic Plus Noise Model for Concatenative Speech Synthesis
    • Paper presented at, Istanbul, Turkey
    • Stylianou, Y. (2000). "On the Implementation of the Harmonic Plus Noise Model for Concatenative Speech Synthesis," Paper presented at ICASSP 2000, Istanbul, Turkey (available at http://www.research.att.com/projects/ tts/papers/2000_ICASSP/fastHNM.pdf).
    • (2000) ICASSP 2000
    • Stylianou, Y.1
  • 25
    • 0000207066 scopus 로고
    • Auditory temporal discrimination by trained listeners
    • Warren, R. M. (1974). "Auditory temporal discrimination by trained listeners," Cogn. Psychol. 6, 237-256.
    • (1974) Cogn. Psychol. , vol.6 , pp. 237-256
    • Warren, R.M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.