SCOPUS 정보 검색 플랫폼

Journal of the Acoustical Society of America

Volumn 113, Issue 4 I, 2003, Pages 2095-2104

Segmental intelligibility of four currently used text-to-speech synthesis methods

a Department of Statistics Center for Statistics and Applications in Forensic Evidence Iowa State University 2438 Osborn Dr Ames (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC NOISE; ERROR ANALYSIS; SPEECH; SPEECH INTELLIGIBILITY;

SEGMENTAL INTELLIGIBILITY;

SPEECH SYNTHESIS;

ACOUSTICS;

ADULT; ARTICLE; AUTOMATION; CODING; DEVICE; E-MAIL; FEMALE; HUMAN; HUMAN EXPERIMENT; MALE; NOISE; NORMAL HUMAN; PHONEME; PHONETICS; PRIORITY JOURNAL; PUBLICATION; SIGNAL NOISE RATIO; SPEECH INTELLIGIBILITY; TECHNIQUE;

ADOLESCENT; ADULT; COMMUNICATION AIDS FOR DISABLED; FEMALE; HUMANS; MALE; MIDDLE AGED; PERCEPTUAL MASKING; PROSTHESIS DESIGN; SOUND SPECTROGRAPHY; SPEECH INTELLIGIBILITY;

EID: 0037381121 PISSN: 00014966 EISSN: None Source Type: Journal
DOI: 10.1121/1.1558356 Document Type: Article

Times cited : (18)

References (25)

1
- 0003724033
- Cambridge University Press, New York
- Allen, J., Hunnicutt, S., and Klatt, D. (1987). From Text To Speech, The MITTALK System (Cambridge University Press, New York).
- (1987) From Text To Speech, The MITTALK System
- Allen, J.¹ Hunnicutt, S.² Klatt, D.³

2
- 0003402909
- American National Standards Institute, New York
- ANSI (1969). ANSI S3.6-1969, "American National Standards Specification for Audiometers" (American National Standards Institute, New York).
- (1969) ANSI S3.6-1969 "American National Standards Specification for Audiometers"

3
- 0015112070
- Speech analysis and synthesis by linear prediction of the speech wave
- Atal, B. S., and Hanauer, S. L. (1971). "Speech analysis and synthesis by linear prediction of the speech wave," J. Acoust. Soc. Am. 50, 637-655.
- (1971) J. Acoust. Soc. Am. , vol.50 , pp. 637-655
- Atal, B.S.¹ Hanauer, S.L.²

4
- 0004315161
- Brooks/Cole, Pacific Grove, CA
- Cohen, B. H. (1996). Explaining Psychological Statistics (Brooks/Cole, Pacific Grove, CA).
- (1996) Explaining Psychological Statistics
- Cohen, B.H.¹

5
- 0027839344
- MBR-PSOLA: Text-to-speech synthesis based on an MBE re-synthesis of the segments database
- Dutoit, T., and Leich, H. (1993). "MBR-PSOLA: Text-to-speech synthesis based on an MBE re-synthesis of the segments database," Speech Commun. 13, 435-440.
- (1993) Speech Commun. , vol.13 , pp. 435-440
- Dutoit, T.¹ Leich, H.²

6
- 0028455481
- Invariants, specifiers, cues: An investigation of locus equations as information for place of articulation
- Fowler, C. A. (1994). "Invariants, specifiers, cues: An investigation of locus equations as information for place of articulation," Percept. Psychophys. 55, 597-610.
- (1994) Percept. Psychophys. , vol.55 , pp. 597-610
- Fowler, C.A.¹

7
- 84961430477
- Synthetic speech intelligibility under several experimental conditions
- Fucci, D., Reynolds, M. E., Bettagere, R., and Gonzales, M. D. (1995). "Synthetic speech intelligibility under several experimental conditions," Augment. Alt. Commun. 11, 113-117.
- (1995) Augment. Alt. Commun. , vol.11 , pp. 113-117
- Fucci, D.¹ Reynolds, M.E.² Bettagere, R.³ Gonzales, M.D.⁴

8
- 0002232642
- Perception of synthetic speech produced automatically by rule: Intelligibility of eight text-to-speech systems
- Green, B. G., Logan, J. S., and Pisoni, D. B. (1986). "Perception of synthetic speech produced automatically by rule: Intelligibility of eight text-to-speech systems," Behav. Res. Methods Instrum. Comput. 18, 100-107.
- (1986) Behav. Res. Methods Instrum. Comput. , vol.18 , pp. 100-107
- Green, B.G.¹ Logan, J.S.² Pisoni, D.B.³

9
- 0001562208
- Articulation-testing methods: Consonantal differentiation with a closed-response set
- House, A. S., Williams, C. E., Hecker, M. H. L., and Kryter, K. D. (1965). "Articulation-testing methods: Consonantal differentiation with a closed-response set," J. Acoust. Soc. Am. 37, 158-166.
- (1965) J. Acoust. Soc. Am. , vol.37 , pp. 158-166
- House, A.S.¹ Williams, C.E.² Hecker, M.H.L.³ Kryter, K.D.⁴

10
- 0242638092
- Speech perception
- H. Pashler & S. Yantis, edited by H. Pashler and S. Yantis (Wiley, New York)
- Jusczyk, P. W., and Luce, P. A. (2002). "Speech perception," in H. Pashler & S. Yantis, Stevens' Handbook of Experimental Psychology, 3rd ed., edited by H. Pashler and S. Yantis (Wiley, New York), Vol. 1, pp. 493-536.
- (2002) Stevens' Handbook of Experimental Psychology, 3rd Ed. , vol.1 , pp. 493-536
- Jusczyk, P.W.¹ Luce, P.A.²

11
- 0017325947
- Development of a test of speech intelligibility in noise using sentence materials with controlled word predictability
- Kalikow, D. N., Stevens, K. N., and Elliott, L. L. (1977). "Development of a test of speech intelligibility in noise using sentence materials with controlled word predictability," J. Acoust. Soc. Am. 61, 1337-1351.
- (1977) J. Acoust. Soc. Am. , vol.61 , pp. 1337-1351
- Kalikow, D.N.¹ Stevens, K.N.² Elliott, L.L.³

12
- 0023407575
- Review of text-to-speech conversion of English
- Klatt, D. H. (1987). "Review of text-to-speech conversion of English," J. Acoust. Soc. Am. 82, 737-793.
- (1987) J. Acoust. Soc. Am. , vol.82 , pp. 737-793
- Klatt, D.H.¹

13
- 0027186268
- Segmental intelligibility and speech interference thresholds of high-quality synthetic speech in the presence of noise
- Koul, R. K., and Allen, G. D. (1993). "Segmental intelligibility and speech interference thresholds of high-quality synthetic speech in the presence of noise," J. Speech Hear. Res. 36, 790-798.
- (1993) J. Speech Hear. Res. , vol.36 , pp. 790-798
- Koul, R.K.¹ Allen, G.D.²

14
- 0023830160
- Babble and random-noise masking of speech in high and low context cue conditions
- Lewis, H., Benignus, V. A., Muller, K. E., Malott, C. M., and Barton, C. N. (1988). "Babble and random-noise masking of speech in high and low context cue conditions," J. Speech Hear. Res. 31, 108-114.
- (1988) J. Speech Hear. Res. , vol.31 , pp. 108-114
- Lewis, H.¹ Benignus, V.A.² Muller, K.E.³ Malott, C.M.⁴ Barton, C.N.⁵

15
- 0024570198
- A specialization for speech perception
- Liberman, A. M., and Mattingly, I. G. (1989). "A specialization for speech perception," Science 243, 489-494.
- (1989) Science , vol.243 , pp. 489-494
- Liberman, A.M.¹ Mattingly, I.G.²

16
- 0024344665
- Segmental intelligibility of synthetic speech produced by rule
- Logan, J. S., Greene, B. G., and Pisoni, D. B. (1989). "Segmental intelligibility of synthetic speech produced by rule," J. Acoust. Soc. Am. 86, 566-581.
- (1989) J. Acoust. Soc. Am. , vol.86 , pp. 566-581
- Logan, J.S.¹ Greene, B.G.² Pisoni, D.B.³

17
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- Moulines, E., and Charpentier, F. (1990). "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun. 9, 453-467.
- (1990) Speech Commun. , vol.9 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

18
- 0001705913
- Trainability of listening comprehension of speech discourse
- Orr, D. B., Friedman, H. L., and Williams, J. C. C. (1965). "Trainability of listening comprehension of speech discourse," J. Educ. Psychol. 56, 148-156.
- (1965) J. Educ. Psychol. , vol.56 , pp. 148-156
- Orr, D.B.¹ Friedman, H.L.² Williams, J.C.C.³

19
- 0003522449
- Addison-Wesley, Reading, MA
- O'Shaughnessy, D. (1987). Speech Communication: Human and Machine (Addison-Wesley, Reading, MA).
- (1987) Speech Communication: Human and Machine
- O'Shaughnessy, D.¹

20
- 0242722456
- Some comparisons of intelligibility of synthetic and natural speech at different speech-to-noise ratios
- Pisoni, D. B., and Koen, E. (1982). "Some comparisons of intelligibility of synthetic and natural speech at different speech-to-noise ratios," J. Acoust. Soc. Am. Suppl. 1 71, S94.
- (1982) J. Acoust. Soc. Am. Suppl. 1 , vol.71
- Pisoni, D.B.¹ Koen, E.²

21
- 0022148789
- Perception of synthetic speech generated by rule
- Pisoni, D. B., Nusbaum, H. C., and Greene, B. G. (1985). "Perception of synthetic speech generated by rule," Proc. IEEE 73, 1665-1676.
- (1985) Proc. IEEE , vol.73 , pp. 1665-1676
- Pisoni, D.B.¹ Nusbaum, H.C.² Greene, B.G.³

22
- 0019508260
- Speech perception without traditional cues
- Remez, R. E., Rubin, P. E., Pisoni, D. B., and Carrell, T. D. (1981). "Speech perception without traditional cues," Science 212, 947-950.
- (1981) Science , vol.212 , pp. 947-950
- Remez, R.E.¹ Rubin, P.E.² Pisoni, D.B.³ Carrell, T.D.⁴

23
- 33646629866
- Sproat, R. M., Ostendorf, M., and Hunt, A., eds., (1999). "The need for increased speech synthesis research" (a report of the 1998 NSF workshop for discussing research priorities and evaluation strategies in speech synthesis). (Available at http://cslu.cse.ogi.edu/publications/).
- (1999) "The Need for Increased Speech Synthesis Research" (A Report of the 1998 NSF Workshop for Discussing Research Priorities and Evaluation Strategies in Speech Synthesis)
- Sproat, R.M.¹ Ostendorf, M.² Hunt, A.³

24
- 0033709105
- On the Implementation of the Harmonic Plus Noise Model for Concatenative Speech Synthesis
- Paper presented at, Istanbul, Turkey
- Stylianou, Y. (2000). "On the Implementation of the Harmonic Plus Noise Model for Concatenative Speech Synthesis," Paper presented at ICASSP 2000, Istanbul, Turkey (available at http://www.research.att.com/projects/ tts/papers/2000_ICASSP/fastHNM.pdf).
- (2000) ICASSP 2000
- Stylianou, Y.¹

25
- 0000207066
- Auditory temporal discrimination by trained listeners
- Warren, R. M. (1974). "Auditory temporal discrimination by trained listeners," Cogn. Psychol. 6, 237-256.
- (1974) Cogn. Psychol. , vol.6 , pp. 237-256
- Warren, R.M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.