Volume 6, Issue 2, 2003, Pages 161-182

Expanding the MOS: Development and psychometric evaluation of the MOS-R and MOS-X

Author keywords

Mean Opinion Scale (MOS); Psychometric evaluation; Subjective assessment of synthetic speech

Indexed keywords

RELIABILITY; SENSITIVITY ANALYSIS; SPEECH SYNTHESIS

EID: 0037376750     PISSN: 13812416     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1022390615396     Document Type: Article
Times cited: 41

References (63)
  • 2
    • 0012393512
    • Vocal types and stereotypes: Joint effects of vocal attractiveness and vocal maturity on person perception
    • Berry, D. (1992). Vocal types and stereotypes: Joint effects of vocal attractiveness and vocal maturity on person perception. Journal of Nonverbal Behavior, 16:41-45.
    • (1992) Journal of Nonverbal Behavior , vol.16 , pp. 41-45
    • Berry, D.1
  • 3
    • 0033474703
    • The influence of nasality of voice on sex-stereotyped perceptions
    • Bloom, K., Zajac, D., and Titus, J. (1999). The influence of nasality of voice on sex-stereotyped perceptions. Journal of Nonverbal Behavior, 23:271-281.
    • (1999) Journal of Nonverbal Behavior , vol.23 , pp. 271-281
    • Bloom, K.1    Zajac, D.2    Titus, J.3
  • 4
    • 0030379379
    • Intelligibility of normal speech I: Global and fine-grained acoustic-phonetic talker characteristics
    • Bradlow, A., Torretta, G., and Pisoni, D. (1996). Intelligibility of normal speech I: Global and fine-grained acoustic-phonetic talker characteristics. Speech Communication, 20:255-272.
    • (1996) Speech Communication , vol.20 , pp. 255-272
    • Bradlow, A.1    Torretta, G.2    Pisoni, D.3
  • 5
    • 0015842321
    • Perceptions of personality from speech: Effects of manipulations of acoustical parameters
    • Brown, B., Strong, W., and Rencher, A. (1973). Perceptions of personality from speech: Effects of manipulations of acoustical parameters. Journal of the Acoustical Society of America, 54:29-35.
    • (1973) Journal of the Acoustical Society of America , vol.54 , pp. 29-35
    • Brown, B.1    Strong, W.2    Rencher, A.3
  • 7
  • 8
    • 84973767026
    • Determining the number of common factors in factor analysis: A review and program
    • Coovert, M.D. and McNelis, K. (1988). Determining the number of common factors in factor analysis: A review and program. Educational and Psychological Measurement, 48:687-693.
    • (1988) Educational and Psychological Measurement , vol.48 , pp. 687-693
    • Coovert, M.D.1    McNelis, K.2
  • 10
    • 0001782969
    • Evaluating the quality of synthetic speech
    • D. Gardner-Bonneau (Ed.), Boston, MA: Kluwer
    • Francis, A.L. and Nusbaum, H.C. (1999). Evaluating the quality of synthetic speech. In D. Gardner-Bonneau (Ed.), Human Factors and Voice Interactive Systems. Boston, MA: Kluwer, pp. 63-97.
    • (1999) Human Factors and Voice Interactive Systems , pp. 63-97
    • Francis, A.L.1    Nusbaum, H.C.2
  • 11
    • 0029292169
    • Classification of methods used for assessment of text-to-speech systems according to the demands placed on the listener
    • Goldstein, M. (1995). Classification of methods used for assessment of text-to-speech systems according to the demands placed on the listener. Speech Communication, 16:225-244.
    • (1995) Speech Communication , vol.16 , pp. 225-244
    • Goldstein, M.1
  • 12
    • 0031169046
    • Effects of synthetic speech, gender, and perceived similarity on attitudes toward the augmented communicator
    • Gorenflo, D. and Gorenflo, C. (1997). Effects of synthetic speech, gender, and perceived similarity on attitudes toward the augmented communicator. AAC: Augmentative and Alternative Communication, 13:87-91.
    • (1997) AAC: Augmentative and Alternative Communication , vol.13 , pp. 87-91
    • Gorenflo, D.1    Gorenflo, C.2
  • 13
    • 0001450940
    • Neglected dimensions in speech synthesis
    • Granstrom, B. and Nord, L. (1992). Neglected dimensions in speech synthesis. Speech Communication, 11:459-462.
    • (1992) Speech Communication , vol.11 , pp. 459-462
    • Granstrom, B.1    Nord, L.2
  • 14
    • 0002232642
    • Perception of synthetic speech produced automatically by rule: Intelligibility of eight text-to-speech systems
    • Greene, B., Logan, J., and Pisoni, D. (1986). Perception of synthetic speech produced automatically by rule: Intelligibility of eight text-to-speech systems. Behavior Research Methods, Instruments, and Computers, 18: 100-107.
    • (1986) Behavior Research Methods, Instruments, and Computers , vol.18 , pp. 100-107
    • Greene, B.1    Logan, J.2    Pisoni, D.3
  • 15
    • 0031296638
    • Preliminary study of relations between physical characteristics and psychological impressions of natural voices
    • Hieda, I. and Kuchinomachi, Y. (1997). Preliminary study of relations between physical characteristics and psychological impressions of natural voices. Perceptual and Motor Skills, 85:1483-1491.
    • (1997) Perceptual and Motor Skills , vol.85 , pp. 1483-1491
    • Hieda, I.1    Kuchinomachi, Y.2
  • 16
    • 0033065909
    • Acoustical-perceptual correlates of 'whisper pitch' in synthetically generated vowels
    • Higashikawa, M. and Minifie, F. (1999). Acoustical-perceptual correlates of 'whisper pitch' in synthetically generated vowels. Journal of Speech, Language, and Hearing Research, 42:583-591.
    • (1999) Journal of Speech, Language, and Hearing Research , vol.42 , pp. 583-591
    • Higashikawa, M.1    Minifie, F.2
  • 17
    • 0023888984
    • Perception of aperiodicities in synthetically generated voices
    • Hillenbrand, J. (1988). Perception of aperiodicities in synthetically generated voices. Journal of the Acoustical Society of America, 85:2361-2371.
    • (1988) Journal of the Acoustical Society of America , vol.85 , pp. 2361-2371
    • Hillenbrand, J.1
  • 18
    • 0026716744
    • Effects of speech output type, message length, and reauditorization on perceptions of the communicative competence of an adult AAC user
    • Hoag, L. and Bedrosian, J. (1992). Effects of speech output type, message length, and reauditorization on perceptions of the communicative competence of an adult AAC user. Journal of Speech and Hearing Research, 35:1363-1366.
    • (1992) Journal of Speech and Hearing Research , vol.35 , pp. 1363-1366
    • Hoag, L.1    Bedrosian, J.2
  • 20
    • 84986412082
    • The evaluative consequences of hedges, hesitations, and intensifiers: Powerful and powerless speech styles
    • Hosman, L. (1989). The evaluative consequences of hedges, hesitations, and intensifiers: Powerful and powerless speech styles. Human Communication Research, 15:383-406.
    • (1989) Human Communication Research , vol.15 , pp. 383-406
    • Hosman, L.1
  • 22
    • 0029735211
    • Beyond intelligibility: The performance of text-to-speech synthesisers
    • Johnston, R.D. (1996). Beyond intelligibility: The performance of text-to-speech synthesisers. BT Technology Journal, 14:100-111.
    • (1996) BT Technology Journal , vol.14 , pp. 100-111
    • Johnston, R.D.1
  • 24
    • 0025321354
    • Analysis, synthesis, and perception of voice quality variations among female and male talkers
    • Klatt, D. and Klatt, L. (1990). Analysis, synthesis, and perception of voice quality variations among female and male talkers. Journal of the Acoustical Society of America, 87:820-857.
    • (1990) Journal of the Acoustical Society of America , vol.87 , pp. 820-857
    • Klatt, D.1    Klatt, L.2
  • 25
    • 0026939612
    • The role of focus words in natural and in synthetic continuous speech: Acoustic aspects
    • Koopmans-Van Beinum, F. (1992). The role of focus words in natural and in synthetic continuous speech: Acoustic aspects. Speech Communication, 11:439-452.
    • (1992) Speech Communication , vol.11 , pp. 439-452
    • Koopmans-Van Beinum, F.1
  • 26
    • 0000205331
    • Quality evaluation of five German speech synthesis systems
    • Kraft, V. and Portele, T. (1995). Quality evaluation of five German speech synthesis systems. Acta Acustica, 3:351-365.
    • (1995) Acta Acustica , vol.3 , pp. 351-365
    • Kraft, V.1    Portele, T.2
  • 27
    • 0001581203
    • Research methods in human-computer interaction
    • M. Helander (Ed.), New York: Elsevier
    • Landauer, T.K. (1988). Research methods in human-computer interaction. In M. Helander (Ed.), Handbook of Human-Computer Interaction. New York: Elsevier.
    • (1988) Handbook of Human-Computer Interaction
    • Landauer, T.K.1
  • 28
    • 0033883193
    • The effects of acoustic modifications on the identification of familiar voices speaking isolated vowels
    • Lavner, Y., Gath, I., and Rosenhouse, J. (2000). The effects of acoustic modifications on the identification of familiar voices speaking isolated vowels. Speech Communication, 30:9-26.
    • (2000) Speech Communication , vol.30 , pp. 9-26
    • Lavner, Y.1    Gath, I.2    Rosenhouse, J.3
  • 29
    • 21344487368
    • Multipoint scales: Mean and median differences and observed significance levels
    • Lewis, J.R. (1993). Multipoint scales: Mean and median differences and observed significance levels. International Journal of Human-Computer Interaction, 5:383-392.
    • (1993) International Journal of Human-Computer Interaction , vol.5 , pp. 383-392
    • Lewis, J.R.1
  • 32
    • 0026717068
    • Stuttering and speech naturalness: Audio and audiovisual judgments
    • Martin, R. and Haroldson, S. (1992). Stuttering and speech naturalness: Audio and audiovisual judgments. Journal of Speech and Hearing Research, 35:521-528.
    • (1992) Journal of Speech and Hearing Research , vol.35 , pp. 521-528
    • Martin, R.1    Haroldson, S.2
  • 33
    • 0000954082
    • Perceiving affect from the voice and the face
    • Massaro, D. and Egan, P. (1996). Perceiving affect from the voice and the face. Psychonomic Bulletin & Review, 3:215-221.
    • (1996) Psychonomic Bulletin & Review , vol.3 , pp. 215-221
    • Massaro, D.1    Egan, P.2
  • 34
    • 0027670653
    • Beyond personality: Effects of physical and vocal attractiveness on false consensus, social comparison, affiliation, and assumed and perceived personality
    • Miyake, K. and Zuckerman, M. (1993). Beyond personality: Effects of physical and vocal attractiveness on false consensus, social comparison, affiliation, and assumed and perceived personality. Journal of Personality, 61:411-437.
    • (1993) Journal of Personality , vol.61 , pp. 411-437
    • Miyake, K.1    Zuckerman, M.2
  • 35
    • 0035342286
    • Auditory assessment of synthesized speech in application scenarios: Two case studies
    • Moller, S., Jekosch, U., Mersdorf, J., and Kraft, V. (2001). Auditory assessment of synthesized speech in application scenarios: Two case studies. Speech Communication, 34:229-246.
    • (2001) Speech Communication , vol.34 , pp. 229-246
    • Moller, S.1    Jekosch, U.2    Mersdorf, J.3    Kraft, V.4
  • 37
    • 0027447292
    • Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion
    • Murray, I. and Arnott, J. (1993). Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. Journal of the Acoustical Society of America, 93:1097-1108.
    • (1993) Journal of the Acoustical Society of America , vol.93 , pp. 1097-1108
    • Murray, I.1    Arnott, J.2
  • 38
    • 0029325035
    • Implementation and testing of a system for producing emotion-by-rule in synthetic speech
    • Murray, I. and Arnott, J. (1995). Implementation and testing of a system for producing emotion-by-rule in synthetic speech. Speech Communication, 16:369-390.
    • (1995) Speech Communication , vol.16 , pp. 369-390
    • Murray, I.1    Arnott, J.2
  • 39
    • 0030291449
    • Emotional stress in synthetic speech: Progress and future directions
    • Murray, I., Arnott, J., and Rohwer, E. (1996). Emotional stress in synthetic speech: Progress and future directions. Speech Communication, 20:85-91.
    • (1996) Speech Communication , vol.20 , pp. 85-91
    • Murray, I.1    Arnott, J.2    Rohwer, E.3
  • 41
    • 0002578029
    • Paralanguage and the interpersonal impact of dysphoria: It's not what you say but how you say it
    • Paddock, J. and Nowicki, S. (1986). Paralanguage and the interpersonal impact of dysphoria: It's not what you say but how you say it. Social Behavior and Personality, 14:29-44.
    • (1986) Social Behavior and Personality , vol.14 , pp. 29-44
    • Paddock, J.1    Nowicki, S.2
  • 42
    • 84925917221
    • The effect of voice volume on the perception of personality
    • Page, R. and Balloun, J. (1978). The effect of voice volume on the perception of personality. Journal of Social Psychology, 105:65-72.
    • (1978) Journal of Social Psychology , vol.105 , pp. 65-72
    • Page, R.1    Balloun, J.2
  • 43
    • 0034529786
    • Linguistic cues and memory for synthetic and natural speech
    • Paris, C.R., Thomas, M.H., Gilson, R.D., and Kincaid, J.P. (2000). Linguistic cues and memory for synthetic and natural speech. Human Factors, 42:421-431.
    • (2000) Human Factors , vol.42 , pp. 421-431
    • Paris, C.R.1    Thomas, M.H.2    Gilson, R.D.3    Kincaid, J.P.4
  • 45
    • 0012433827
    • Perception of synthetic speech
    • J. van Santen, R. Sproat, J. Olive, and J. Hirschberg (Eds.), New York: Springer
    • Pisoni, D. (1997). Perception of synthetic speech. In J. van Santen, R. Sproat, J. Olive, and J. Hirschberg (Eds.), Progress in Speech Synthesis. New York: Springer, pp. 541-560.
    • (1997) Progress in Speech Synthesis , pp. 541-560
    • Pisoni, D.1
  • 46
    • 0012430075
    • A structured way of looking at the performance of text-to-speech systems
    • J. van Santen, R. Sproat, J. Olive, and J. Hirschberg (Eds.), New York: Springer
    • Pols, L. and Jekosch, U. (1997). A structured way of looking at the performance of text-to-speech systems. In J. van Santen, R. Sproat, J. Olive, and J. Hirschberg (Eds.), Progress in Speech Synthesis. New York: Springer, pp. 519-528.
    • (1997) Progress in Speech Synthesis , pp. 519-528
    • Pols, L.1    Jekosch, U.2
  • 47
    • 0031071430
    • Toward a prominence-based synthesis system
    • Portele, T. and Heuft, B. (1997). Toward a prominence-based synthesis system. Speech Communication, 21:61-72.
    • (1997) Speech Communication , vol.21 , pp. 61-72
    • Portele, T.1    Heuft, B.2
  • 48
    • 0030182579
    • MOS and pair comparison combined methods for quality evaluation of text to speech systems
    • Salza, P.L., Foti, E., Nebbia, L., and Oreglia, M. (1996). MOS and pair comparison combined methods for quality evaluation of text to speech systems. Acta Acustica, 82:650-656.
    • (1996) Acta Acustica , vol.82 , pp. 650-656
    • Salza, P.L.1    Foti, E.2    Nebbia, L.3    Oreglia, M.4
  • 49
    • 0012480847
    • Intelligibility and acceptability testing for speech technology
    • A. Syrdal, R. Bennett, and S. Greenspan (Eds.), Boca Raton: CRC Press
    • Schmidt-Nielsen, A. (1995). Intelligibility and acceptability testing for speech technology. In A. Syrdal, R. Bennett, and S. Greenspan (Eds.), Applied Speech Technology. Boca Raton: CRC Press.
    • (1995) Applied Speech Technology
    • Schmidt-Nielsen, A.1
  • 51
    • 0032181249
    • PURR - A method for prosody evaluation and investigation
    • Sonntag, G.P. and Portele, T. (1998). PURR - A method for prosody evaluation and investigation. Computer Speech and Language, 12:437-451.
    • (1998) Computer Speech and Language , vol.12 , pp. 437-451
    • Sonntag, G.P.1    Portele, T.2
  • 52
    • 85135255291
    • Comparative evaluation of six German TTS systems
    • Budapest: Technical University of Budapest
    • Sonntag, G.P., Portele, T., Haas, F., and Kohler, J. (1999). Comparative evaluation of six German TTS systems. Eurospeech '99. Budapest: Technical University of Budapest, pp. 251-254.
    • (1999) Eurospeech '99 , pp. 251-254
    • Sonntag, G.P.1    Portele, T.2    Haas, F.3    Kohler, J.4
  • 53
    • 0022297308
    • Effects of speech rate and pitch contour on the perception of synthetic speech
    • Slowiaczek, L. and Nusbaum, H. (1985). Effects of speech rate and pitch contour on the perception of synthetic speech. Human Factors, 27:701-712.
    • (1985) Human Factors , vol.27 , pp. 701-712
    • Slowiaczek, L.1    Nusbaum, H.2
  • 54
    • 0033505120
    • The persuasiveness of synthetic speech versus human speech
    • Stern, S., Mullennix, J., Dyson, C., and Wilson, S. (1999). The persuasiveness of synthetic speech versus human speech. Human Factors, 41:588-595.
    • (1999) Human Factors , vol.41 , pp. 588-595
    • Stern, S.1    Mullennix, J.2    Dyson, C.3    Wilson, S.4
  • 57
    • 0012428630
    • Intelligibility and acceptability of short phrases generated by embedded text-to-speech engines
    • Mahwah, NJ: Lawrence Erlbaum
    • Wang, H. and Lewis, J.R. (2001). Intelligibility and acceptability of short phrases generated by embedded text-to-speech engines. In Proceedings of HCI International 2001: Usability Evaluation and Interface Design. Mahwah, NJ: Lawrence Erlbaum, pp. 144-148.
    • (2001) Proceedings of HCI International 2001: Usability Evaluation and Interface Design , pp. 144-148
    • Wang, H.1    Lewis, J.R.2
  • 58
    • 0029007921
    • The effects of breath sounds on the perception of synthetic speech
    • Whalen, D. and Hoequist, C. (1995). The effects of breath sounds on the perception of synthetic speech. Journal of the Acoustical Society of America, 97:3147-3153.
    • (1995) Journal of the Acoustical Society of America , vol.97 , pp. 3147-3153
    • Whalen, D.1    Hoequist, C.2
  • 59
    • 0030289961
    • Speech during sustained operations
    • Whitmore, J. and Fisher, S. (1996). Speech during sustained operations. Speech Communication, 20:55-70.
    • (1996) Speech Communication , vol.20 , pp. 55-70
    • Whitmore, J.1    Fisher, S.2
  • 62
    • 0030181237
    • Register as a variable in prosodic analysis: The case of the English negative
    • Yaeger-Dror, M. (1996). Register as a variable in prosodic analysis: The case of the English negative. Speech Communication, 19:39-60.
    • (1996) Speech Communication , vol.19 , pp. 39-60
    • Yaeger-Dror, M.1
  • 63
    • 0026149741
    • Cross-channel effects of vocal and physical attractiveness and their implications for interpersonal perception
    • Zuckerman, M., Miyake, K., and Hodgins, H. (1991). Cross-channel effects of vocal and physical attractiveness and their implications for interpersonal perception. Journal of Personality and Social Psychology, 60:545-554.
    • (1991) Journal of Personality and Social Psychology , vol.60 , pp. 545-554
    • Zuckerman, M.1    Miyake, K.2    Hodgins, H.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.