메뉴 건너뛰기




Volumn , Issue , 2013, Pages 7982-7986

Speaker and language independent voice quality classification applied to unlabelled corpora of expressive speech

Author keywords

audiobooks; expressive speech; glottal source; speech synthesis; Voice quality

Indexed keywords

AUDIOBOOKS; EXPRESSIVE SPEECH; EXPRESSIVE SPEECH SYNTHESIS; FUZZY-INPUT FUZZY-OUTPUT SUPPORT VECTOR MACHINES; GLOTTAL SOURCE; LANGUAGE INDEPENDENTS; SPEECH TECHNOLOGY; VOICE QUALITY;

EID: 84890470090     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6639219     Document Type: Conference Paper
Times cited : (17)

References (31)
  • 2
    • 0037380186 scopus 로고    scopus 로고
    • The role of voice quality in communicating emotion, mood and attitude
    • C. Gobl and A. N? Chasaide, "The role of voice quality in communicating emotion, mood and attitude," Speech Communication, vol. 40, pp. 189-212, 2003.
    • (2003) Speech Communication , vol.40 , pp. 189-212
    • Gobl, C.1    Chasaide, A.N.2
  • 3
    • 0035668083 scopus 로고    scopus 로고
    • Phonation types: A crosslinguistic review
    • M. Gordon and P. Ladefoged, "Phonation types: A crosslinguistic review," Journal of Phonetics, no. 29, pp. 383-406, 2001.
    • (2001) Journal of Phonetics , Issue.29 , pp. 383-406
    • Gordon, M.1    Ladefoged, P.2
  • 5
    • 80051650578 scopus 로고    scopus 로고
    • Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis
    • T. Raitio, A. Suni, H. Pulakka, M. Vainio, and P. Alku, "Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis," Proceedings of ICASSP, Prague, pp. 4564-4567, 2011.
    • (2011) Proceedings of ICASSP, Prague , pp. 4564-4567
    • Raitio, T.1    Suni, A.2    Pulakka, H.3    Vainio, M.4    Alku, P.5
  • 6
    • 84890528272 scopus 로고    scopus 로고
    • Expressive speech synthesis: Synthesising ambiguity
    • submitted
    • M. P. Aylett, B. Potard, and C. J. Pidcock, "Expressive speech synthesis: Synthesising ambiguity," in ICASSP13, submitted.
    • ICASSP13
    • Aylett, M.P.1    Potard, B.2    Pidcock, C.J.3
  • 7
    • 34547496515 scopus 로고    scopus 로고
    • The relevance of voice quality features in speaker independent emotion recognition
    • M. Lugger and B. Yang, "The relevance of voice quality features in speaker independent emotion recognition," Proceedings of ICASSP, Honolulu, Hawaii, vol. 4, pp. 17-20, 2007.
    • (2007) Proceedings of ICASSP, Honolulu, Hawaii , vol.4 , pp. 17-20
    • Lugger, M.1    Yang, B.2
  • 9
    • 84859756209 scopus 로고    scopus 로고
    • Impact of vocal effort variability on automatic speech recognition
    • P. Zelinka, M. Sigmund, and J. Schimmel, "Impact of vocal effort variability on automatic speech recognition," Speech Communication, vol. 54, no. 6, pp. 732-742, 2012.
    • (2012) Speech Communication , vol.54 , Issue.6 , pp. 732-742
    • Zelinka, P.1    Sigmund, M.2    Schimmel, J.3
  • 12
    • 0026941709 scopus 로고
    • Acoustic characteristics of voice quality
    • C. Gobl and A. N? Chasaide, "Acoustic characteristics of voice quality," Speech Communication, vol. 11, pp. 481-490, 1992.
    • (1992) Speech Communication , vol.11 , pp. 481-490
    • Gobl, C.1    Chasaide, A.N.2
  • 14
    • 84865726860 scopus 로고    scopus 로고
    • Identifying regions of non-modal phonation using features of the wavelet transform
    • J. Kane and C. Gobl, "Identifying regions of non-modal phonation using features of the wavelet transform," Proceedings of Interspeech, Florence, Italy, pp. 177-180, 2011.
    • (2011) Proceedings of Interspeech, Florence, Italy , pp. 177-180
    • Kane, J.1    Gobl, C.2
  • 17
    • 0000547455 scopus 로고
    • Classification of glottal vibration from acoustic measurements
    • K. Stevens and H. Hanson, "Classification of glottal vibration from acoustic measurements," Vocal fold physiology, pp. 147-170, 1994.
    • (1994) Vocal Fold Physiology , pp. 147-170
    • Stevens, K.1    Hanson, H.2
  • 18
    • 84867329306 scopus 로고    scopus 로고
    • Investigating fuzzy-input fuzzy-output support vector machines for robust voice quality classification
    • S. Scherer, J. Kane, C. Gobl, and F. Schwenker, "Investigating fuzzy-input fuzzy-output support vector machines for robust voice quality classification," Computer Speech and Language, vol. 27, pp. 263-287, 2013.
    • (2013) Computer Speech and Language , vol.27 , pp. 263-287
    • Scherer, S.1    Kane, J.2    Gobl, C.3    Schwenker, F.4
  • 19
    • 84878620106 scopus 로고    scopus 로고
    • Cries and whispers: Classification of vocal effort in expressive speech
    • Oregon, USA
    • N. Obin, "Cries and whispers: Classification of vocal effort in expressive speech," Proceedings of Interspeech, Portland, Oregon, USA, 2012.
    • (2012) Proceedings of Interspeech, Portland
    • Obin, N.1
  • 21
    • 84870254871 scopus 로고    scopus 로고
    • Evaluation of glottal closure instant detection in a range of voice qualities
    • J. Kane and C. Gobl, "Evaluation of glottal closure instant detection in a range of voice qualities," Speech Communication, vol. 55, pp. 295-314, 2013.
    • (2013) Speech Communication , vol.55 , pp. 295-314
    • Kane, J.1    Gobl, C.2
  • 22
    • 0026881384 scopus 로고
    • Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering
    • P. Alku, T. Backstrom, and E. Vilkman, "Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering," Speech Communication, vol. 11, no. 2-3, pp. 109-118, 1992.
    • (1992) Speech Communication , vol.11 , Issue.2-3 , pp. 109-118
    • Alku, P.1    Backstrom, T.2    Vilkman, E.3
  • 23
    • 0036339929 scopus 로고    scopus 로고
    • Normalized amplitude quotient for parameterization of the glottal flow
    • P. Alku, T. Backstrom, and E. Vilkman, "Normalized amplitude quotient for parameterization of the glottal flow," Journal of the Acoustical Society of America, vol. 112, no. 2, pp. 701-710, 2002.
    • (2002) Journal of the Acoustical Society of America , vol.112 , Issue.2 , pp. 701-710
    • Alku, P.1    Backstrom, T.2    Vilkman, E.3
  • 24
    • 0024381490 scopus 로고
    • Klassifizierung von glottisdysfunktionen mit hilfe der elektroglottographie
    • T. Hacki, "Klassifizierung von glottisdysfunktionen mit hilfe der elektroglottographie," Folia Phoniatrica, pp. 43-48, 1989.
    • (1989) Folia Phoniatrica , pp. 43-48
    • Hacki, T.1
  • 27
    • 84856245716 scopus 로고    scopus 로고
    • Glottal closure instant and voice source analysis using time-scale lines of maximum amplitude
    • C. d'Alessandro and N. Sturmel, "Glottal closure instant and voice source analysis using time-scale lines of maximum amplitude," Sadhana, vol. 36, no. 5, pp. 601-622, 2011.
    • (2011) Sadhana , vol.36 , Issue.5 , pp. 601-622
    • D'alessandro, C.1    Sturmel, N.2
  • 28
    • 84865734075 scopus 로고    scopus 로고
    • Joint robust voicing detection and pitch estimation based on residual harmonics
    • T. Drugman and A. Alwan, "Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics," Proceedings of Interspeech, Florence, Italy, pp. 1973-1976, 2011.
    • (2011) Proceedings of Interspeech, Florence, Italy , pp. 1973-1976
    • Drugman, T.1    Alwan, A.2
  • 30
    • 78049527800 scopus 로고    scopus 로고
    • The cerevoice characterful speech synthesiser sdk
    • M. P. Aylett and C. J. Pidcock, "The cerevoice characterful speech synthesiser sdk," in AISB, 2007, pp. 174-8.
    • (2007) AISB , pp. 174-178
    • Aylett, M.P.1    Pidcock, C.J.2
  • 31
    • 79959817774 scopus 로고    scopus 로고
    • Lightly supervised recognition for automatic alignment of large coherent speech recordings
    • N. Braunschweiler, M. Gales, and S. Buchholz, "Lightly supervised recognition for automatic alignment of large coherent speech recordings," Proceedings of Interspeech, Makuhari, Japan, pp. 2222-2225, 2010.
    • (2010) Proceedings of Interspeech, Makuhari, Japan , pp. 2222-2225
    • Braunschweiler, N.1    Gales, M.2    Buchholz, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.