SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2013, Pages 7982-7986

Speaker and language independent voice quality classification applied to unlabelled corpora of expressive speech

(5) Kane, John a Scherer, Stefan b Aylett, Matthew c Morency, Louis Philippe b Gobl, Christer a

a TRINITY COLLEGE DUBLIN (Ireland)

b University of Southern California (United States)

c UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

audiobooks; expressive speech; glottal source; speech synthesis; Voice quality

Indexed keywords

AUDIOBOOKS; EXPRESSIVE SPEECH; EXPRESSIVE SPEECH SYNTHESIS; FUZZY-INPUT FUZZY-OUTPUT SUPPORT VECTOR MACHINES; GLOTTAL SOURCE; LANGUAGE INDEPENDENTS; SPEECH TECHNOLOGY; VOICE QUALITY;

SIGNAL PROCESSING; SPEECH SYNTHESIS;

QUALITY CONTROL;

EID: 84890470090 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2013.6639219 Document Type: Conference Paper

Times cited : (17)

References (31)

1
- 0003557856
- Cambridge University Press
- J. Laver, The Phonetic Description of Voice Quality, Cambridge University Press, 1980.
- (1980) The Phonetic Description of Voice Quality
- Laver, J.¹

2
- 0037380186
- The role of voice quality in communicating emotion, mood and attitude
- C. Gobl and A. N? Chasaide, "The role of voice quality in communicating emotion, mood and attitude," Speech Communication, vol. 40, pp. 189-212, 2003.
- (2003) Speech Communication , vol.40 , pp. 189-212
- Gobl, C.¹ Chasaide, A.N.²

3
- 0035668083
- Phonation types: A crosslinguistic review
- M. Gordon and P. Ladefoged, "Phonation types: A crosslinguistic review," Journal of Phonetics, no. 29, pp. 383-406, 2001.
- (2001) Journal of Phonetics , Issue.29 , pp. 383-406
- Gordon, M.¹ Ladefoged, P.²

4
- 80051619329
- HMM-based speech synthesiser using the LF-model of the glottal source
- J.P. Cabral, S. Renals, J. Yamagishi, and K. Richmond, "HMM-based speech synthesiser using the LF-model of the glottal source," Proceedings of ICASSP, Prague, Czech Republic, pp. 4704-4707, 2011.
- (2011) Proceedings of ICASSP, Prague, Czech Republic , pp. 4704-4707
- Cabral, J.P.¹ Renals, S.² Yamagishi, J.³ Richmond, K.⁴

5
- 80051650578
- Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis
- T. Raitio, A. Suni, H. Pulakka, M. Vainio, and P. Alku, "Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis," Proceedings of ICASSP, Prague, pp. 4564-4567, 2011.
- (2011) Proceedings of ICASSP, Prague , pp. 4564-4567
- Raitio, T.¹ Suni, A.² Pulakka, H.³ Vainio, M.⁴ Alku, P.⁵

6
- 84890528272
- Expressive speech synthesis: Synthesising ambiguity
- submitted
- M. P. Aylett, B. Potard, and C. J. Pidcock, "Expressive speech synthesis: Synthesising ambiguity," in ICASSP13, submitted.
- ICASSP13
- Aylett, M.P.¹ Potard, B.² Pidcock, C.J.³

7
- 34547496515
- The relevance of voice quality features in speaker independent emotion recognition
- M. Lugger and B. Yang, "The relevance of voice quality features in speaker independent emotion recognition," Proceedings of ICASSP, Honolulu, Hawaii, vol. 4, pp. 17-20, 2007.
- (2007) Proceedings of ICASSP, Honolulu, Hawaii , vol.4 , pp. 17-20
- Lugger, M.¹ Yang, B.²

8
- 84902969948
- Usual voice quality features and glottal features for emotional valence detection
- M. Tahon, G. Degottex, and L. Devillers, "Usual voice quality features and glottal features for emotional valence detection," Proceedings of Speech Prosody, Shanghai, China, 2012.
- (2012) Proceedings of Speech Prosody, Shanghai, China
- Tahon, M.¹ Degottex, G.² Devillers, L.³

9
- 84859756209
- Impact of vocal effort variability on automatic speech recognition
- P. Zelinka, M. Sigmund, and J. Schimmel, "Impact of vocal effort variability on automatic speech recognition," Speech Communication, vol. 54, no. 6, pp. 732-742, 2012.
- (2012) Speech Communication , vol.54 , Issue.6 , pp. 732-742
- Zelinka, P.¹ Sigmund, M.² Schimmel, J.³

10
- 84867213570
- Effects of vocal effort and speaking style on text-independent speaker verification
- E. Shriberg, M. Graciarena, H. Bratt, A. Kathol, S. Kajarekar, H. Jameel, C. Richey, and F. Goodman, "Effects of vocal effort and speaking style on text-independent speaker verification," Proceedings of Interspeech, Brisbane, Australia, pp. 609-612, 2008.
- (2008) Proceedings of Interspeech, Brisbane, Australia , pp. 609-612
- Shriberg, E.¹ Graciarena, M.² Bratt, H.³ Kathol, A.⁴ Kajarekar, S.⁵ Jameel, H.⁶ Richey, C.⁷ Goodman, F.⁸

11
- 0003418124
- Mouton, Hague (2nd edition 1970)
- G. Fant, The acoustic theory of speech production, Mouton, Hague (2nd edition 1970), 1960.
- (1960) The Acoustic Theory of Speech Production
- Fant, G.¹

12
- 0026941709
- Acoustic characteristics of voice quality
- C. Gobl and A. N? Chasaide, "Acoustic characteristics of voice quality," Speech Communication, vol. 11, pp. 481-490, 1992.
- (1992) Speech Communication , vol.11 , pp. 481-490
- Gobl, C.¹ Chasaide, A.N.²

13
- 85008008295
- Phase minimization for glottal model estimation
- G. Degottex, A. Roebel, and X. Rodet, "Phase minimization for glottal model estimation," IEEE Transactions on Audio Speech and Language processing, vol. 19, no. 5, pp. 1080-1090, 2011.
- (2011) IEEE Transactions on Audio Speech and Language Processing , vol.19 , Issue.5 , pp. 1080-1090
- Degottex, G.¹ Roebel, A.² Rodet, X.³

14
- 84865726860
- Identifying regions of non-modal phonation using features of the wavelet transform
- J. Kane and C. Gobl, "Identifying regions of non-modal phonation using features of the wavelet transform," Proceedings of Interspeech, Florence, Italy, pp. 177-180, 2011.
- (2011) Proceedings of Interspeech, Florence, Italy , pp. 177-180
- Kane, J.¹ Gobl, C.²

15
- 84947292665
- Wavelet maxima dispersion for breathy to tense voice discrimination
- J. Kane and C. Gobl, "Wavelet maxima dispersion for breathy to tense voice discrimination," IEEE Transactions on Audio Speech and Language processing, Under Review.
- IEEE Transactions on Audio Speech and Language Processing, under Review
- Kane, J.¹ Gobl, C.²

16
- 84902658348
- Extracting voice quality contours using discrete hidden markov models
- ISCA
- M. Lugger, F. Stimm, and B. Yang, "Extracting voice quality contours using discrete hidden markov models," in Proceedings of Speech Prosody 2008, Campinas, Brazil, 2008, pp. 29-32, ISCA.
- (2008) Proceedings of Speech Prosody 2008, Campinas, Brazil , pp. 29-32
- Lugger, M.¹ Stimm, F.² Yang, B.³

17
- 0000547455
- Classification of glottal vibration from acoustic measurements
- K. Stevens and H. Hanson, "Classification of glottal vibration from acoustic measurements," Vocal fold physiology, pp. 147-170, 1994.
- (1994) Vocal Fold Physiology , pp. 147-170
- Stevens, K.¹ Hanson, H.²

18
- 84867329306
- Investigating fuzzy-input fuzzy-output support vector machines for robust voice quality classification
- S. Scherer, J. Kane, C. Gobl, and F. Schwenker, "Investigating fuzzy-input fuzzy-output support vector machines for robust voice quality classification," Computer Speech and Language, vol. 27, pp. 263-287, 2013.
- (2013) Computer Speech and Language , vol.27 , pp. 263-287
- Scherer, S.¹ Kane, J.² Gobl, C.³ Schwenker, F.⁴

19
- 84878620106
- Cries and whispers: Classification of vocal effort in expressive speech
- Oregon, USA
- N. Obin, "Cries and whispers: Classification of vocal effort in expressive speech," Proceedings of Interspeech, Portland, Oregon, USA, 2012.
- (2012) Proceedings of Interspeech, Portland
- Obin, N.¹

20
- 81155151861
- Oscillating statistical moments for speech polarity detection
- T. Drugman and T. Dutoit, "Oscillating Statistical Moments for Speech Polarity Detection," Proceedings of Non-Linear Speech Processing Workshop (NOLISP11), Las Palmas, Gran Canaria, Spain, pp. 48-54, 2011.
- (2011) Proceedings of Non-Linear Speech Processing Workshop (NOLISP11), Las Palmas, Gran Canaria, Spain , pp. 48-54
- Drugman, T.¹ Dutoit, T.²

21
- 84870254871
- Evaluation of glottal closure instant detection in a range of voice qualities
- J. Kane and C. Gobl, "Evaluation of glottal closure instant detection in a range of voice qualities," Speech Communication, vol. 55, pp. 295-314, 2013.
- (2013) Speech Communication , vol.55 , pp. 295-314
- Kane, J.¹ Gobl, C.²

22
- 0026881384
- Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering
- P. Alku, T. Backstrom, and E. Vilkman, "Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering," Speech Communication, vol. 11, no. 2-3, pp. 109-118, 1992.
- (1992) Speech Communication , vol.11 , Issue.2-3 , pp. 109-118
- Alku, P.¹ Backstrom, T.² Vilkman, E.³

23
- 0036339929
- Normalized amplitude quotient for parameterization of the glottal flow
- P. Alku, T. Backstrom, and E. Vilkman, "Normalized amplitude quotient for parameterization of the glottal flow," Journal of the Acoustical Society of America, vol. 112, no. 2, pp. 701-710, 2002.
- (2002) Journal of the Acoustical Society of America , vol.112 , Issue.2 , pp. 701-710
- Alku, P.¹ Backstrom, T.² Vilkman, E.³

24
- 0024381490
- Klassifizierung von glottisdysfunktionen mit hilfe der elektroglottographie
- T. Hacki, "Klassifizierung von glottisdysfunktionen mit hilfe der elektroglottographie," Folia Phoniatrica, pp. 43-48, 1989.
- (1989) Folia Phoniatrica , pp. 43-48
- Hacki, T.¹

25
- 33947684811
- A four parameter model of glottal flow
- G. Fant, J. Liljencrants, and Q. Lin, "A four parameter model of glottal flow," KTH, Speech Transmission Laboratory, Quarterly Report, vol. 4, pp. 1-13, 1985.
- (1985) KTH, Speech Transmission Laboratory, Quarterly Report , vol.4 , pp. 1-13
- Fant, G.¹ Liljencrants, J.² Lin, Q.³

26
- 0003465464
- Englewood Cliffs, NJ: Prentice-Hall
- R. P. Brent, Algorithms for Minimization Without Derivatives., Englewood Cliffs, NJ: Prentice-Hall 1973.
- (1973) Algorithms for Minimization Without Derivatives.
- Brent, R.P.¹

27
- 84856245716
- Glottal closure instant and voice source analysis using time-scale lines of maximum amplitude
- C. d'Alessandro and N. Sturmel, "Glottal closure instant and voice source analysis using time-scale lines of maximum amplitude," Sadhana, vol. 36, no. 5, pp. 601-622, 2011.
- (2011) Sadhana , vol.36 , Issue.5 , pp. 601-622
- D'alessandro, C.¹ Sturmel, N.²

28
- 84865734075
- Joint robust voicing detection and pitch estimation based on residual harmonics
- T. Drugman and A. Alwan, "Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics," Proceedings of Interspeech, Florence, Italy, pp. 1973-1976, 2011.
- (2011) Proceedings of Interspeech, Florence, Italy , pp. 1973-1976
- Drugman, T.¹ Alwan, A.²

29
- 0001907967
- Support vector machines: Hype or hallelujah
- K. P. Bennett and C. Campbell, "Support vector machines: hype or hallelujah?," ACM Special Interest Group on Knowledge Discovery and Data Mining Explorations Newsletter, vol. 2, no. 2, pp. 1-13, 2000.
- (2000) ACM Special Interest Group on Knowledge Discovery and Data Mining Explorations Newsletter , vol.2 , Issue.2 , pp. 1-13
- Bennett, K.P.¹ Campbell, C.²

30
- 78049527800
- The cerevoice characterful speech synthesiser sdk
- M. P. Aylett and C. J. Pidcock, "The cerevoice characterful speech synthesiser sdk," in AISB, 2007, pp. 174-8.
- (2007) AISB , pp. 174-178
- Aylett, M.P.¹ Pidcock, C.J.²

31
- 79959817774
- Lightly supervised recognition for automatic alignment of large coherent speech recordings
- N. Braunschweiler, M. Gales, and S. Buchholz, "Lightly supervised recognition for automatic alignment of large coherent speech recordings," Proceedings of Interspeech, Makuhari, Japan, pp. 2222-2225, 2010.
- (2010) Proceedings of Interspeech, Makuhari, Japan , pp. 2222-2225
- Braunschweiler, N.¹ Gales, M.² Buchholz, S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.