메뉴 건너뛰기




Volumn 59, Issue , 2014, Pages 10-21

Phonetic feature extraction for context-sensitive glottal source processing

Author keywords

Expressive speech; Glottal source; Phonation type; Speech synthesis; Voice quality

Indexed keywords

CLASSIFICATION ALGORITHM; DISCRIMINATIVE CLASSIFIERS; EXPRESSIVE SPEECH; GAUSSIAN MIXTURE MODEL (GMMS); GLOTTAL SOURCE; PHONATION TYPE; SUPPORT VECTOR MACHINE (SVMS); VOICE QUALITY;

EID: 84892721755     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2013.12.003     Document Type: Article
Times cited : (12)

References (52)
  • 1
    • 70450163450 scopus 로고    scopus 로고
    • Comparison of multiple voice source parameters in different phonation types
    • Airas, M.; Alku, P.; 2007. Comparison of multiple voice source parameters in different phonation types. In: Proceedings of Interspeech 2007, Antwerp, Belgium, pp. 1410-1413.
    • (2007) Proceedings of Interspeech 2007, Antwerp, Belgium , pp. 1410-1413
    • Airas, M.1    Alku, P.2
  • 3
    • 0026881384 scopus 로고
    • Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering
    • P. Alku Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering Speech Commun. 11 2-3 1992 109 118
    • (1992) Speech Commun. , vol.11 , Issue.23 , pp. 109-118
    • Alku, P.1
  • 4
    • 84856294347 scopus 로고    scopus 로고
    • Glottal inverse filtering analysis of human voice production - A review of estimation and parameterization methods of the glottal excitation and their applications
    • P. Alku Glottal inverse filtering analysis of human voice production - a review of estimation and parameterization methods of the glottal excitation and their applications Sadhana 36 5 2011 623 650
    • (2011) Sadhana , vol.36 , Issue.5 , pp. 623-650
    • Alku, P.1
  • 6
    • 0031189455 scopus 로고    scopus 로고
    • Parabolic spectral parameter - A new method for quantification of the glottal flow
    • P. Alku, H. Strik, and E. Vilkman Parabolic spectral parameter - a new method for quantification of the glottal flow Speech Commun. 22 1 1997 67 79
    • (1997) Speech Commun. , vol.22 , Issue.1 , pp. 67-79
    • Alku, P.1    Strik, H.2    Vilkman, E.3
  • 7
    • 0036339929 scopus 로고    scopus 로고
    • Normalized amplitude quotient for parameterization of the glottal flow
    • P. Alku, T. Bäckström, and E. Vilkman Normalized amplitude quotient for parameterization of the glottal flow J. Acoust. Soc. Am. 112 2 2002 701 710
    • (2002) J. Acoust. Soc. Am. , vol.112 , Issue.2 , pp. 701-710
    • Alku, P.1    Bäckström, T.2    Vilkman, E.3
  • 8
    • 84882383984 scopus 로고    scopus 로고
    • Formant frequency estimation of high-pitched vowels using weighted linear prediction
    • P. Alku, J. Pohjalainen, M. Vainio, A. Laukkanen, and B. Story Formant frequency estimation of high-pitched vowels using weighted linear prediction J. Acoust. Soc. Am. 134 2 2013 1295 1313
    • (2013) J. Acoust. Soc. Am. , vol.134 , Issue.2 , pp. 1295-1313
    • Alku, P.1    Pohjalainen, J.2    Vainio, M.3    Laukkanen, A.4    Story, B.5
  • 12
    • 64449086223 scopus 로고    scopus 로고
    • Discrimination power of vocal source and vocal tract related features for speaker segmentation
    • W. Chan, N. Zheng, and T. Lee Discrimination power of vocal source and vocal tract related features for speaker segmentation IEEE Trans. Audio Speech Lang. process. 15 6 2007 1884 1892
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.6 , pp. 1884-1892
    • Chan, W.1    Zheng, N.2    Lee, T.3
  • 15
    • 80955173659 scopus 로고    scopus 로고
    • A comparative study of glottal source estimation techniques
    • T. Drugman, B. Bozkurt, and T. Dutoit A comparative study of glottal source estimation techniques Comput. Speech Lang. 26 2011 20 34
    • (2011) Comput. Speech Lang. , vol.26 , pp. 20-34
    • Drugman, T.1    Bozkurt, B.2    Dutoit, T.3
  • 20
    • 84875243987 scopus 로고    scopus 로고
    • Inverse filtering of nasalized vowels using synthesized speech
    • C. Gobl, and J. Mahshie Inverse filtering of nasalized vowels using synthesized speech J.Voice 27 2 2013 155 169
    • (2013) J.Voice , vol.27 , Issue.2 , pp. 155-169
    • Gobl, C.1    Mahshie, J.2
  • 21
    • 0024381490 scopus 로고
    • Klassifizierung von glottisdysfunktionen mit hilfe der elektroglottographie
    • T. Hacki Klassifizierung von glottisdysfunktionen mit hilfe der elektroglottographie Folia Phoniatrica 1989 43 48
    • (1989) Folia Phoniatrica , pp. 43-48
    • Hacki, T.1
  • 22
    • 0031023993 scopus 로고    scopus 로고
    • Glottal characteristics of female speakers: Acoustic correlates
    • H.M. Hanson Glottal characteristics of female speakers: acoustic correlates J. Acoust. Soc. Am. 10 1 1997 466 481
    • (1997) J. Acoust. Soc. Am. , vol.10 , Issue.1 , pp. 466-481
    • Hanson, H.M.1
  • 23
    • 0025751820 scopus 로고
    • Approximation capabilities of multilayer feedforward networks
    • K. Hornik Approximation capabilities of multilayer feedforward networks Neural Networks 4 2 1991 251 257
    • (1991) Neural Networks , vol.4 , Issue.2 , pp. 251-257
    • Hornik, K.1
  • 24
    • 77950073346 scopus 로고    scopus 로고
    • Spoken emotion recognition through optimum-path forest classification using glottal features
    • I. Iliev, M. Scordilis, J. Papa, and A. Falco Spoken emotion recognition through optimum-path forest classification using glottal features Comput. Speech Lang. 24 3 2010 445 460
    • (2010) Comput. Speech Lang. , vol.24 , Issue.3 , pp. 445-460
    • Iliev, I.1    Scordilis, M.2    Papa, J.3    Falco, A.4
  • 25
    • 84875409944 scopus 로고    scopus 로고
    • Automating manual user strategies for precise voice source analysis
    • J. Kane, and C. Gobl Automating manual user strategies for precise voice source analysis Speech Commun. 55 3 2013 397 414
    • (2013) Speech Commun. , vol.55 , Issue.3 , pp. 397-414
    • Kane, J.1    Gobl, C.2
  • 26
    • 84888256701 scopus 로고    scopus 로고
    • Evaluation of automatic glottal source analysis
    • Mons, Belgium
    • Kane, J.; Gobl, C.; 2013. Evaluation of automatic glottal source analysis. In: Proceedings of NOLISP, Mons, Belgium, pp. 1-8.
    • (2013) Proceedings of NOLISP , pp. 1-8
    • Kane, J.1    Gobl, C.2
  • 27
    • 84870254871 scopus 로고    scopus 로고
    • Evaluation of glottal closure instant detection in a range of voice qualities
    • J. Kane, and C. Gobl Evaluation of glottal closure instant detection in a range of voice qualities Speech Commun. 55 2 2013 295 314
    • (2013) Speech Commun. , vol.55 , Issue.2 , pp. 295-314
    • Kane, J.1    Gobl, C.2
  • 28
    • 84875035728 scopus 로고    scopus 로고
    • Wavelet maxima dispersion for breathy to tense voice discrimination
    • J. Kane, and C. Gobl Wavelet maxima dispersion for breathy to tense voice discrimination IEEE Trans. Audio Speech Lang. Process. 21 6 2013 1170 1179
    • (2013) IEEE Trans. Audio Speech Lang. Process. , vol.21 , Issue.6 , pp. 1170-1179
    • Kane, J.1    Gobl, C.2
  • 29
    • 84890470090 scopus 로고    scopus 로고
    • Speaker and language independent voice quality classification applied to unlabelled corpora of expressive speech
    • Vancouver, Canada
    • Kane, J.; Scherer, S.; Aylett, M.; Morency, L.; Gobl, C.; 2013. Speaker and language independent voice quality classification applied to unlabelled corpora of expressive speech. In: Proceedings of ICASSP, Vancouver, Canada.
    • (2013) Proceedings of ICASSP
    • Kane, J.1    Scherer, S.2    Aylett, M.3    Morency, L.4    Gobl, C.5
  • 30
    • 84906263810 scopus 로고    scopus 로고
    • Using phonetic feature extraction to determine optimal speech regions for maximising the effectiveness of glottal source analysis
    • Lyon, France
    • Kane, J.; Yanushevskaya, I.; Dalton, J.; Gobl, C.; NíChasaide, A.; 2013. Using phonetic feature extraction to determine optimal speech regions for maximising the effectiveness of glottal source analysis. In: Proceedings of Interspeech, Lyon, France.
    • (2013) Proceedings of Interspeech
    • Kane, J.1    Yanushevskaya, I.2    Dalton, J.3    Gobl, C.4    Níchasaide, A.5
  • 32
    • 0034297586 scopus 로고    scopus 로고
    • Detection of phonological features in continuous speech using neural networks
    • S. King, and P. Taylor Detection of phonological features in continuous speech using neural networks Comput. Speech Lang. 14 2000 333 353
    • (2000) Comput. Speech Lang. , vol.14 , pp. 333-353
    • King, S.1    Taylor, P.2
  • 34
    • 0038676761 scopus 로고    scopus 로고
    • Towards knowledge-based features for hmm based large vocabulary automatic speech recognition
    • Launay, B.; Siohan, O.; Surendran, A.; Lee, C.; 2002. Towards knowledge-based features for hmm based large vocabulary automatic speech recognition. In: Proceedings of ICASSP, Orlando, Florida, USA, pp. 817-820.
    • (2002) Proceedings of ICASSP, Orlando, Florida, USA , pp. 817-820
    • Launay, B.1    Siohan, O.2    Surendran, A.3    Lee, C.4
  • 36
    • 51449108623 scopus 로고    scopus 로고
    • Cascaded emotion classification via psychological emotion dimensions using a large set of voice quality parameters
    • Lugger, M.; Yang, B.; 2008. Cascaded emotion classification via psychological emotion dimensions using a large set of voice quality parameters. In: Proceedings of ICASSP, Las Vegas, Nevada, USA, pp. 4945-4948.
    • (2008) Proceedings of ICASSP, Las Vegas, Nevada, USA , pp. 4945-4948
    • Lugger, M.1    Yang, B.2
  • 37
    • 84966285002 scopus 로고    scopus 로고
    • Automatic detection of acoustic centres of reliability for tagging paralinguistic information in expressive speech
    • Mokhtari, P.; Campbell, N.; 2002. Automatic detection of acoustic centres of reliability for tagging paralinguistic information in expressive speech. In: Proceedings of Language Resources and Evaluation (LREC).
    • (2002) Proceedings of Language Resources and Evaluation (LREC)
    • Mokhtari, P.1    Campbell, N.2
  • 39
    • 30444446629 scopus 로고    scopus 로고
    • Combining evidence from residual phase and mfcc features for speaker recognition
    • K. Murty, and B. Yegnanarayana Combining evidence from residual phase and mfcc features for speaker recognition IEEE Signal Processing Lett. 13 1 2006 52 55
    • (2006) IEEE Signal Processing Lett. , vol.13 , Issue.1 , pp. 52-55
    • Murty, K.1    Yegnanarayana, B.2
  • 41
    • 84890547237 scopus 로고    scopus 로고
    • Synthesis and perception of breathy, normal, and lombard speech in the presence of noise
    • T. Raitio, A. Suni, M. Vainio, and P. Alku Synthesis and perception of breathy, normal, and lombard speech in the presence of noise Comput. Speech Lang. 28 2 2014 648 664
    • (2014) Comput. Speech Lang. , vol.28 , Issue.2 , pp. 648-664
    • Raitio, T.1    Suni, A.2    Vainio, M.3    Alku, P.4
  • 43
    • 67650999674 scopus 로고    scopus 로고
    • A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition
    • S. Siniscalchi, and C. Lee A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition Speech Commun. 51 11 2009 1139 1153
    • (2009) Speech Commun. , vol.51 , Issue.11 , pp. 1139-1153
    • Siniscalchi, S.1    Lee, C.2
  • 44
    • 84875405186 scopus 로고    scopus 로고
    • Exploiting deep neural networks for detection-based speech recognition
    • S. Siniscalchi, D. Yu, L. Deng, and C. Lee Exploiting deep neural networks for detection-based speech recognition Neurocomputing 106 2013 148 157
    • (2013) Neurocomputing , vol.106 , pp. 148-157
    • Siniscalchi, S.1    Yu, D.2    Deng, L.3    Lee, C.4
  • 45
    • 38549135463 scopus 로고    scopus 로고
    • A spectral method for estimation of the voice speed quotient and evaluation using electroglottography
    • Groningen, The Netherlands
    • Sturmel, N.; d'Alessandro, C.; Doval, B.; 2006. A spectral method for estimation of the voice speed quotient and evaluation using electroglottography. In: 7th Conference on Advances in Quantitative Laryngology, Groningen, The Netherlands.
    • (2006) 7th Conference on Advances in Quantitative Laryngology
    • Sturmel, N.1    D'Alessandro, C.2    Doval, B.3
  • 46
    • 84867584684 scopus 로고    scopus 로고
    • Detecting a targeted voice style in an audiobook using voice quality features
    • Kyoto, Japan
    • Székely, É.; Kane, J.; Scherer, S.; Gobl, C.; Carson-Berndsen, J.; 2012. Detecting a targeted voice style in an audiobook using voice quality features. In: Proceedings of ICASSP, Kyoto, Japan, 4593-4596.
    • (2012) Proceedings of ICASSP , pp. 4593-4596
    • Székely, E.1    Kane, J.2    Scherer, S.3    Gobl, C.4    Carson-Berndsen, J.5
  • 48
    • 0003236089 scopus 로고
    • Evidence for nonlinear sound production mechanisms in the vocal tract
    • W.J. Hardcastle, A. Marchal, Kluwer Academic
    • H.M. Teager, and S.M. Teager Evidence for nonlinear sound production mechanisms in the vocal tract W.J. Hardcastle, A. Marchal, Speech Production and Speech Modelling 1990 Kluwer Academic 241 261
    • (1990) Speech Production and Speech Modelling , pp. 241-261
    • Teager, H.M.1    Teager, S.M.2
  • 49
    • 39149117062 scopus 로고    scopus 로고
    • A review of glottal waveform analysis
    • Y. Stylianou, M. Faundez-Zanuy, A. Esposito, Springer Verlag
    • J. Walker, and P. Murphy A review of glottal waveform analysis Y. Stylianou, M. Faundez-Zanuy, A. Esposito, Progress in Nonlinear Speech Processing 2007 Springer Verlag 1 21
    • (2007) Progress in Nonlinear Speech Processing , pp. 1-21
    • Walker, J.1    Murphy, P.2
  • 51
    • 84867329143 scopus 로고    scopus 로고
    • Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition
    • Yu, D.; Siniscalchi, S.; Deng, L.; Lee, C.; 2012. Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition. In: Proceedings of ICASSP, pp. 4169-4172.
    • (2012) Proceedings of ICASSP , pp. 4169-4172
    • Yu, D.1    Siniscalchi, S.2    Deng, L.3    Lee, C.4
  • 52
    • 33947583290 scopus 로고    scopus 로고
    • Integration of complementary acoustic features for speaker recognition
    • N. Zheng, T. Lee, and P. Ching Integration of complementary acoustic features for speaker recognition IEEE Signal Processing Lett. 14 3 2007 181 184
    • (2007) IEEE Signal Processing Lett. , vol.14 , Issue.3 , pp. 181-184
    • Zheng, N.1    Lee, T.2    Ching, P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.