메뉴 건너뛰기




Volumn 20, Issue 3, 2012, Pages 968-981

The Deterministic plus Stochastic model of the residual signal and its applications

Author keywords

excitation modeling; glottal flow; speaker recognition; Speech analysis; speech synthesis

Indexed keywords

COMPUTATIONAL PROPERTIES; DATA SETS; EXCITATION MODELS; GLOTTAL FLOW; HIGH-FREQUENCY NOISE; HMM-BASED SPEECH SYNTHESIS; LOW FREQUENCY; ORTHONORMAL; PARAMETERIZING; PROCESSING APPLICATIONS; PULSE EXCITATION; RECOGNITION RATES; RESIDUAL SIGNALS; SPEAKER IDENTIFICATION; SPEAKER RECOGNITION; SPECTRAL BAND; SPEECH PRODUCTION; STOCHASTIC COMPONENT;

EID: 84856248602     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2011.2169787     Document Type: Article
Times cited : (98)

References (47)
  • 2
    • 80955173659 scopus 로고    scopus 로고
    • Comparative study of glottal source estimation techniques
    • Jan.
    • T. Drugman, B. Bozkurt, and T. Dutoit, "Comparative study of glottal source estimation techniques," Comput. Speech Lang., vol. 26, no. 1, pp. 20-34, Jan. 2012.
    • (2012) Comput. Speech Lang. , vol.26 , Issue.1 , pp. 20-34
    • Drugman, T.1    Bozkurt, B.2    Dutoit, T.3
  • 3
    • 85131821539 scopus 로고
    • Mel generalized cepstral analysis a unified approach to speech spectral estimation
    • K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Mel generalized cepstral analysis a unified approach to speech spectral estimation," in Proc. ICSLP, 1994.
    • (1994) Proc. ICSLP
    • Tokuda, K.1    Kobayashi, T.2    Masuko, T.3    Imai, S.4
  • 6
    • 0028515601 scopus 로고
    • Enhancement of multiband excitation (MBE) by pitch-cycle waveform coding
    • H. Yang, S. Koh, and P. Sivaprakasapillai, "Enhancement of multiband excitation (MBE) by pitch-cycle waveform coding," Electron. Lett., vol. 30, no. 20, pp. 1645-1646, 1994.
    • (1994) Electron. Lett. , vol.30 , Issue.20 , pp. 1645-1646
    • Yang, H.1    Koh, S.2    Sivaprakasapillai, P.3
  • 7
    • 0033677122 scopus 로고    scopus 로고
    • Mixed excitation linear prediction coding of wideband speech at 8 kbps
    • Speech, Signal Process. (ICASSP)
    • W. Lin, S. Koh, and X. Lin, "Mixed excitation linear prediction coding of wideband speech at 8 kbps," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2000, vol. 2, pp. 1137-1140.
    • (2000) Proc. IEEE Int. Conf. Acoust. , vol.2 , pp. 1137-1140
    • Lin, W.1    Koh, S.2    Lin, X.3
  • 9
    • 78649297510 scopus 로고    scopus 로고
    • An excitation model for HMM-based speech synthesis based on residual modeling
    • R. Maia, T. Toda, H. Zen, Y. Nankaku, and K. Tokuda, "An excitation model for HMM-based speech synthesis based on residual modeling," in Proc. ISCA SSW6, 2007.
    • (2007) Proc. ISCA SSW6
    • Maia, R.1    Toda, T.2    Zen, H.3    Nankaku, Y.4    Tokuda, K.5
  • 10
    • 33846935000 scopus 로고    scopus 로고
    • HMM-based Korean speech synthesis system for hand-held devices
    • DOI 10.1109/TCE.2006.273160
    • S.-J. Kim, J.-J. Kim, and M. Hahn, "HMM-based korean speech synthesis system for hand-held devices," IEEE Trans. Consumer Electron., vol. 58, no. 4, pp. 1384-1390, Apr. 2006. (Pubitemid 46231653)
    • (2006) IEEE Transactions on Consumer Electronics , vol.52 , Issue.4 , pp. 1384-1390
    • Kim, S.-J.1    Kim, J.-J.2    Hahn, M.3
  • 11
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, pp. 187-207, 2001.
    • (2001) Speech Commun. , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigne, A.3
  • 13
    • 33947684811 scopus 로고
    • A four parameter model of glottal flow
    • G. Fant and J. L. Q. Lin, "A four parameter model of glottal flow," in Proc. STL-QPSR4, 1985, pp. 1-13.
    • (1985) Proc. STL-QPSR4 , pp. 1-13
    • Fant, G.1    Lin, J.L.Q.2
  • 15
    • 0032595183 scopus 로고    scopus 로고
    • Modeling of the glottal flow derivative waveform with application to speaker identification
    • Sep.
    • M. D. Plumpe, T. F. Quatieri, and D. A. Reynolds, "Modeling of the glottal flow derivative waveform with application to speaker identification," IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 569-576, Sep. 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.5 , pp. 569-576
    • Plumpe, M.D.1    Quatieri, T.F.2    Reynolds, D.A.3
  • 16
    • 51449086496 scopus 로고    scopus 로고
    • Voice source cepstrum coefficients for speaker identification
    • Speech, Signal Process. (ICASSP)
    • J. Gudnason and M. Brookes, "Voice source cepstrum coefficients for speaker identification," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2008, pp. 4821-4824.
    • (2008) Proc. IEEE Int. Conf. Acoust. , pp. 4821-4824
    • Gudnason, J.1    Brookes, M.2
  • 17
    • 0029356550 scopus 로고
    • Usefulness of the LPC-residue intext-inde-pendent speaker verification
    • P. Thevenaz and H. Hugli, "Usefulness of the LPC-residue intext-inde-pendent speaker verification," Speech Commun., vol. 17, pp. 145-157, 1995.
    • (1995) Speech Commun. , vol.17 , pp. 145-157
    • Thevenaz, P.1    Hugli, H.2
  • 18
    • 30444446629 scopus 로고    scopus 로고
    • Combining evidence from residual phase and MFCC features for speaker recognition
    • DOI 10.1109/LSP.2005.860538
    • S. Murty and B. Yegnanarayana, "Combining evidence from residual phase and MFCC features for speaker recognition," IEEE Signal Process. Lett., vol. 13, no. 1, pp. 52-55, Jan. 2006. (Pubitemid 43072461)
    • (2006) IEEE Signal Processing Letters , vol.13 , Issue.1 , pp. 52-55
    • Sri Rama Murty, K.1    Yegnanarayana, B.2
  • 19
    • 70450204573 scopus 로고    scopus 로고
    • A deterministic plus stochastic model of the residual signal for improved parametric speech synthesis
    • T. Drugman, G. Wilfart, and T. Dutoit, "A deterministic plus stochastic model of the residual signal for improved parametric speech synthesis," in Proc. Interspeech Conf., 2009.
    • (2009) Proc. Interspeech Conf.
    • Drugman, T.1    Wilfart, G.2    Dutoit, T.3
  • 20
    • 79959830729 scopus 로고    scopus 로고
    • On the potential of glottal signatures for speaker recognition
    • T. Drugman and T. Dutoit, "On the potential of glottal signatures for speaker recognition," in Proc. Interspeech Conf., 2010.
    • (2010) Proc. Interspeech Conf.
    • Drugman, T.1    Dutoit, T.2
  • 21
    • 0035127703 scopus 로고    scopus 로고
    • Applying the harmonic plus noise model in concatenative speech synthesis
    • DOI 10.1109/89.890068
    • Y. Stylianou, "Applying the harmonic plus noise model in concatenative speech synthesis," IEEE Trans. Speech Audio Process., vol. 9, no. 1, pp. 21-29, Jan. 2001. (Pubitemid 32130684)
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.1 , pp. 21-29
    • Stylianou, Y.1
  • 22
    • 34547541173 scopus 로고    scopus 로고
    • A new method for speech synthesis and transformation based on an ARX-LF source-filter decomposition and HNM modeling
    • Speech, Signal Process. (ICASSP)
    • D. Vincent, O. Rosec, and T. Chonavel, "A new method for speech synthesis and transformation based on an ARX-LF source-filter decomposition and HNM modeling," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007, pp. 525-528.
    • (2007) Proc. IEEE Int. Conf. Acoust. , pp. 525-528
    • Vincent, D.1    Rosec, O.2    Chonavel, T.3
  • 23
    • 70349208681 scopus 로고    scopus 로고
    • ARX-LF-based source-filter methods for voice modification and transformation
    • Speech, Signal Process. (ICASSP)
    • Y. Agiomyrgiannakis and O. Rosec, "ARX-LF-based source-filter methods for voice modification and transformation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2009, pp.3589-3592.
    • (2009) Proc. IEEE Int. Conf. Acoust. , pp. 3589-3592
    • Agiomyrgiannakis, Y.1    Rosec, O.2
  • 24
    • 70450198169 scopus 로고    scopus 로고
    • Glottal closure and opening instant detection from speech signals
    • T. Drugman and T. Dutoit, "Glottal closure and opening instant detection from speech signals," in Proc. Interspeech Conf., 2009.
    • (2009) Proc. Interspeech Conf.
    • Drugman, T.1    Dutoit, T.2
  • 25
    • 70349203247 scopus 로고    scopus 로고
    • The nitech-NAIST HMM-based speech synthesais system for the blizzard challenge 2006
    • H. Zen, T. Toda, and K. Tokuda, "The Nitech-NAIST HMM-based speech synthesais system for the Blizzard challenge 2006," IEICE Trans. Inf. Syst., 2006.
    • (2006) IEICE Trans. Inf. Syst.
    • Zen, H.1    Toda, T.2    Tokuda, K.3
  • 26
    • 68949094517 scopus 로고    scopus 로고
    • Optimum MVF estimation-based two-band excitation for HMM-based speech synthesis
    • S. Han, S. Jeong, and M. Hahn, "Optimum MVF estimation-based two-band excitation for HMM-based speech synthesis," ETRI J., vol. 31, no. 4, pp. 457-459, 2009.
    • (2009) ETRI J. , vol.31 , Issue.4 , pp. 457-459
    • Han, S.1    Jeong, S.2    Hahn, M.3
  • 27
    • 9444268127 scopus 로고    scopus 로고
    • Expressing vocal effort in concatenative synthesis
    • M. Schroeder and M. Grice, "Expressing vocal effort in concatenative synthesis," in Proc. 15th Int. Conf. Phon. Sci., 2003, pp. 2589-2592.
    • (2003) Proc. 15th Int. Conf. Phon. Sci. , pp. 2589-2592
    • Schroeder, M.1    Grice, M.2
  • 29
    • 51449095025 scopus 로고    scopus 로고
    • Improving the modeling of the noise part in the harmonic plus noise model of speech
    • Y. Pantazis and Y. Stylianou, "Improving the modeling of the noise part in the harmonic plus noise model of speech," in Proc. IEEE ICASSP, 2008, pp. 4609-1612.
    • (2008) Proc. IEEE ICASSP , pp. 4609-1612
    • Pantazis, Y.1    Stylianou, Y.2
  • 30
    • 84856272951 scopus 로고    scopus 로고
    • A comparative evaluation of pitch modification techniques
    • T. Drugman and T. Dutoit, "A comparative evaluation of pitch modification techniques," in Proc. Eur. Signal Process. Conf., 2010, pp. 756-760.
    • (2010) Proc. Eur. Signal Process. Conf. , pp. 756-760
    • Drugman, T.1    Dutoit, T.2
  • 34
    • 26844515690 scopus 로고    scopus 로고
    • Mixed-phase speech modeling and formant estimation, using differential phase spectrums
    • B. Bozkurt and T. Dutoit, "Mixed-phase speech modeling and formant estimation, using differential phase spectrums,' in Proc. ISCA ITRW VOQUAL03, 2003, pp. 21-24.
    • (2003) Proc. ISCA ITRW VOQUAL03 , pp. 21-24
    • Bozkurt, B.1    Dutoit, T.2
  • 35
    • 85016140477 scopus 로고
    • An adaptive algorithm for mel-cepstral analysis of speech
    • Speech, Signal Process. (ICASSP)
    • T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for Mel-cepstral analysis of speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1992, vol. 1, pp. 137-140.
    • (1992) Proc. IEEE Int. Conf. Acoust. , vol.1 , pp. 137-140
    • Fukada, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 36
    • 34547526960 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • Speech, Signal Process. (ICASSP)
    • A. W. Black, H. Zen, and K. Tokuda, "Statistical parametric speech synthesis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007, pp. 1229-1232.
    • (2007) Proc. IEEE Int. Conf. Acoust. , pp. 1229-1232
    • Black, A.W.1    Zen, H.2    Tokuda, K.3
  • 38
    • 85031628788 scopus 로고
    • An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features
    • K. Tokuda, T. Masuko, T. Yamada, T. Kobayashi, and S. Imai, "An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features," in Proc. Eurospeech, 1995.
    • (1995) Proc. Eurospeech
    • Tokuda, K.1    Masuko, T.2    Yamada, T.3    Kobayashi, T.4    Imai, S.5
  • 40
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans. Inf. Syst., vol. 90, no. 5, pp. 816-824, 2007.
    • (2007) IEICE Trans. Inf. Syst. , vol.90 , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 43
    • 0036293830 scopus 로고    scopus 로고
    • An overview of automatic speaker recognition technology
    • Speech, Signal Process. (ICASSP)
    • D. Reynolds, "An overview of automatic speaker recognition technology," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2002, vol. 4, pp. 4072-4075.
    • (2002) Proc. IEEE Int. Conf. Acoust. , vol.4 , pp. 4072-4075
    • Reynolds, D.1
  • 44
    • 33748443739 scopus 로고    scopus 로고
    • Extraction of speaker-specific excitation information from linear prediction residual of speech
    • DOI 10.1016/j.specom.2006.06.002, PII S0167639306000665
    • S. Prasanna, C. Gupta, and B. Yegnanarayana, "Extraction of speaker-specific information from linear prediction residual of speech," IEEE Trans. Pattern Anal. Mach. Intell., vol. 48, no. 10, pp. 1243-1261, Oct. 2006. (Pubitemid 44353818)
    • (2006) Speech Communication , vol.48 , Issue.10 , pp. 1243-1261
    • Mahadeva Prasanna, S.R.1    Gupta, C.S.2    Yegnanarayana, B.3
  • 47
    • 0028996937 scopus 로고
    • Testing with the YOHO CD-ROM voice verification corpus
    • Speech, Signal Process. (ICASSP)
    • J. Campbell, "Testing with the YOHO CD-ROM voice verification corpus," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1995, pp. 341-344.
    • (1995) Proc. IEEE Int. Conf. Acoust. , pp. 341-344
    • Campbell, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.