Volume 17, Issue 4, 2009, Pages 775-786

Wrapped Gaussian mixture models for modeling and high-rate quantization of phase data of speech

Author keywords

Circular statistics; Phase quantization; Sinusoidal models; Speech analysis; Speech coding; Voice over IP; Wrapped Gaussian mixture models (WGMMs)

Indexed keywords

CIRCULAR STATISTICS; PHASE QUANTIZATION; SINUSOIDAL MODELS; VOICE-OVER-IP; WRAPPED GAUSSIAN MIXTURE MODELS (WGMMS)

EID: 65249164963     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2008229     Document Type: Article
Times cited: 49

References (38)
  • 1
    • 85009100883
    • Usefulness of phase spectrum in human speech perception
    • K. K. Paliwal and L. Alsteris, "Usefulness of phase spectrum in human speech perception," in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 2117-2120.
    • (2003) Proc. Eurospeech, pp. 2117-2120
    • Paliwal, K.K.; Alsteris, L.
  • 2
    • 0035509497
    • On the perceptual irrelevant phase information in sinusoidal representation of speech
    • D.-S. Kim, "On the perceptual irrelevant phase information in sinusoidal representation of speech," IEEE Trans. Speech Audio Process., vol. 9, no. 8, pp. 900-905, Nov. 2001.
    • (2001) IEEE Trans. Speech Audio Process., vol. 9, no. 8, pp. 900-905
    • Kim, D.-S.
  • 3
    • 0031220487
    • Effects of phase on the perception of intervocalic stop consonants
    • L. Liu, J. He, and G. Palm, "Effects of phase on the perception of intervocalic stop consonants," Speech Commun., vol. 22, no. 4, pp. 403-407, 1997.
    • (1997) Speech Commun., vol. 22, no. 4, pp. 403-407
    • Liu, L.; He, J.; Palm, G.
  • 6
    • 0024060644
    • Multi-band excitation vocoder
    • D. Griffin and J. Lim, "Multi-band excitation vocoder," in Proc. ICASSP, New York, Apr. 1988, vol. 36, pp. 1223-1235.
    • (1988) Proc. ICASSP, vol. 36, pp. 1223-1235
    • Griffin, D.; Lim, J.
  • 9
    • 0030672097
    • A new sinusoidal phase modelling algorithm
    • S. Ahmadi and A. Spanias, "A new sinusoidal phase modelling algorithm," in Proc. ICASSP, Munich, Germany, 1997, vol. 3, pp. 1675-1678.
    • (1997) Proc. ICASSP, vol. 3, pp. 1675-1678
    • Ahmadi, S.; Spanias, A.
  • 10
    • 0023845839
    • Phase compensation in all-pole speech analysis
    • P. Hedelin, "Phase compensation in all-pole speech analysis," in Proc. ICASSP, New York, Apr. 1988, pp. 339-342.
    • (1988) Proc. ICASSP, pp. 339-342
    • Hedelin, P.
  • 11
    • 0023871808
    • Parametric models of the magnitude/phase spectrum for harmonic speech coding
    • D. L. Thomson, "Parametric models of the magnitude/phase spectrum for harmonic speech coding," in Proc. ICASSP, 1988, vol. 1, pp. 378-381.
    • (1988) Proc. ICASSP, vol. 1, pp. 378-381
    • Thomson, D.L.
  • 13
    • 0035510537
    • Enhanced waveform interpolative coding at low bitrate
    • O. Gottesman, "Enhanced waveform interpolative coding at low bitrate," IEEE Trans. Speech Audio Process., vol. 9, no. 8, pp. 786-798, 2001.
    • (2001) IEEE Trans. Speech Audio Process., vol. 9, no. 8, pp. 786-798
    • Gottesman, O.
  • 14
    • 27644568046
    • A sinusoidal voice over packet coder tailored for the frame-erasure channel
    • J. Lindblom, "A sinusoidal voice over packet coder tailored for the frame-erasure channel," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 787-798, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 787-798
    • Lindblom, J.
  • 15
    • 65249102427
    • Spectral envelope and phase optimization for sinusoidal speech coding
    • X. Sun, B. Cheetham, and W. Wong, "Spectral envelope and phase optimization for sinusoidal speech coding," in Proc. IEEE Workshop Speech Coding for Telecomm., Annapolis, MD, 1995, pp. 75-76.
    • (1995) Proc. IEEE Workshop Speech Coding for Telecomm., pp. 75-76
    • Sun, X.; Cheetham, B.; Wong, W.
  • 16
    • 0015015215
    • Effect of glottal pulse shape on the quality of natural vowels
    • A. Rosenberg, "Effect of glottal pulse shape on the quality of natural vowels," J. Acoust. Soc. Amer., vol. 49, no. 2, pp. 583-590, 1971.
    • (1971) J. Acoust. Soc. Amer., vol. 49, no. 2, pp. 583-590
    • Rosenberg, A.
  • 17
    • 0030648395
    • Phase modelling of speech excitation for low bit-rate sinusoidal transform coding
    • X. Sun, F. Plante, B. M. Cheetham, and K. W. Wong, "Phase modelling of speech excitation for low bit-rate sinusoidal transform coding," in Proc. ICASSP, Munich, Germany, Apr. 1997, vol. 3, pp. 1691-1694.
    • (1997) Proc. ICASSP, vol. 3, pp. 1691-1694
    • Sun, X.; Plante, F.; Cheetham, B.M.; Wong, K.W.
  • 18
    • 0033709105
    • On the implementation of the harmonics-plus-noise model for concatenative speech synthesis
    • Y. Stylianou, "On the implementation of the harmonics-plus-noise model for concatenative speech synthesis," in Proc. ICASSP, Istanbul, Turkey, 2000, vol. 2, pp. 957-960.
    • (2000) Proc. ICASSP, vol. 2, pp. 957-960
    • Stylianou, Y.
  • 19
    • 85009241677
    • Reducing the footprint of the IBM trainable synthesis system
    • D. Chazan, R. Hoory, Z. Kons, D. Silberstein, and A. Sorin, "Reducing the footprint of the IBM trainable synthesis system," in Proc. 7th Int. Conf. Spoken Lang. Process., Denver, CO, 2002, pp. 2381-2384.
  • 20
    • 0033693078
    • An embedded sinusoidal transform codec with measured phases and sampling rate scalability
    • G. Aguilar, J.-H. Chen, R. B. Dunn, and R. J. McAulay, "An embedded sinusoidal transform codec with measured phases and sampling rate scalability," in Proc. ICASSP, Istanbul, Turkey, 2000, pp. 141-144.
    • (2000) Proc. ICASSP, pp. 141-144
    • Aguilar, G.; Chen, J.-H.; Dunn, R.B.; McAulay, R.J.
  • 21
    • 33745216013
    • Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling
    • D. Chazan, R. Hoory, Z. Kons, A. Sagi, S. Shechtman, and A. Sorin, "Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling," in Proc. Interspeech, 2005, pp. 2569-2572.
    • (2005) Proc. Interspeech, pp. 2569-2572
    • Chazan, D.; Hoory, R.; Kons, Z.; Sagi, A.; Shechtman, S.; Sorin, A.
  • 22
    • 0043069843
    • Squared error as a measure of perceived phase distortion
    • H. Pobloth and W. B. Kleijn, "Squared error as a measure of perceived phase distortion," J. Acoust. Soc. Amer., vol. 114, no. 2, pp. 1081-1094, 2003.
    • (2003) J. Acoust. Soc. Amer., vol. 114, no. 2, pp. 1081-1094
    • Pobloth, H.; Kleijn, W.B.
  • 23
    • 37149041104
    • On the perceptual weighting function for phase quantization of speech
    • D.-S. Kim and M. Y. Kim, "On the perceptual weighting function for phase quantization of speech," in Proc. IEEE Workshop Speech Coding, Delavan, WI, 2000, pp. 62-64.
    • (2000) Proc. IEEE Workshop Speech Coding, pp. 62-64
    • Kim, D.-S.; Kim, M.Y.
  • 24
    • 4544315090
    • Gaussian mixture models in compression and communication
    • A. D. Subramaniam, "Gaussian mixture models in compression and communication," Ph.D. dissertation, Univ. of California, San Diego, 2003.
    • (2003)
    • Subramaniam, A.D.
  • 27
    • 33847380259
    • Hidden Markov models for circular and linear-circular time series
    • H. Holzmann, A. Munk, M. Suster, and W. Zucchini, "Hidden Markov models for circular and linear-circular time series," J. Environ. Ecol. Statist., vol. 13, no. 3, pp. 325-347, 2006.
    • (2006) J. Environ. Ecol. Statist., vol. 13, no. 3, pp. 325-347
    • Holzmann, H.; Munk, A.; Suster, M.; Zucchini, W.
  • 30
    • 0003447548
    • Harmonic-plus-noise models for speech, combined with statistical methods for speech and speaker modification
    • Y. Stylianou, "Harmonic-plus-noise models for speech, combined with statistical methods for speech and speaker modification," Ph.D. dissertation, Ecole Nationale Superieure des Telecomm., Paris, France, 1996.
    • (1996)
    • Stylianou, Y.
  • 31
  • 32
    • 27344444890
    • Directional features in online handwriting recognition
    • C. Bahlmann, "Directional features in online handwriting recognition," Pattern Recognition, vol. 39, pp. 115-125, 2006.
    • (2006) Pattern Recognition, vol. 39, pp. 115-125
    • Bahlmann, C.
  • 38
    • 65249103569
    • The harmonic model codec framework for VoIP
    • Y. Agiomyrgiannakis and Y. Stylianou, "The harmonic model codec framework for VoIP," in Proc. Interspeech, Antwerp, Belgium, 2007, pp. 1681-1684.
    • (2007) Proc. Interspeech, pp. 1681-1684
    • Agiomyrgiannakis, Y.; Stylianou, Y.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.