



Volume 89, Issue 10, 2009, Pages 2036-2044

A study of HMM-based bandwidth extension of speech signals

Author keywords

Bandwidth extension; Baum-Welch re-estimation algorithm; Dynamic measure; Log spectral distortion

Indexed keywords

BANDWIDTH EXTENSION; BAUM-WELCH; BAUM-WELCH RE-ESTIMATION ALGORITHM; DYNAMIC PERFORMANCE; ESTIMATION ALGORITHM; GAUSSIAN COMPONENTS; GAUSSIAN MIXTURE MODEL; GENERAL APPROACH; HMM MODELS; LOG SPECTRAL DISTORTION; NUMBER OF STATE; PERFORMANCE TESTS; REFERENCE METHOD; SPEECH SIGNALS; STATIC AND DYNAMIC; STATIC PERFORMANCE; TEST RESULTS; TRAINING ALGORITHMS;

EID: 67349141249     PISSN: 01651684     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.sigpro.2009.03.037     Document Type: Article
Times cited: 53

References (25)
  • 2
    • 33745188516
    • Bandwidth extension for speech
    • Larsen E., and Aarts R.M. (Eds), Wiley, New York (Chapter 6)
    • Jax P. Bandwidth extension for speech. In: Larsen E., and Aarts R.M. (Eds). Audio Bandwidth Extension (2004), Wiley, New York 171-235 (Chapter 6)
    • (2004) Audio Bandwidth Extension , pp. 171-235
    • Jax, P.1
  • 3
    • 4544244898
    • Feature selection for improved bandwidth extension of speech signals
    • Jax P., and Vary P. Feature selection for improved bandwidth extension of speech signals. ICASSP 1 (May 2004) 697-700
    • (2004) ICASSP , vol.1 , pp. 697-700
    • Jax, P.1    Vary, P.2
  • 4
    • 85065729672
    • Statistical recovery of wideband speech from narrow-band speech
    • Y.M. Cheng, D. O'Shaughnessy, P. Mermelstein, Statistical recovery of wideband speech from narrow-band speech, in: ICSLP 1992, 1992, pp. 1577-1580.
    • (1992) ICSLP 1992 , pp. 1577-1580
    • Cheng, Y.M.1    O'Shaughnessy, D.2    Mermelstein, P.3
  • 5
    • 0033692729
    • Narrowband to wideband conversion of speech using GMM based transformation
    • Park K.-Y., and Kim H.S. Narrowband to wideband conversion of speech using GMM based transformation. ICASSP 3 (June 2000) 1843-1846
    • (2000) ICASSP , vol.3 , pp. 1843-1846
    • Park, K.-Y.1    Kim, H.S.2
  • 6
    • 0038383054
    • On artificial bandwidth extension of telephone speech
    • Jax P., and Vary P. On artificial bandwidth extension of telephone speech. Signal Process 83 8 (August 2003) 1707-1719
    • (2003) Signal Process , vol.83 , Issue.8 , pp. 1707-1719
    • Jax, P.1    Vary, P.2
  • 7
    • 0141590582
    • Artificial bandwidth extension of speech signals using MMSE estimation based on a hidden Markov model
    • Jax P., and Vary P. Artificial bandwidth extension of speech signals using MMSE estimation based on a hidden Markov model. ICASSP 1 (April 2003) 680-683
    • (2003) ICASSP , vol.1 , pp. 680-683
    • Jax, P.1    Vary, P.2
  • 8
    • 0028997012
    • Spectral dynamics is more important than spectral distortion
    • Knagenhjelm H.P., and Kleijn W.B. Spectral dynamics is more important than spectral distortion. ICASSP 1 (May 1995) 665-668
    • (1995) ICASSP , vol.1 , pp. 665-668
    • Knagenhjelm, H.P.1    Kleijn, W.B.2
  • 9
    • 0034842441
    • A speech spectrum distortion measure with interframe memory
    • Norden F., and Eriksson T. A speech spectrum distortion measure with interframe memory. ICASSP 2 (May 2001) 717-720
    • (2001) ICASSP , vol.2 , pp. 717-720
    • Norden, F.1    Eriksson, T.2
  • 11
    • 33947636061
    • The effect of memory inclusion on mutual information between speech frequency bands
    • Nour-Eldin A.H., Shabestary T.Z., and Kabal P. The effect of memory inclusion on mutual information between speech frequency bands. ICASSP 3 (May 2006) 53-56
    • (2006) ICASSP , vol.3 , pp. 53-56
    • Nour-Eldin, A.H.1    Shabestary, T.Z.2    Kabal, P.3
  • 12
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner L.R. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 77 2 (1989) 257-286
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 13
    • 0018918171
    • An algorithm for vector quantizer design
    • Linde Y., Buzo A., and Gray R.M. An algorithm for vector quantizer design. IEEE Trans. Commun. 28 1 (1980) 84-95
    • (1980) IEEE Trans. Commun. , vol.28 , Issue.1 , pp. 84-95
    • Linde, Y.1    Buzo, A.2    Gray, R.M.3
  • 14
    • 0022883506
    • Hidden Markov models applied to very low bit rate speech coding
    • Farges E., and Clements M. Hidden Markov models applied to very low bit rate speech coding. ICASSP (April 1986) 433-436
    • (1986) ICASSP , pp. 433-436
    • Farges, E.1    Clements, M.2
  • 15
    • 0026390877
    • On the phonetic structure of a large hidden Markov model
    • Pepper D.J., and Clements M. On the phonetic structure of a large hidden Markov model. ICASSP (1991) 465-468
    • (1991) ICASSP , pp. 465-468
    • Pepper, D.J.1    Clements, M.2
  • 16
    • 33744687395
    • Spectral quantization of cepstral coefficients
    • Hagen R. Spectral quantization of cepstral coefficients. ICASSP 1 (April 1994) 509-512
    • (1994) ICASSP , vol.1 , pp. 509-512
    • Hagen, R.1
  • 19
    • 67349196885
    • 3GPP TS 26.190, Adaptive multi-rate-wideband (AMR-WB) speech codec, Transcoding functions
    • 3GPP TS 26.190, Adaptive multi-rate-wideband (AMR-WB) speech codec, Transcoding functions.
  • 20
    • 67349174328
    • 3GPP TS 26.290, Extended adaptive multi-rate-wideband (AMR-WB+) codec, Transcoding functions
    • 3GPP TS 26.290, Extended adaptive multi-rate-wideband (AMR-WB+) codec, Transcoding functions.
  • 21
    • 0003812315
    • Information technology-coding of audiovisual objects, Part 3: Audio, Subparts: CELP
    • ISO/IEC 14496-3
    • ISO/IEC 14496-3, Information technology-coding of audiovisual objects, Part 3: audio, Subparts: CELP.
  • 23
    • 67349271332
    • 3GPP TS 26.796, Performance characterization of the adaptive multi-rate wideband (AMR-WB) speech codec
    • 3GPP TS 26.796, Performance characterization of the adaptive multi-rate wideband (AMR-WB) speech codec.
  • 24
    • 34547527271
    • Backwards compatible wideband telephony in mobile networks: CELP watermarking and bandwidth extension
    • Geiser B., and Vary P. Backwards compatible wideband telephony in mobile networks: CELP watermarking and bandwidth extension. ICASSP 4 (April 2007) 533-536
    • (2007) ICASSP , vol.4 , pp. 533-536
    • Geiser, B.1    Vary, P.2
  • 25
    • 34249986107
    • Predictive vector quantization of wideband LSF using narrowband LSF for bandwidth scalable coders
    • Ehara H., Morii T., and Yoshida K. Predictive vector quantization of wideband LSF using narrowband LSF for bandwidth scalable coders. Speech Commun. 49 (2007) 490-500
    • (2007) Speech Commun. , vol.49 , pp. 490-500
    • Ehara, H.1    Morii, T.2    Yoshida, K.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.