Volume 16, Issue 1, 2008, Pages 57-64

Generalized postfilter for speech quality enhancement

Author keywords

Additive noise; Distortion measure; Multiplicative noise; Noise reduction; Perceptually optimal processing; Postfilter; Speech coding; Speech enhancement; Tandeming

Indexed keywords

DISTORTION MEASURE; MULTIPLICATIVE NOISE; NOISE REDUCTION; PERCEPTUALLY OPTIMAL PROCESSING; POSTFILTER; TANDEMING;

EID: 64849092071     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2007.909327     Document Type: Article
Times cited : (23)

References (51)
  • 1
    • 0022219187
    • Code-excited linear prediction (CELP): High-quality speech at very low bit rates
    • M. Schroeder and B. Atal, "Code-excited linear prediction (CELP): High-quality speech at very low bit rates," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1985, vol. 10, pp. 937-940.
    • (1985) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.10 , pp. 937-940
    • Schroeder, M.1    Atal, B.2
  • 2
    • 84961820981
    • Reverse water-filling in predictive encoding of speech
    • S. V. Andersen and W. B. Kleijn, "Reverse water-filling in predictive encoding of speech," in Proc. IEEE Workshop Speech Coding, 1999, vol. 3, pp. 105-107.
    • (1999) Proc. IEEE Workshop Speech Coding , vol.3 , pp. 105-107
    • Andersen, S.V.1    Kleijn, W.B.2
  • 3
    • 0023963759
    • Enhancement of ADPCM speech coding with backward-adaptive algorithms for post-filtering and noise feedback
    • Feb
    • V. Ramamoorthy, N. Jayant, R. Cox, and M. Sondhi, "Enhancement of ADPCM speech coding with backward-adaptive algorithms for post-filtering and noise feedback," IEEE J. Select. Areas Commun., vol. 6, no. 2, pp. 364-382, Feb. 1988.
    • (1988) IEEE J. Select. Areas Commun , vol.6 , Issue.2 , pp. 364-382
    • Ramamoorthy, V.1    Jayant, N.2    Cox, R.3    Sondhi, M.4
  • 4
    • 0029219433
    • Adaptive postfiltering for quality enhancement of coded speech
    • Jan
    • J.-H. Chen and A. Gersho, "Adaptive postfiltering for quality enhancement of coded speech," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 59-71, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.1 , pp. 59-71
    • Chen, J.-H.1    Gersho, A.2
  • 6
    • 11744253565
    • Sine-Wave Amplitude Coding at Low Data Rates
    • B. Atal, V. Cuperman, and A. Gersho, Eds. New York: Kluwer
    • R. McAulay, T. Parks, T. Quatieri, and M. Sabin, "Sine-Wave Amplitude Coding at Low Data Rates," in Advances in Speech Coding, B. Atal, V. Cuperman, and A. Gersho, Eds. New York: Kluwer, 1991.
    • (1991) Advances in Speech Coding
    • McAulay, R.1    Parks, T.2    Quatieri, T.3    Sabin, M.4
  • 8
    • 33745207538
    • Perceptual postfilter estimation for low bit rate speech coders using Gaussian mixture models
    • W.-Y. Chen, P. Kabal, and T. Shabestary, "Perceptual postfilter estimation for low bit rate speech coders using Gaussian mixture models," in Proc. Interspeech, 2005, pp. 3161-3164.
    • (2005) Proc. Interspeech , pp. 3161-3164
    • Chen, W.-Y.1    Kabal, P.2    Shabestary, T.3
  • 10
    • 0141494379
    • Very low bit rate speech coding in tandem connections
    • R. C. de Lamare and A. Alcaim, "Very low bit rate speech coding in tandem connections," Electron. Lett., vol. 39, pp. 1356-1357, 2003.
    • (2003) Electron. Lett , vol.39 , pp. 1356-1357
    • de Lamare, R.C.1    Alcaim, A.2
  • 11
    • 17244378082
    • Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec
    • R. C. de Lamare and A. Alcaim, "Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec," Proc. Inst. Electron. Eng., vol. 152, pp. 74-86, 2005.
    • (2005) Proc. Inst. Electron. Eng , vol.152 , pp. 74-86
    • de Lamare, R.C.1    Alcaim, A.2
  • 12
    • 84961779105
    • Enhancement of coded speech by constrained optimization
    • W. B. Kleijn, "Enhancement of coded speech by constrained optimization," in Proc. IEEE Workshop Speech Coding, 2002, pp. 163-165.
    • (2002) Proc. IEEE Workshop Speech Coding , pp. 163-165
    • Kleijn, W.B.1
  • 13
    • 64849086549
    • Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s, ITU-T Rec. G.723.1, 1996.
    • "Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s," ITU-T Rec. G.723.1, 1996.
  • 16
    • 64849095185
    • J. Lim, Ed., Englewood Cliffs, NJ: Prentice Hall
    • J. Lim, Ed., Speech Enhancement. Englewood Cliffs, NJ: Prentice Hall, 1983.
    • (1983) Speech Enhancement
  • 17
    • 0019009880
    • Speech enhancement using a soft-decision noise suppression filter
    • Apr
    • R. McAulay and M. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 2, pp. 137-145, Apr. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.2 , pp. 137-145
    • McAulay, R.1    Malpass, M.2
  • 18
  • 19
    • 0021892216
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • Apr
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 2, pp. 443-445, Apr. 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 20
    • 0014814553
    • Transmission of noisy information to a noisy receiver with minimum distortion
    • Jul
    • J. Wolf and J. Ziv, "Transmission of noisy information to a noisy receiver with minimum distortion," IEEE Trans. Inf. Theory, vol. IT-16, no. 4, pp. 406-411, Jul. 1970.
    • (1970) IEEE Trans. Inf. Theory , vol.IT-16 , Issue.4 , pp. 406-411
    • Wolf, J.1    Ziv, J.2
  • 21
    • 0024035451
    • A unified approach for encoding clean and noisy sources by means of waveform and autoregressive model vector quantization
    • Jul
    • Y. Ephraim and R. Gray, "A unified approach for encoding clean and noisy sources by means of waveform and autoregressive model vector quantization," IEEE Trans. Inf. Theory, vol. 34, no. 4, pp. 826-834, Jul. 1988.
    • (1988) IEEE Trans. Inf. Theory , vol.34 , Issue.4 , pp. 826-834
    • Ephraim, Y.1    Gray, R.2
  • 22
    • 84960882760
    • Study of the influence on noise pre-processing on the performance of low bit rate parametric speech coder
    • G. Guilmin, R. Bouquin-Jeannes, and P. Gournay, "Study of the influence on noise pre-processing on the performance of low bit rate parametric speech coder," in Proc. Eurospeech, 1999, vol. 3, pp. 2367-2370.
    • (1999) Proc. Eurospeech , vol.3 , pp. 2367-2370
    • Guilmin, G.1    Bouquin-Jeannes, R.2    Gournay, P.3
  • 23
    • 0034464407
    • Compressed domain noise reduction and echo suppression for network speech enhancement
    • R. Chandran and D. Marchok, "Compressed domain noise reduction and echo suppression for network speech enhancement," in Proc. 43rd Midwest Symp. Circuits Syst., 2000, vol. 1, pp. 10-13.
    • (2000) Proc. 43rd Midwest Symp. Circuits Syst , vol.1 , pp. 10-13
    • Chandran, R.1    Marchok, D.2
  • 26
    • 0032075135
    • Speaker identification based on the use of robust cepstral features obtained from pole-zero transfer function
    • May
    • M. Zilovic, R. Ramachandran, and R. Mammone, "Speaker identification based on the use of robust cepstral features obtained from pole-zero transfer function," IEEE Trans. Speech Audio Process., vol. 6, no. 3, pp. 260-267, May 1998.
    • (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.3 , pp. 260-267
    • Zilovic, M.1    Ramachandran, R.2    Mammone, R.3
  • 27
    • 0003637864
    • W. B. Kleijn and K. K. Paliwal, Eds, Amsterdam, The Netherlands: Elsevier
    • W. B. Kleijn and K. K. Paliwal, Eds., Speech Coding and Synthesis. Amsterdam, The Netherlands: Elsevier, 1995.
    • (1995) Speech Coding and Synthesis
  • 28
    • 64849094482
    • Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear prediction (CS-ACELP), ITU-T Rec. G.729, 1996.
    • "Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear prediction (CS-ACELP)," ITU-T Rec. G.729, 1996.
  • 29
    • 64849111468
    • AMR Speech Codec; transcoding functions, 3GPP TS 26.090, 2004.
    • "AMR Speech Codec; transcoding functions," 3GPP TS 26.090, 2004.
  • 30
    • 0035396555
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • Jul
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504-512, Jul. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 32
    • 33745197746
    • Distortion measures for vector quantization of noisy spectrum
    • V. Grancharov, J. Samuelsson, and W. B. Kleijn, "Distortion measures for vector quantization of noisy spectrum," in Proc. Interspeech, 2005, pp. 3173-3176.
    • (2005) Proc. Interspeech , pp. 3173-3176
    • Grancharov, V.1    Samuelsson, J.2    Kleijn, W.B.3
  • 34
    • 85008572471
    • Variable rate wideband speech coding using perceptually motivated thresholds
    • J. Paulus, "Variable rate wideband speech coding using perceptually motivated thresholds," in Proc. IEEE Workshop Speech Coding, 1995, pp. 35-36.
    • (1995) Proc. IEEE Workshop Speech Coding , pp. 35-36
    • Paulus, J.1
  • 36
    • 0020148958
    • Synthesis by spectral amplitude and brightness matching of analyzed musical instrument tones
    • J. Beauchamp, "Synthesis by spectral amplitude and brightness matching of analyzed musical instrument tones," J. Audio Eng. Soc., vol. 30, pp. 396-406, 1982.
    • (1982) J. Audio Eng. Soc , vol.30 , pp. 396-406
    • Beauchamp, J.1
  • 37
    • 2442472100
    • Time evolution in LPC spectrum coding
    • May
    • F. Norden and T. Eriksson, "Time evolution in LPC spectrum coding," IEEE Trans. Speech Audio Process., vol. 12, no. 3, pp. 290-301, May 2004.
    • (2004) IEEE Trans. Speech Audio Process , vol.12 , Issue.3 , pp. 290-301
    • Norden, F.1    Eriksson, T.2
  • 40
    • 0029952425
    • A quantitative model of the effective signal processing in the auditory system. I. Model structure
    • T. Dau, D. Püschel, and A. Kohlrausch, "A quantitative model of the effective signal processing in the auditory system. I. Model structure," J. Acoust. Soc. Amer., vol. 99, pp. 3615-3622, 1996.
    • (1996) J. Acoust. Soc. Amer , vol.99 , pp. 3615-3622
    • Dau, T.1    Püschel, D.2    Kohlrausch, A.3
  • 41
    • 47649083103
    • The sensitivity matrix: Using advanced auditory models in speech and audio processing
    • Jan
    • J. Plasberg and W. B. Kleijn, "The sensitivity matrix: Using advanced auditory models in speech and audio processing," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 310-319, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.1 , pp. 310-319
    • Plasberg, J.1    Kleijn, W.B.2
  • 42
    • 64849099262
    • DARPA-TIMIT, Acoustic-Phonetic Continuous Speech Corpus, NIST Speech Disc 1-1.1, 1990.
    • DARPA-TIMIT, "Acoustic-Phonetic Continuous Speech Corpus, NIST Speech Disc 1-1.1," 1990.
  • 43
    • 84868933707
    • Available
    • [Online]. Available: http://www.elda.org/catalogue/en/speech/S0156.html
  • 44
    • 64849106906
    • A. Varga, H. Steeneken, M. Tomlinson, and D. Jones, The NOISEX-92 Study on the Effect of Additive Noise on Automatic Speech Recognition, 1992.
    • A. Varga, H. Steeneken, M. Tomlinson, and D. Jones, "The NOISEX-92 Study on the Effect of Additive Noise on Automatic Speech Recognition," 1992.
  • 45
    • 64849094964
    • ITU-T Coded-Speech Database, ITU-T Rec. P. Supplement 23, 1998.
    • "ITU-T Coded-Speech Database," ITU-T Rec. P. Supplement 23, 1998.
  • 46
    • 0000169232
    • An algorithm for least-squares estimation of nonlinear parameters
    • D. Marquardt, "An algorithm for least-squares estimation of nonlinear parameters," SIAM J. Appl. Math., vol. 11, pp. 431-441, 1963.
    • (1963) SIAM J. Appl. Math , vol.11 , pp. 431-441
    • Marquardt, D.1
  • 47
    • 0004349049
    • Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems
    • TIA/EIA/IS-127
    • "Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems," TIA/EIA/IS-127, 1997.
    • (1997)
  • 49
    • 64849108039
    • Perceptual evaluation of speech quality (PESQ), ITU-T Rec. P.862, 2001.
    • "Perceptual evaluation of speech quality (PESQ)," ITU-T Rec. P.862, 2001.
  • 50
    • 64849105323
    • Methods for subjective determination of transmission quality, ITU-T Rec. P.800, 1996.
    • "Methods for subjective determination of transmission quality," ITU-T Rec. P.800, 1996.
  • 51
    • 13344250603
    • Method for the subjective assessment of intermediate quality level of coding systems
    • ITU-R Rec. BS.1534-1
    • "Method for the subjective assessment of intermediate quality level of coding systems," ITU-R Rec. BS.1534-1, 2005.
    • (2005) ITU-R Rec. BS.1534-1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.