SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 16, Issue 1, 2008, Pages 57-64

Generalized postfilter for speech quality enhancement

(4) Grancharov, Volodya a,b Plasberg, Jan H a Samuelsson, Jonas a Kleijn, W Bastiaan a,c

a ROYAL INSTITUTE OF TECHNOLOGY (Sweden)

b ERICSSON RESEARCH (Sweden)

c Coding Technologies AB ^* (Sweden)

Author keywords

Additive noise; Distortion measure; Multiplicative noise; Noise reduction; Perceptually optimal processing; Postfilter; Speech coding; Speech enhancement; Tandeming

Indexed keywords

DISTORTION MEASURE; MULTIPLICATIVE NOISE; NOISE REDUCTION; PERCEPTUALLY OPTIMAL PROCESSING; POSTFILTER; TANDEMING;

ACOUSTIC NOISE MEASUREMENT; ADDITIVE NOISE; COMPUTATIONAL COMPLEXITY; ENCODING (SYMBOLS); PHASE NOISE; SPEECH CODING; SPEECH ENHANCEMENT;

OPTIMIZATION;

EID: 64849092071 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.909327 Document Type: Article

Times cited : (23)

References (51)

1
- 0022219187
- Code-excited linear prediction (CELP): High-quality speech at very low bit rates
- M. Schroeder and B. Atal, "Code-excited linear prediction (CELP): High-quality speech at very low bit rates," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1985, vol. 10, pp. 937-940.
- (1985) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.10 , pp. 937-940
- Schroeder, M.¹ Atal, B.²

2
- 84961820981
- Reverse water-filling in predictive encoding of speech
- S. V. Andersen and W. B. Kleijn, "Reverse water-filling in predictive encoding of speech," in Proc. IEEE Workshop Speech Coding, 1999, vol. 3, pp. 105-107.
- (1999) Proc. IEEE Workshop Speech Coding , vol.3 , pp. 105-107
- Andersen, S.V.¹ Kleijn, W.B.²

3
- 0023963759
- Enhancement of ADPCM speech coding with backward-adaptive algorithms for post-filtering and noise feedback
- Feb
- V. Ramamoorthy, N. Jayant, R. Cox, and M. Sondhi, "Enhancement of ADPCM speech coding with backward-adaptive algorithms for post-filtering and noise feedback," IEEE J. Select. Areas Commun., vol. 6. no. 2, pp. 364-382, Feb. 1988.
- (1988) IEEE J. Select. Areas Commun , vol.6 , Issue.2 , pp. 364-382
- Ramamoorthy, V.¹ Jayant, N.² Cox, R.³ Sondhi, M.⁴

4
- 0029219433
- Adaptive postfiltering for quality enhancement of coded speech
- Jan
- J.-H. Chen and A. Gersho, "Adaptive postfiltering for quality enhancement of coded speech," IEEE Trans. Speech Audio Process., vol. 3, no. l,pp. 59-71, Jan. 1995.
- (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.L , pp. 59-71
- Chen, J.-H.¹ Gersho, A.²

5
- 0023833737
- Improved speech quality and efficient vector quantization in SELP
- W. B. Kleijn, D. Krasinski, and R. Ketchum, "Improved speech quality and efficient vector quantization in SELP." in Pwc. IEEE Int. Conf. Acoust, Speech, Signal Process., 1988, pp. 155-158.
- (1988) Pwc. IEEE Int. Conf. Acoust, Speech, Signal Process , pp. 155-158
- Kleijn, W.B.¹ Krasinski, D.² Ketchum, R.³

6
- 11744253565
- Sine-Wave Amplitude Coding at Low Data Rates
- B. Atal, V. Cuperman, and A. Gersho, Eds. New York: Kfuwer
- R. McAulay, T. Parks, T. Quatieri, and M. Sabin, Sine-Wave Amplitude Coding at Low Data Rates, in Advances in Speech Coding. B. Atal, V. Cuperman, and A. Gersho, Eds. New York: Kfuwer, 1991.
- (1991) Advances in Speech Coding
- McAulay, R.¹ Parks, T.² Quatieri, T.³ Sabin, M.⁴

7
- 0026373183
- Adaptive postfiltering for enhancement of noisy speech in the frequency domain
- P. Kabal, F. Wang, D. O'Shaughnessy, and R. Ramachandran, "Adaptive postfiltering for enhancement of noisy speech in the frequency domain," in Proc. IEEE Int. Symp. Circuits Syst., 1991, pp. 312-315.
- (1991) Proc. IEEE Int. Symp. Circuits Syst , pp. 312-315
- Kabal, P.¹ Wang, F.² O'Shaughnessy, D.³ Ramachandran, R.⁴

8
- 33745207538
- Perceptual postfilter estimation for low bit rate speech coders using Gaussian mixture models
- W.-Y. Chen, P. Kabal, and T. Shabestary, "Perceptual postfilter estimation for low bit rate speech coders using Gaussian mixture models," in Proc. Interspeech, 2005, pp. 3161-3164.
- (2005) Proc. Interspeech , pp. 3161-3164
- Chen, W.-Y.¹ Kabal, P.² Shabestary, T.³

9
- 0032639678
- An adaptive post-filtering technique based on the modified Yule-Walker filter
- A. Mustapha and S. Yeldener, "An adaptive post-filtering technique based on the modified Yule-Walker filter," in Pwc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1999, pp. 197-200.
- (1999) Pwc. IEEE Int. Conf. Acoust., Speech, Signal Process , pp. 197-200
- Mustapha, A.¹ Yeldener, S.²

10
- 0141494379
- Very low bit rate speech coding in tandem connections
- R. C. de Lamare and A. Alcaim, "Very low bit rate speech coding in tandem connections," Election. Lett., vol. 39, pp. 1356-1357, 2003.
- (2003) Election. Lett , vol.39 , pp. 1356-1357
- de Lamare, R.C.¹ Alcaim, A.²

11
- 17244378082
- Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec
- R. C. de Lamare and A. Alcaim, "Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec." Proc. Inst. Electron. Eng., vol. 152, pp. 74-86. 2005.
- (2005) Proc. Inst. Electron. Eng , vol.152 , pp. 74-86
- de Lamare, R.C.¹ Alcaim, A.²

12
- 84961779105
- Enhancement of coded speech by constrained optimization
- W. B. Kleijn, "Enhancement of coded speech by constrained optimization," in Proc. IEEE Workshop Speech Coding, 2002. pp. 163-165.
- (2002) Proc. IEEE Workshop Speech Coding , pp. 163-165
- Kleijn, W.B.¹

13
- 64849086549
- Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s, ITU-T. Rec. G.723.1, 1996.
- "Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s," ITU-T. Rec. G.723.1, 1996.

14
- 0003579084
- Englewood Cliffs, NJ: Prentice-Hall
- S. Jayant and P. Noll, Digital Coding of Waveforms. Englewood Cliffs', NJ: Prentice-Hall, 1984.
- (1984) Digital Coding of Waveforms
- Jayant, S.¹ Noll, P.²

15
- 4544259344
- Noise-depen-dent postfiltering
- V. Grancharov, J. Samuelsson, and W. B. Kleijn, "Noise-depen-dent postfiltering," in Proc. IEEE Int. Conf. Acoust, Speech, Signal Process., 2004, vol. 1, pp. 457-160.
- (2004) Proc. IEEE Int. Conf. Acoust, Speech, Signal Process , vol.1 , pp. 457-160
- Grancharov, V.¹ Samuelsson, J.² Kleijn, W.B.³

16
- 64849095185
- J. Lim, Ed, Englewood Cliffs, NJ: Prentice Hall
- J. Lim, Ed., Speech Enliancement. Englewood Cliffs, NJ: Prentice Hall, 1983.
- (1983) Speech Enliancement

17
- 0019009880
- Speech enhancement using a soft-decision noise suppression filter
- Apr
- R. McAulay and M. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 2, pp. 137-145, Apr. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.2 , pp. 137-145
- McAulay, R.¹ Malpass, M.²

18
- 0025587084
- A minimum mean square error approach for speech enhancement
- Y. Ephraim, "A minimum mean square error approach for speech enhancement," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1990, vol. 2, pp. 829-832.
- (1990) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 829-832
- Ephraim, Y.¹

19
- 0021892216
- Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
- Apr
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 2, pp. 443-445, Apr. 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-33 , Issue.2 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

20
- 0014814553
- Transmission of noisy information to a noisy receiver with minimum distortion
- Jul
- J. Wolf and J. Ziv, "Transmission of noisy information to a noisy receiver with minimum distortion," IEEE Trans. Inf. Theory, vol. IT-16, no. 4, pp. 406-411, Jul. 1970.
- (1970) IEEE Trans. Inf. Theory , vol.IT-16 , Issue.4 , pp. 406-411
- Wolf, J.¹ Ziv, J.²

21
- 0024035451
- A unified approach for encoding clean and noisy sources by means of waveform and autoregressive model vector quantization
- Jul
- Y. Ephraim and R. Gray, "A unified approach for encoding clean and noisy sources by means of waveform and autoregressive model vector quantization," IEEE Trans. Inf. Theory, vol. 34, no. 4, pp. 826-834, Jul. 1988.
- (1988) IEEE Trans. Inf. Theory , vol.34 , Issue.4 , pp. 826-834
- Ephraim, Y.¹ Gray, R.²

22
- 84960882760
- Study of the influence on noise pre-processing on the performance of low bit rate parameteic speech coder
- G. Guilmin, R. Bouquin-Jeannes, and P. Gournay, "Study of the influence on noise pre-processing on the performance of low bit rate parameteic speech coder," in Proc. Eurospeech, 1999, vol. 3, pp. 2367-2370.
- (1999) Proc. Eurospeech , vol.3 , pp. 2367-2370
- Guilmin, G.¹ Bouquin-Jeannes, R.² Gournay, P.³

23
- 0034464407
- Compressed domain noise reduction and echo suppression for network speech enhancement
- R. Chandran and D. Marchok, "Compressed domain noise reduction and echo suppression for network speech enhancement," in Proc. 43rd Midwest Symp. Circuits Syst., 2000, vol. 1, pp. 10-13.
- (2000) Proc. 43rd Midwest Symp. Circuits Syst , vol.1 , pp. 10-13
- Chandran, R.¹ Marchok, D.²

24
- 4544369711
- Noise reduction on speech codec parameters
- H. Taddei, C. Beaugeant, and M. de Meuleneire, "Noise reduction on speech codec parameters," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Pmcess., 2004, vol. 1, pp. 497-500.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Pmcess , vol.1 , pp. 497-500
- Taddei, H.¹ Beaugeant, C.² de Meuleneire, M.³

25
- 0025630302
- Adaptive postfiltering applied to speech in noise
- R. Conway, T. Sreenivas, and R. Niederjohn, "Adaptive postfiltering applied to speech in noise," in Proc. 32nd Midwest 1989 Symp. Circuits Syst., 1989, pp. 101-104.
- (1989) Proc. 32nd Midwest 1989 Symp. Circuits Syst , pp. 101-104
- Conway, R.¹ Sreenivas, T.² Niederjohn, R.³

26
- 0032075135
- Speaker identification based on the use of robust cepstral features obtained from pole-zero transfer function
- May
- M. Zilovic, R. Ramachandran, and R. Mammone, "Speaker identification based on the use of robust cepstral features obtained from pole-zero transfer function," IEEE Trans. Speech Audio Process., vol. 6, no. 3, pp. 260-267, May 1998.
- (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.3 , pp. 260-267
- Zilovic, M.¹ Ramachandran, R.² Mammone, R.³

27
- 0003637864
- W. B. Kleijn and K. K. Paliwal, Eds, Amsterdam, The Netherlands: Elsevier
- W. B. Kleijn and K. K. Paliwal, Eds., Speech Coding and Synthesis. Amsterdam, The Netherlands: Elsevier, 1995.
- (1995) Speech Coding and Synthesis

28
- 64849094482
- Coding of speech at 8 kbit/s using conjugate-structure algebraic-code- excited linear prediction (CS-ACELP), ITU-T. Rec. G.729, 1996.
- "Coding of speech at 8 kbit/s using conjugate-structure algebraic-code- excited linear prediction (CS-ACELP)," ITU-T. Rec. G.729, 1996.

29
- 64849111468
- AMR Speech Codec; transcoding functions, 3GPP TS 26.090, 2004.
- "AMR Speech Codec; transcoding functions," 3GPP TS 26.090, 2004.

30
- 0035396555
- Noise power spectral density estimation based on optimal smoothing and minimum statistics
- Jul
- R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504-512, Jul. 2001.
- (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.5 , pp. 504-512
- Martin, R.¹

31
- 0033693215
- Quantile based noise estimation for spectral subtraction and Wiener filtering
- V. Stahl, A. Fischer, and R. Bippus, "Quantile based noise estimation for spectral subtraction and Wiener filtering," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2000, vol. 3, pp. 1875-1878.
- (2000) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.3 , pp. 1875-1878
- Stahl, V.¹ Fischer, A.² Bippus, R.³

32
- 33745197746
- Distortion measures for vector quantization of noisy spectrum
- V. Grancharov, J. Samuelsson, and W. B. Kleijn, "Distortion measures for vector quantization of noisy spectrum," in Proc. Interspeech, 2005, pp. 3173-3176.
- (2005) Proc. Interspeech , pp. 3173-3176
- Grancharov, V.¹ Samuelsson, J.² Kleijn, W.B.³

33
- 0028997012
- Spectral dynamics is more important than spectral distortion
- H. Knagenhjelm and W. B. Kleijn, "Spectral dynamics is more important than spectral distortion," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1995, vol. 1, pp. 732-735.
- (1995) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 732-735
- Knagenhjelm, H.¹ Kleijn, W.B.²

34
- 85008572471
- Variable rate wideband speech coding using perceptually motivated thresholds
- J. Paulus, "Variable rate wideband speech coding using perceptually motivated thresholds." in Proc. IEEE Workshop Speech Coding, 1995, pp. 35-36.
- (1995) Proc. IEEE Workshop Speech Coding , pp. 35-36
- Paulus, J.¹

35
- 0003425258
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals. Englewood Cliffs, NJ: Prentice-Hall. 1978.
- (1978) Digital Processing of Speech Signals
- Rabiner, L.R.¹ Schafer, R.W.²

36
- 0020148958
- Synthesis by spectral amplitude and brightness matching of analyzed musical instrument tones
- J. Beauchamp, "Synthesis by spectral amplitude and brightness matching of analyzed musical instrument tones," J. Audio Eng. Soc, vol. 30, pp. 396406, 1982.
- (1982) J. Audio Eng. Soc , vol.30 , pp. 396406
- Beauchamp, J.¹

37
- 2442472100
- Time evolution in LPC spectrum coding
- May
- F. Norden and T. Eriksson, "Time evolution in LPC spectrum coding," IEEE Trans. Speech Audio Process., vol. 12, no. 3, pp. 290-301, May 2004.
- (2004) IEEE Trans. Speech Audio Process , vol.12 , Issue.3 , pp. 290-301
- Norden, F.¹ Eriksson, T.²

38
- 0036295827
- Speech enhancement based on auditory spectral change
- T. Quatieri and R. Dunn, "Speech enhancement based on auditory spectral change," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2002, vol. 1, pp. 257-260.
- (2002) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 257-260
- Quatieri, T.¹ Dunn, R.²

39
- 84979940385
- The sensitivity matrix fora spectro-temporal auditory model
- J. Plasberg, D. Y. Zhao, and W. B. Kleijn, "The sensitivity matrix fora spectro-temporal auditory model," in Proc. XII Eur. Signal Process. Conf, 2004, pp. 1673-1676.
- (2004) Proc. XII Eur. Signal Process. Conf , pp. 1673-1676
- Plasberg, J.¹ Zhao, D.Y.² Kleijn, W.B.³

40
- 0029952425
- A quantitative model of the effective signal processing in the auditory system. I. Model sttucture
- T. Dau, D. Püschel, and A. Kohlrausch, "A quantitative model of the effective signal processing in the auditory system. I. Model sttucture," J Acoust. Soc. Amer., vol. 99, pp. 3615-3622, 1996.
- (1996) J Acoust. Soc. Amer , vol.99 , pp. 3615-3622
- Dau, T.¹ Püschel, D.² Kohlrausch, A.³

41
- 47649083103
- The sensitivity matrix: Using advanced auditory models in speech and audio processing
- Jan
- J. Plasberg and W. B. Kleijn, "The sensitivity matrix: Using advanced auditory models in speech and audio processing," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 310-319, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.1 , pp. 310-319
- Plasberg, J.¹ Kleijn, W.B.²

42
- 64849099262
- DARPA-TIMIT, Acoustic-Phonetic Continuous Speech Corpus, NIST Speech Disc 1-1.1, 1990.
- DARPA-TIMIT, "Acoustic-Phonetic Continuous Speech Corpus, NIST Speech Disc 1-1.1," 1990.

43
- 84868933707
- Available
- [Online]. Available: http://www.elda.org/catalogue/en/speech/S0156.html

44
- 64849106906
- A. Varga, H. Steeneken, M. Tomlinson, and D. Jones, The NOISEX-92 Study on the Effect of Additive Noise on Automatic Speech Recognition, 1992.
- A. Varga, H. Steeneken, M. Tomlinson, and D. Jones, "The NOISEX-92 Study on the Effect of Additive Noise on Automatic Speech Recognition, 1992.

45
- 64849094964
- ITU-T Coded-Speech Database, ITU-T Rec. P. Supplement 23, 1998.
- "ITU-T Coded-Speech Database," ITU-T Rec. P. Supplement 23, 1998.

46
- 0000169232
- An algorithm for least-squares estimation of nonlinear parameters
- D. Marquardt, "An algorithm for least-squares estimation of nonlinear parameters," SIAM J. Appl. Math., vol. 11, pp. 431-441, 1963.
- (1963) SIAM J. Appl. Math , vol.11 , pp. 431-441
- Marquardt, D.¹

47
- 0004349049
- Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems
- TIA/EIA/IS-127
- "Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems," TIA/EIA/IS-127, 1997.
- (1997)

48
- 33947667509
- Subjective comparison of speech enhancement algorithms
- Y. Hu and P. Loizou, "Subjective comparison of speech enhancement algorithms," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2006, vol. 1, pp. 153-156.
- (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 153-156
- Hu, Y.¹ Loizou, P.²

49
- 64849108039
- Perceptual evaluation of speech quality (PESQ) ITU-T Rec. P. 862, 2001.
- Perceptual evaluation of speech quality (PESQ) ITU-T Rec. P. 862, 2001.

50
- 64849105323
- Methods for subjective determination of transmission quality, ITU-T Rec. P.800, 1996.
- "Methods for subjective determination of transmission quality," ITU-T Rec. P.800, 1996.

51
- 13344250603
- Method for the subjective assessment of intermediate quality level of coding systems
- ITU-R Rec. BS
- "Method for the subjective assessment of intermediate quality level of coding systems," ITU-R Rec. BS. 1534-1, 2005.
- (2005) , pp. 1534-1541

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.