메뉴 건너뛰기




Volumn 8, Issue 2, 2014, Pages 119-130

Voicing detection based on adaptive aperiodicity thresholding for speech enhancement in non-stationary noise

Author keywords

[No Author keywords available]

Indexed keywords

CLASSIFICATION (OF INFORMATION); HEARING AIDS; SPEECH ENHANCEMENT;

EID: 84897500022     PISSN: 17519675     EISSN: 17519683     Source Type: Journal    
DOI: 10.1049/iet-spr.2011.0224     Document Type: Article
Times cited : (4)

References (28)
  • 1
    • 0032654850 scopus 로고    scopus 로고
    • Cepstrum-based pitch detection using a new statistical V/UV classification algorithm
    • Ahmadi, S., Spanias, A.: 'Cepstrum-based pitch detection using a new statistical V/UV classification algorithm', IEEE Trans. Speech Audio Process., 1999, 7, (3), pp. 333-338
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.3 , pp. 333-338
    • Ahmadi, S.1    Spanias, A.2
  • 2
    • 0022790593 scopus 로고
    • Adaptive comb filtering for harmonic signal enhancement
    • Nehorai, A., Porat, B.: 'Adaptive comb filtering for harmonic signal enhancement', IEEE Trans. Acoust. Speech Signal Process., 1986, 34, (5), pp. 1124-1138
    • (1986) IEEE Trans. Acoust. Speech Signal Process. , vol.34 , Issue.5 , pp. 1124-1138
    • Nehorai, A.1    Porat, B.2
  • 3
    • 0031232722 scopus 로고    scopus 로고
    • Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model
    • George, E., Smith, M.: 'Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model', IEEE Trans. Speech Audio Process., 1997, 5, (5), pp. 389-406
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.5 , pp. 389-406
    • George, E.1    Smith, M.2
  • 4
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Boll, S.: 'Suppression of acoustic noise in speech using spectral subtraction', IEEE Trans. Acoust. Speech Signal Process., 1979, 27, (2), pp. 113-120
    • (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , Issue.2 , pp. 113-120
    • Boll, S.1
  • 5
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
    • Ephraim, Y., Malah, D.: 'Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator', IEEE Trans. Acoust. Speech Signal Process., 1984, 32, (6), pp. 1109-1121
    • (1984) IEEE Trans. Acoust. Speech Signal Process. , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 6
    • 0029345417 scopus 로고
    • A signal subspace approach for speech enhancement
    • Ephraim, Y., Van Trees, H.: 'A signal subspace approach for speech enhancement', IEEE Trans. Speech Audio Process., 1995, 3, (4), pp. 251-266
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.4 , pp. 251-266
    • Ephraim, Y.1    Van Trees, H.2
  • 7
    • 0027576049 scopus 로고
    • Voiced-unvoiced-silence classifications of speech using hybrid features and a network classifier
    • Qi, Y., Hunt, B.: 'Voiced-unvoiced-silence classifications of speech using hybrid features and a network classifier', IEEE Trans. Speech Audio Process., 1993, 1, (2), pp. 250-255
    • (1993) IEEE Trans. Speech Audio Process. , vol.1 , Issue.2 , pp. 250-255
    • Qi, Y.1    Hunt, B.2
  • 8
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds
    • Kawahara, H., Masuda-Katsuse, I., de Cheveigné, A.: 'Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds', Speech Commun., 1999, 27, (3-4), pp. 187-207
    • (1999) Speech Commun. , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigné, A.3
  • 9
    • 0001835850 scopus 로고
    • Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
    • Proc. Institute of Phonetic Sciences, Amsterdam, The Netherlands
    • Boersma, P.: 'Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound'. Proc. Institute of Phonetic Sciences, Amsterdam, The Netherlands, 1993, vol. 17, pp. 97-110
    • (1993) , vol.17 , pp. 97-110
    • Boersma, P.1
  • 10
    • 0036214787 scopus 로고    scopus 로고
    • Yin, a fundamental frequency estimator for speech and music
    • de Cheveigné, A., Kawahara, H.: 'Yin, a fundamental frequency estimator for speech and music', J. Acoust. Soc. Am., 2002, 111, (4), pp. 1917-1930
    • (2002) J. Acoust. Soc. Am. , vol.111 , Issue.4 , pp. 1917-1930
    • De Cheveigné, A.1    Kawahara, H.2
  • 11
    • 0023164594 scopus 로고
    • Optimization of voiced/unvoiced decisions in nonstationary noise environments
    • Kobatake, H.: 'Optimization of voiced/unvoiced decisions in nonstationary noise environments', IEEE Trans. Acoust. Speech Signal Process., 1987, 35, (1), pp. 9-18
    • (1987) IEEE Trans. Acoust. Speech Signal Process. , vol.35 , Issue.1 , pp. 9-18
    • Kobatake, H.1
  • 12
    • 4544321727 scopus 로고    scopus 로고
    • Speech processing, transmission and quality aspects (STQ); distributed speech recognition; extended front-end feature extraction algorithm; compression algorithms; back-end speech reconstruction algorithm
    • ETSI ES 202 211 V1.1.1:
    • ETSI ES 202 211 V1.1.1: 'Speech processing, transmission and quality aspects (STQ); distributed speech recognition; extended front-end feature extraction algorithm; compression algorithms; back-end speech reconstruction algorithm', 2003
    • (2003)
  • 13
    • 38649087695 scopus 로고    scopus 로고
    • A method for fundamental frequency estimation and voicing decision: application to infant utterances recorded in real acoustical environments
    • Nakatani, T., Amano, S., Irino, T., Ishizuka, K., Kondo, T.: 'A method for fundamental frequency estimation and voicing decision: application to infant utterances recorded in real acoustical environments', Speech Commun., 2008, 50, (3), pp. 203-214
    • (2008) Speech Commun. , vol.50 , Issue.3 , pp. 203-214
    • Nakatani, T.1    Amano, S.2    Irino, T.3    Ishizuka, K.4    Kondo, T.5
  • 15
    • 50249167077 scopus 로고    scopus 로고
    • Single and multiple F0 contour estimation through parametric spectrogram modeling of speech in noisy environments
    • Le Roux, J., Kameoka, H., Ono, N., de Cheveigné, A., Sagayama, S.: 'Single and multiple F0 contour estimation through parametric spectrogram modeling of speech in noisy environments', IEEE Trans. Audio, Speech, Lang. Process., 2007, 15, (4), pp. 1135-1145
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1135-1145
    • Le Roux, J.1    Kameoka, H.2    Ono, N.3    De Cheveigné, A.4    Sagayama, S.5
  • 16
    • 79959536022 scopus 로고    scopus 로고
    • Low-complexity F0-based speech/nonspeech discrimination approach for digital hearing aids
    • Cabañas-Molero, P., Ruiz-Reyes, N., Vera-Candeas, P., Maldonado-Bascon, S.: 'Low-complexity F0-based speech/nonspeech discrimination approach for digital hearing aids', Multimedia Tools Appl., 2011, 54, (2), pp. 291-319
    • (2011) Multimedia Tools Appl. , vol.54 , Issue.2 , pp. 291-319
    • Cabañas-Molero, P.1    Ruiz-Reyes, N.2    Vera-Candeas, P.3    Maldonado-Bascon, S.4
  • 17
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • Martin, R.: 'Noise power spectral density estimation based on optimal smoothing and minimum statistics', IEEE Trans. Speech Audio Process., 2001, 9, (5), pp. 504-512
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 18
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner, L.: 'A tutorial on hidden Markov models and selected applications in speech recognition', Proc. IEEE, 1989, 77, (2), pp. 257-286
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.1
  • 20
    • 84867592389 scopus 로고    scopus 로고
    • Exploiting the harmonic structure for speech enhancement
    • Speech, and Signal Processing, Kyoto, Japan, March
    • Cho, E., Smith, J.O., Widrow, B.: 'Exploiting the harmonic structure for speech enhancement'. Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Kyoto, Japan, March 2012, pp. 4569-4572
    • (2012) Proc. IEEE Int. Conf. Acoustics , pp. 4569-4572
    • Cho, E.1    Smith, J.O.2    Widrow, B.3
  • 21
    • 0036293748 scopus 로고    scopus 로고
    • A multi-band spectral subtraction method for enhancing speech corrupted by colored noise
    • Speech, and Signal Processing, Orlando, FL, USA, May
    • Kamath, S., Loizou, P.: 'A multi-band spectral subtraction method for enhancing speech corrupted by colored noise'. Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Orlando, FL, USA, May 2002, pp. 4160-4164
    • (2002) Proc. IEEE Int. Conf. Acoustics , pp. 4160-4164
    • Kamath, S.1    Loizou, P.2
  • 22
    • 85009074922 scopus 로고    scopus 로고
    • Harmonic tunneling: tracking non-stationary noises during speech
    • Speech Communication and Technology, Aalborg, Denmark, September
    • Ealey, D., Kelleher, H., Pearce, D.: 'Harmonic tunneling: tracking non-stationary noises during speech'. Proc. Seventh European Conf. Speech Communication and Technology, Aalborg, Denmark, September 2001, pp. 437-440
    • (2001) Proc. Seventh European Conf. , pp. 437-440
    • Ealey, D.1    Kelleher, H.2    Pearce, D.3
  • 24
    • 59849095077 scopus 로고    scopus 로고
    • Perceptual evaluation of speech quality (PESQ), and objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs
    • ITU-T Recommendation:
    • ITU-T Recommendation P.862: 'Perceptual evaluation of speech quality (PESQ), and objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs', 2000
    • (2000) , pp. 862
  • 25
    • 44149106061 scopus 로고    scopus 로고
    • Evaluation of objective quality measures for speech enhancement
    • Hu, Y., Loizou, P.: 'Evaluation of objective quality measures for speech enhancement', IEEE Trans. Audio, Speech, Lang. Process., 2008, 16, (1), pp. 229-238
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.1 , pp. 229-238
    • Hu, Y.1    Loizou, P.2
  • 26
    • 79955609335 scopus 로고    scopus 로고
    • Speech quality assessment
    • Weisi, L., et al. (Eds.): 'Multimedia analysis, processing and communications' (Springer Verlag)
    • Loizou, P.: 'Speech quality assessment', in Weisi, L., et al. (Eds.): 'Multimedia analysis, processing and communications' (Springer Verlag, 2011), pp. 623-654
    • (2011) , pp. 623-654
    • Loizou, P.1
  • 27
    • 44149115462 scopus 로고    scopus 로고
    • A geometric approach to spectral subtraction
    • Lu, Y., Loizou, P.: 'A geometric approach to spectral subtraction', Speech Commun., 2008, 50, (6), pp. 453-466
    • (2008) Speech Commun. , vol.50 , Issue.6 , pp. 453-466
    • Lu, Y.1    Loizou, P.2
  • 28
    • 0442317754 scopus 로고    scopus 로고
    • Speech processing, transmission and quality aspects (STQ); distributed speech recognition; advanced front-end feature extraction algorithm; compression algorithms
    • ETSI ES 202 050 V1.1.5:
    • ETSI ES 202 050 V1.1.5: 'Speech processing, transmission and quality aspects (STQ); distributed speech recognition; advanced front-end feature extraction algorithm; compression algorithms', 2007
    • (2007)


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.