메뉴 건너뛰기




Volumn 42, Issue 3-4, 2004, Pages 271-287

Efficient voice activity detection algorithms using long-term speech information

Author keywords

Long term spectral divergence; Long term spectral envelope; Speech enhancement; Speech recognition; Speech non speech detection

Indexed keywords

ALGORITHMS; HEARING AIDS; NOISE ABATEMENT; SIGNAL TO NOISE RATIO; SPEECH PROCESSING; SPEECH RECOGNITION; SPURIOUS SIGNAL NOISE; VOICE ACTIVATED INPUT DEVICES;

EID: 1842476689     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2003.10.002     Document Type: Article
Times cited : (387)

References (31)
  • 1
    • 0031238211 scopus 로고    scopus 로고
    • ITU-T recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications
    • Benyassine A., Shlomot E., Su H., Massaloux D., Lamblin C., Petit J. ITU-T recommendation G.729 Annex B: a silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications. IEEE Comm. Magazine. 35(9):1997;64-73.
    • (1997) IEEE Comm. Magazine , vol.35 , Issue.9 , pp. 64-73
    • Benyassine, A.1    Shlomot, E.2    Su, H.3    Massaloux, D.4    Lamblin, C.5    Petit, J.6
  • 3
    • 0032308777 scopus 로고    scopus 로고
    • A robust voice activity detector for wireless communications using soft computing
    • Beritelli F., Casale S., Cavallaro A. A robust voice activity detector for wireless communications using soft computing. IEEE J. Select. Areas Comm. 16(9):1998;1818-1829.
    • (1998) IEEE J. Select. Areas Comm. , vol.16 , Issue.9 , pp. 1818-1829
    • Beritelli, F.1    Casale, S.2    Cavallaro, A.3
  • 4
    • 0036494209 scopus 로고    scopus 로고
    • Performance evaluation and comparison of G.729/AMR/fuzzy voice activity detectors
    • Beritelli F., Casale S., Rugeri G., Serrano S. Performance evaluation and comparison of G.729/AMR/fuzzy voice activity detectors. IEEE Signal Process. Lett. 9(3):2002;85-88.
    • (2002) IEEE Signal Process. Lett. , vol.9 , Issue.3 , pp. 85-88
    • Beritelli, F.1    Casale, S.2    Rugeri, G.3    Serrano, S.4
  • 5
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Boll S.F. Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. 27:1979;113-120.
    • (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , pp. 113-120
    • Boll, S.F.1
  • 6
    • 0028769421 scopus 로고
    • Proposal of a voice activity detector for noise reduction
    • Bouquin-Jeannes R.L., Faucon G. Proposal of a voice activity detector for noise reduction. Electron. Lett. 30(12):1994;930-932.
    • (1994) Electron. Lett. , vol.30 , Issue.12 , pp. 930-932
    • Bouquin-Jeannes, R.L.1    Faucon, G.2
  • 7
    • 0029290274 scopus 로고
    • Study of voice activity detector and its influence on a noise reduction system
    • Bouquin-Jeannes R.L., Faucon G. Study of voice activity detector and its influence on a noise reduction system. Speech Comm. 16:1995;245-254.
    • (1995) Speech Comm. , vol.16 , pp. 245-254
    • Bouquin-Jeannes, R.L.1    Faucon, G.2
  • 8
    • 0035481845 scopus 로고    scopus 로고
    • Analysis and improvement of a statistical model-based voice activity detector
    • Cho Y.D., Kondoz A. Analysis and improvement of a statistical model-based voice activity detector. IEEE Signal Process. Lett. 8(10):2001;276-278.
    • (2001) IEEE Signal Process. Lett. , vol.8 , Issue.10 , pp. 276-278
    • Cho, Y.D.1    Kondoz, A.2
  • 9
    • 0034846572 scopus 로고    scopus 로고
    • Improved voice activity detection based on a smoothed statistical likelihood ratio
    • Cho, Y.D., Al-Naimi, K., Kondoz, A., 2001a. Improved voice activity detection based on a smoothed statistical likelihood ratio. In: Internat. Conf. on Acoust. Speech Signal Process., Vol. 2, pp. 737-740.
    • (2001) Internat. Conf. on Acoust. Speech Signal Process. , vol.2 , pp. 737-740
    • Cho, Y.D.1    Al-Naimi, K.2    Kondoz, A.3
  • 10
    • 0035848709 scopus 로고    scopus 로고
    • Mixed decision-based noise adaptation for speech enhancement
    • Cho Y.D., Al-Naimi K., Kondoz A. Mixed decision-based noise adaptation for speech enhancement. Electron. Lett. 37(8):2001;540-542.
    • (2001) Electron. Lett. , vol.37 , Issue.8 , pp. 540-542
    • Cho, Y.D.1    Al-Naimi, K.2    Kondoz, A.3
  • 16
    • 0030683353 scopus 로고    scopus 로고
    • Environmental noise reduction based on speech/non-speech identification for hearing aids
    • Itoh, K., Mizushima, M., 1997. Environmental noise reduction based on speech/non-speech identification for hearing aids. In: Internat. Conf. on Acoust. Speech Signal Process., Vol. 1, pp. 419-422.
    • (1997) Internat. Conf. on Acoust. Speech Signal Process. , vol.1 , pp. 419-422
    • Itoh, K.1    Mizushima, M.2
  • 18
    • 0037401288 scopus 로고    scopus 로고
    • Towards improving speech detection robustness for speech recognition in adverse environment
    • Karray L., Martin A. Towards improving speech detection robustness for speech recognition in adverse environment. Speech Comm. 40(3):2003;261-276.
    • (2003) Speech Comm. , vol.40 , Issue.3 , pp. 261-276
    • Karray, L.1    Martin, A.2
  • 21
    • 85135379452 scopus 로고
    • An efficient algorithm to estimate the instantaneous SNR of speech signals
    • Martin, R., 1993. An efficient algorithm to estimate the instantaneous SNR of speech signals. In: Eurospeech, Vol. 1, pp. 1093-1096.
    • (1993) Eurospeech , vol.1 , pp. 1093-1096
    • Martin, R.1
  • 22
    • 0036476655 scopus 로고    scopus 로고
    • Speech pause detection for noise spectrum estimation by tracking power envelope dynamics
    • Marzinzik M., Kollmeier B. Speech pause detection for noise spectrum estimation by tracking power envelope dynamics. IEEE Trans. Speech Audio Process. 10(2):2002;109-118.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.2 , pp. 109-118
    • Marzinzik, M.1    Kollmeier, B.2
  • 24
    • 0035274536 scopus 로고    scopus 로고
    • Robust voice activity detection using higher-order statistics in the LPC residual domain
    • Nemer E., Goubran R., Mahmoud S. Robust voice activity detection using higher-order statistics in the LPC residual domain. IEEE Trans. Speech Audio Process. 9(3):2001;217-231.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 217-231
    • Nemer, E.1    Goubran, R.2    Mahmoud, S.3
  • 27
    • 0031636164 scopus 로고    scopus 로고
    • A voice activity detector employing soft decision based noise spectrum adaptation
    • Sohn, J., Sung, W., 1998. A voice activity detector employing soft decision based noise spectrum adaptation. In: Internat. Conf. on Acoust. Speech Signal Process., Vol. 1, pp. 365-368.
    • (1998) Internat. Conf. on Acoust. Speech Signal Process. , vol.1 , pp. 365-368
    • Sohn, J.1    Sung, W.2
  • 28
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activity detection
    • Sohn J., Kim N.S., Sung W. A statistical model-based voice activity detection. IEEE Signal Process. Lett. 6(1):1999;1-3.
    • (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 30
    • 0033903480 scopus 로고    scopus 로고
    • Robust voice activity detection algorithm for estimating noise spectrum
    • Woo K., Yang T., Park K., Lee C. Robust voice activity detection algorithm for estimating noise spectrum. Electron. Lett. 36(2):2000;180-181.
    • (2000) Electron. Lett. , vol.36 , Issue.2 , pp. 180-181
    • Woo, K.1    Yang, T.2    Park, K.3    Lee, C.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.