메뉴 건너뛰기




Volumn 24, Issue 3, 2010, Pages 515-530

Voice activity detection based on statistical models and machine learning approaches

Author keywords

A posteriori SNR; A priori SNR; Generalized gamma; Likelihood ratio test; Machine learning; Minimum classification error; Predicted SNR; Prior knowledge; Statistical modeling; Support vector machine; Voice activity detection

Indexed keywords

APRIORI; GENERALIZED GAMMA; LIKELIHOOD RATIO TESTS; MINIMUM CLASSIFICATION ERROR; POSTERIORI SNR; PREDICTED SNR; PRIOR KNOWLEDGE; STATISTICAL MODELING; VOICE ACTIVITY DETECTION;

EID: 77950091897     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2009.02.003     Document Type: Article
Times cited : (80)

References (32)
  • 1
    • 77950079839 scopus 로고    scopus 로고
    • 3GPP2, 2001. Selectable mode vocoder service option for wide-band spread spectrum communication systems. 3GPP2 C.S0030-0 v1.0
    • 3GPP2, 2001. Selectable mode vocoder service option for wide-band spread spectrum communication systems. 3GPP2 C.S0030-0 v1.0.
  • 2
    • 0032308777 scopus 로고    scopus 로고
    • A robust voice activity detector for wireless communications using soft computing
    • Beritelli F., Casale S., and Cavallaro A. A robust voice activity detector for wireless communications using soft computing. IEEE J. Sel. Area. Commun. 16 (1998) 1818-1829
    • (1998) IEEE J. Sel. Area. Commun. , vol.16 , pp. 1818-1829
    • Beritelli, F.1    Casale, S.2    Cavallaro, A.3
  • 3
    • 0035445888 scopus 로고    scopus 로고
    • Speech enhancement: new approaches to soft decision
    • Chang J.-H., and Kim N.S. Speech enhancement: new approaches to soft decision. IEICE Trans. Syst. Info. E84-D (2001) 1231-1240
    • (2001) IEICE Trans. Syst. Info. , vol.E84-D , pp. 1231-1240
    • Chang, J.-H.1    Kim, N.S.2
  • 4
    • 33744532633 scopus 로고    scopus 로고
    • Voice activity detection based on multiple statistical models
    • Chang J.-H., Kim N.S., and Mitra S.K. Voice activity detection based on multiple statistical models. IEEE Trans. Signal Process. 54 6 (2006) 1965-1976
    • (2006) IEEE Trans. Signal Process. , vol.54 , Issue.6 , pp. 1965-1976
    • Chang, J.-H.1    Kim, N.S.2    Mitra, S.K.3
  • 5
    • 85009164459 scopus 로고    scopus 로고
    • Chang, J.-H., Shin, J.W., Kim, N.S., 2003. Likelihood ratio test with complex Laplacian model for voice activity detection. In: Proceedings of the Eurospeech 2003, Geneva, Switzerland. pp. 1065-1068.
    • Chang, J.-H., Shin, J.W., Kim, N.S., 2003. Likelihood ratio test with complex Laplacian model for voice activity detection. In: Proceedings of the Eurospeech 2003, Geneva, Switzerland. pp. 1065-1068.
  • 6
    • 10944225892 scopus 로고    scopus 로고
    • Voice activity detector employing generalised gaussian distribution
    • Chang J.-H., Shin J.W., and Kim N.S. Voice activity detector employing generalised gaussian distribution. Electron. Lett. 40 24 (2004) 1561-1563
    • (2004) Electron. Lett. , vol.40 , Issue.24 , pp. 1561-1563
    • Chang, J.-H.1    Shin, J.W.2    Kim, N.S.3
  • 7
    • 0035481845 scopus 로고    scopus 로고
    • Analysis and improvement of a statistical model-based voice activy detector
    • Cho Y.D., and Kondoz A. Analysis and improvement of a statistical model-based voice activy detector. IEEE Signal Process. Lett. 8 10 (2001) 276-278
    • (2001) IEEE Signal Process. Lett. , vol.8 , Issue.10 , pp. 276-278
    • Cho, Y.D.1    Kondoz, A.2
  • 11
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • Ephraim Y., and Malah D. Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator. IEEE Trans. Acoustics, Speech, Signal Process. ASSP-32 6 (1984) 1109-1121
    • (1984) IEEE Trans. Acoustics, Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 12
    • 0442317753 scopus 로고    scopus 로고
    • Voice activity detector (VAD) for adaptive multi-rate (AMR) speech teaffic channels. ETSI EN
    • ETSI, 301 708, v7.1.1
    • ETSI, 1999. Voice activity detector (VAD) for adaptive multi-rate (AMR) speech teaffic channels. ETSI EN 301 708, v7.1.1.
    • (1999)
  • 13
    • 0038610905 scopus 로고    scopus 로고
    • Speech probability distribution
    • Gazor S., and Zhang W. Speech probability distribution. IEEE Signal Process. Lett. 10 7 (2003) 204-207
    • (2003) IEEE Signal Process. Lett. , vol.10 , Issue.7 , pp. 204-207
    • Gazor, S.1    Zhang, W.2
  • 14
    • 0027713501 scopus 로고
    • Robust voice activity detection using cepstral feature
    • China. pp
    • Haigh, J.A., Mason, J.S., 1993. Robust voice activity detection using cepstral feature. In: Proceedings of the IEEE TELCON 1993, China. pp. 321-324.
    • (1993) Proceedings of the IEEE TELCON , pp. 321-324
    • Haigh, J.A.1    Mason, J.S.2
  • 16
    • 77950079471 scopus 로고    scopus 로고
    • ITU-T, 1996. A silence compression scheme for G.729 optimized for terminals conforming to recommendation v70. ITU-T Rec. G. 729, Annex B.
    • ITU-T, 1996. A silence compression scheme for G.729 optimized for terminals conforming to recommendation v70. ITU-T Rec. G. 729, Annex B.
  • 17
    • 67651040596 scopus 로고    scopus 로고
    • A support vector machine-based voice activity detection employing effective feature vectors
    • Jo Q.-H., Park Y.-S., Lee K.-H., and Chang J.-H. A support vector machine-based voice activity detection employing effective feature vectors. IEICE Trans. Commun. E91-B 6 (2008) 2090-2093
    • (2008) IEICE Trans. Commun. , vol.E91-B , Issue.6 , pp. 2090-2093
    • Jo, Q.-H.1    Park, Y.-S.2    Lee, K.-H.3    Chang, J.-H.4
  • 18
    • 0031139839 scopus 로고    scopus 로고
    • Mimum classification error rate methods for speech recognition
    • Juang B.-H., Chou W., and Lee C.-H. Mimum classification error rate methods for speech recognition. IEEE Trans. Speech Audio Process. 5 3 (1997) 257-265
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.3 , pp. 257-265
    • Juang, B.-H.1    Chou, W.2    Lee, C.-H.3
  • 19
    • 67650137747 scopus 로고    scopus 로고
    • Discriminative weight training for a statistical model-based voice activity detection
    • Kang S.-I., Jo Q.-H., and Chang J.-H. Discriminative weight training for a statistical model-based voice activity detection. IEEE Signal Process. Lett. 15 (2008) 170-173
    • (2008) IEEE Signal Process. Lett. , vol.15 , pp. 170-173
    • Kang, S.-I.1    Jo, Q.-H.2    Chang, J.-H.3
  • 20
    • 33745188004 scopus 로고    scopus 로고
    • Voice activity detection based on optimally weighted combination of multiple feature
    • Kida, Y., Kawahara, T., 2005. Voice activity detection based on optimally weighted combination of multiple feature. In: Proceedings of the Interspeech. pp. 2621-2624.
    • (2005) Proceedings of the Interspeech , pp. 2621-2624
    • Kida, Y.1    Kawahara, T.2
  • 22
    • 0035274536 scopus 로고    scopus 로고
    • Robust voice activity detection using higher-order statistics in the LPC Residual domain
    • Nemer E., Goubran R., and Mahmoud S. Robust voice activity detection using higher-order statistics in the LPC Residual domain. IEEE Trans. Speech Audio Process. 9 (2001) 217-231
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 217-231
    • Nemer, E.1    Goubran, R.2    Mahmoud, S.3
  • 24
    • 33746363506 scopus 로고    scopus 로고
    • Speech/non-speech discrimination based on contextual information integrated bispectrum LRT
    • Ramirez J., Gorriz J.M., Segura J.C., Puntonet C.G., and Rubio A.J. Speech/non-speech discrimination based on contextual information integrated bispectrum LRT. IEEE Signal. Process. Lett. 13 8 (2006) 497-500
    • (2006) IEEE Signal. Process. Lett. , vol.13 , Issue.8 , pp. 497-500
    • Ramirez, J.1    Gorriz, J.M.2    Segura, J.C.3    Puntonet, C.G.4    Rubio, A.J.5
  • 25
    • 14644439205 scopus 로고    scopus 로고
    • Statistical modeling of speech signals based on generalized gamma distribution
    • Shin J.W., Chang J.-H., and Kim N.S. Statistical modeling of speech signals based on generalized gamma distribution. IEEE Signal Process. Lett. 12 3 (2005) 258-261
    • (2005) IEEE Signal Process. Lett. , vol.12 , Issue.3 , pp. 258-261
    • Shin, J.W.1    Chang, J.-H.2    Kim, N.S.3
  • 26
    • 34249658531 scopus 로고    scopus 로고
    • Voice activity detection based on a family of parametric distributions
    • Shin J.W., Chang J.-H., and Kim N.S. Voice activity detection based on a family of parametric distributions. Pattern Recogn. Lett. 28 (2007) 1295-1299
    • (2007) Pattern Recogn. Lett. , vol.28 , pp. 1295-1299
    • Shin, J.W.1    Chang, J.-H.2    Kim, N.S.3
  • 27
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activity detection
    • Sohn J., Kim N.S., and Sung W. A statistical model-based voice activity detection. IEEE Signal Process. Lett. 6 1 (1999) 1-3
    • (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 29
    • 77950068383 scopus 로고    scopus 로고
    • TIA/EIA/IS-127, 1996. Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems.
    • TIA/EIA/IS-127, 1996. Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems.
  • 31
    • 0032594959 scopus 로고    scopus 로고
    • An overview of statistical learning theory
    • Vapnik V.N. An overview of statistical learning theory. IEEE Trans. Neural Netw. 10 5 (1999) 988-999
    • (1999) IEEE Trans. Neural Netw. , vol.10 , Issue.5 , pp. 988-999
    • Vapnik, V.N.1
  • 32
    • 0030192187 scopus 로고    scopus 로고
    • Robust speech pulse-detection using adaptive noise modeling
    • Yoma N.B., McIness F., and Jack M. Robust speech pulse-detection using adaptive noise modeling. Electron. Lett. 32 (1996) 1350-1352
    • (1996) Electron. Lett. , vol.32 , pp. 1350-1352
    • Yoma, N.B.1    McIness, F.2    Jack, M.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.