



Volume 4, Issue 6, 2010, Pages 1071-1083

Online unsupervised classification with model comparison in the variational Bayes framework for voice activity detection

Author keywords

Sequential estimation; speech analysis; variational Bayes (VB); voice activity detection (VAD)

Indexed keywords

DECISION LEVELS; DECISION THRESHOLD; EXPECTATION MAXIMIZATION; EXPERIMENTAL EVALUATION; MODEL COMPARISON; NOISE TYPES; ON-LINE FASHION; REMOTE RECORDING; SEQUENTIAL ESTIMATION; STATISTICAL MODELS; TIME FRAME; UNSUPERVISED CLASSIFICATION; UNSUPERVISED METHOD; VARIATIONAL BAYES; VOICE ACTIVITY DETECTION

EID: 78649271854     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2010.2080821     Document Type: Article
Times cited: 11

References (30)
  • 2
    • 17344389852
    • Robust speech recognition in noisy environments: The 2001 IBM Spine evaluation system
    • B. Kingsbury, G. Saon, L. Mangu, M. Padmanabhan, and R. Sarikaya, "Robust speech recognition in noisy environments: The 2001 IBM Spine evaluation system," in Proc. ICASSP, 2002, pp. I-53-I-56.
    • (2002) Proc. ICASSP
    • Kingsbury, B.1    Saon, G.2    Mangu, L.3    Padmanabhan, M.4    Sarikaya, R.5
  • 3
    • 70749138955
    • Robust voiced-unvoiced classification using novel features and Gaussian mixture model
    • J. K. Shah, A. N. Iyer, B. Y. Smolenski, and R. E. Yantorno, "Robust voiced-unvoiced classification using novel features and Gaussian mixture model," in Proc. ICASSP, 2004.
    • (2004) Proc. ICASSP
    • Shah, J.K.1    Iyer, A.N.2    Smolenski, B.Y.3    Yantorno, R.E.4
  • 4
    • 0141702200
    • A linked-HMM model for robust voicing and speech detection
    • S. Basu, "A linked-HMM model for robust voicing and speech detection," in Proc. ICASSP, 2003, pp. 816-819.
    • (2003) Proc. ICASSP , pp. 816-819
    • Basu, S.1
  • 6
    • 27644475276
    • An improved voice activity detection using high order statistics
    • K. Li, M. S. S. Swamy, and M. O. Ahmad, "An improved voice activity detection using high order statistics," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 965-974, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 965-974
    • Li, K.1    Swamy, M.S.S.2    Ahmad, M.O.3
  • 7
    • 0141479978
    • Robust speech detection and segmentation for real-time ASR applications
    • I. Shafran and R. Rose, "Robust speech detection and segmentation for real-time ASR applications," in Proc. ICASSP, 2003, pp. 432-435.
    • (2003) Proc. ICASSP , pp. 432-435
    • Shafran, I.1    Rose, R.2
  • 8
    • 79951701874
    • Evaluation of real-time voice activity detection based on high order statistics
    • D. Cournapeau and T. Kawahara, "Evaluation of real-time voice activity detection based on high order statistics," in Proc. Interspeech, 2007.
    • (2007) Proc. Interspeech
    • Cournapeau, D.1    Kawahara, T.2
  • 9
    • 0035274536
    • Robust voice activity detection using higher-order statistics in the LPC residual domain
    • E. Nemer, R. Goubran, and S. Mahmoud, "Robust voice activity detection using higher-order statistics in the LPC residual domain," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 217-231, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 217-231
    • Nemer, E.1    Goubran, R.2    Mahmoud, S.3
  • 10
    • 13844295342
    • The variational Bayesian EM algorithm for incomplete data: With application to scoring graphical model structures
    • M. J. Beal and Z. Ghahramani, "The variational Bayesian EM algorithm for incomplete data: With application to scoring graphical model structures," Bayesian Statist., vol. 7, pp. 453-464, 2002.
    • (2002) Bayesian Statist. , vol.7 , pp. 453-464
    • Beal, M.J.1    Ghahramani, Z.2
  • 12
    • 3042741069
    • Variational Bayesian estimation and clustering for speech recognition
    • S. Watanabe, Y. Minami, A. Nakamura, and N. Ueda, "Variational Bayesian estimation and clustering for speech recognition," IEEE Trans. Speech Audio Process., vol. 12, no. 4, pp. 365-381, Jul. 2004.
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.4 , pp. 365-381
    • Watanabe, S.1    Minami, Y.2    Nakamura, A.3    Ueda, N.4
  • 14
    • 0000147488
    • Online model selection based on the variational Bayes
    • M. Sato, "Online model selection based on the variational Bayes," Neural Comput., vol. 13, pp. 1649-1681, 2001.
    • (2001) Neural Comput. , vol.13 , pp. 1649-1681
    • Sato, M.1
  • 15
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc.: Ser. B (Statisti. Methodol.), vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc.: Ser. B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 16
    • 0034131785
    • On-line EM algorithm for the normalized Gaussian network
    • M. Sato and S. Ishii, "On-line EM algorithm for the normalized Gaussian network," Neural Comput., vol. 12, pp. 407-432, 2000.
    • (2000) Neural Comput. , vol.12 , pp. 407-432
    • Sato, M.1    Ishii, S.2
  • 17
    • 33947640486
    • Recursive EM algorithm with applications to DOA estimation
    • O. Cappé, M. Charbit, and E. Moulines, "Recursive EM algorithm with applications to DOA estimation," in Proc. ICASSP, 2006, pp. 664-667.
    • (2006) Proc. ICASSP , pp. 664-667
    • Cappé, O.1    Charbit, M.2    Moulines, E.3
  • 18
    • 0002210265
    • On the convergence properties of the EM algorithm
    • C. F. J. Wu, "On the convergence properties of the EM algorithm," Ann. Statist., vol. 11, pp. 95-103, 1983.
    • (1983) Ann. Statist. , vol.11 , pp. 95-103
    • Wu, C.F.J.1
  • 20
    • 66849104300
    • Online EM algorithm for latent data models
    • O. Cappé and E. Moulines, "Online EM algorithm for latent data models," J. R. Statist. Soc.: Ser. B (Statist. Methodol.), vol. 73, no. 3, pp. 593-613, 2009.
    • (2009) J. R. Statist. Soc.: Ser. B , vol.73 , Issue.3 , pp. 593-613
    • Cappé, O.1    Moulines, E.2
  • 21
    • 0003278032
    • Inferring parameters and structure of latent variable models by variational Bayes
    • H. Attias, "Inferring parameters and structure of latent variable models by variational Bayes," in Proc. 15th Conf. Uncertainty Artif. Intell., 1999, pp. 21-30 [Online]. Available: http://citeseer.ist.psu.edu/attias99inferring.html
    • (1999) Proc. 15th Conf. Uncertainty Artif. Intell. , pp. 21-30
    • Attias, H.1
  • 22
    • 0001025418
    • Bayesian interpolation
    • D. J. C. MacKay, "Bayesian interpolation," Neural Comput., vol. 4, pp. 415-447, 1992.
    • (1992) Neural Comput. , vol.4 , pp. 415-447
    • MacKay, D.J.C.1
  • 24
    • 3543081155
    • Variational algorithms for approximate Bayesian inference
    • M. Beal, "Variational algorithms for approximate Bayesian inference," Ph.D. dissertation, Gatsby Computational Neuroscience Unit, Univ. College London, London, U.K., 2003.
    • (2003) Variational Algorithms for Approximate Bayesian Inference
    • Beal, M.1
  • 25
    • 51449103678
    • Using variational Bayes free energy for unsupervised voice activity-detection
    • D. Cournapeau and T. Kawahara, "Using variational Bayes free energy for unsupervised voice activity-detection," in Proc. IEEE-ICASSP, 2008.
    • (2008) Proc. IEEE-ICASSP
    • Cournapeau, D.1    Kawahara, T.2
  • 27
    • 0032762471
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 6, pp. 1-3, Jan. 1999.
    • (1999) IEEE Signal Process. Lett. , vol.6 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 29
    • 0021645331
    • Speech enhancement using minimum mean-square error short-time spectral amplitude estimator
    • Y. Ephraim and D. Malah, "Speech enhancement using minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.