메뉴 건너뛰기




Volumn 50, Issue 6, 2008, Pages 476-486

Voice activity detection based on adjustable linear prediction and GARCH models

Author keywords

AR GARCH model; Kalman filter; Linear prediction; State space representation; Voice activity detection

Indexed keywords

ALGORITHMS; KALMAN FILTERS; LINEAR ACCELERATORS; SIGNAL TO NOISE RATIO;

EID: 44149094560     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2008.02.003     Document Type: Article
Times cited : (15)

References (38)
  • 1
    • 33646805431 scopus 로고    scopus 로고
    • Abdolahi, M., Amindavar, H., 2005. GARCH coefficients as feature for speech recognition in Persian isolated digit. In: Proc. ICASSP, Vol. 1. pp. 957-960.
    • Abdolahi, M., Amindavar, H., 2005. GARCH coefficients as feature for speech recognition in Persian isolated digit. In: Proc. ICASSP, Vol. 1. pp. 957-960.
  • 2
    • 44149118113 scopus 로고    scopus 로고
    • Abramson, A., Cohen, I., 2006. Markov-switching GARCH model and application to speech enhancement in subbands. In: Proc. IWAENC. pp. 1-4.
    • Abramson, A., Cohen, I., 2006. Markov-switching GARCH model and application to speech enhancement in subbands. In: Proc. IWAENC. pp. 1-4.
  • 3
    • 0016355478 scopus 로고
    • A new look at the statistical model identification
    • Akaike H. A new look at the statistical model identification. IEEE Trans. Automat. Contr. AC-19 (1974) 716-723
    • (1974) IEEE Trans. Automat. Contr. , vol.AC-19 , pp. 716-723
    • Akaike, H.1
  • 4
    • 84986734428 scopus 로고
    • Seasonal adjustment by a Bayesian modeling
    • Akaike H. Seasonal adjustment by a Bayesian modeling. J. Time Ser. Anal. 1 1 (1980) 1-13
    • (1980) J. Time Ser. Anal. , vol.1 , Issue.1 , pp. 1-13
    • Akaike, H.1
  • 6
    • 0141702200 scopus 로고    scopus 로고
    • Basu, S., 2003. A linked-HMM model for robust voicing and speech detection. In: Proc. ICASSP, Vol. 1. pp. 816-819.
    • Basu, S., 2003. A linked-HMM model for robust voicing and speech detection. In: Proc. ICASSP, Vol. 1. pp. 816-819.
  • 7
    • 42449156579 scopus 로고
    • Generalized autoregressive conditional heteroskedasticity
    • Bollerslev T. Generalized autoregressive conditional heteroskedasticity. J. Econometrics 51 (1986) 307-327
    • (1986) J. Econometrics , vol.51 , pp. 307-327
    • Bollerslev, T.1
  • 8
    • 33745187132 scopus 로고    scopus 로고
    • Cohen, I., 2005. Supergaussian GARCH models for speech signals. In: Proc. Interspeech. pp. 2053-2056.
    • Cohen, I., 2005. Supergaussian GARCH models for speech signals. In: Proc. Interspeech. pp. 2053-2056.
  • 9
    • 0000051984 scopus 로고
    • Autoregressive conditional heteroskedasticity with estimates of the variance of UK inflation
    • Engle R.F. Autoregressive conditional heteroskedasticity with estimates of the variance of UK inflation. Econometrica 50 (1982) 987-1008
    • (1982) Econometrica , vol.50 , pp. 987-1008
    • Engle, R.F.1
  • 10
    • 44149118554 scopus 로고    scopus 로고
    • ETSI EN 301 708, 1999. Digital cellular telecommunications systems (Phase 2+); Voice Activity Detector (VAD) for Adaptive Multi-Rate (AMR) speech traffic channels; General description (GSM 06.94 version 7.1.1 Release 1998), V7.1.1.
    • ETSI EN 301 708, 1999. Digital cellular telecommunications systems (Phase 2+); Voice Activity Detector (VAD) for Adaptive Multi-Rate (AMR) speech traffic channels; General description (GSM 06.94 version 7.1.1 Release 1998), V7.1.1.
  • 11
    • 44149107520 scopus 로고    scopus 로고
    • ETSI ES 202 050, 2005. Speech Processing, Transmission and Quality Aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms. V1.1.4.
    • ETSI ES 202 050, 2005. Speech Processing, Transmission and Quality Aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms. V1.1.4.
  • 12
    • 44149101886 scopus 로고    scopus 로고
    • Hirsh, H.G., Pearce, D., 2000. The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proc. ISCA Tutorial and Research Workshop on Automatic Speech Recognition (ISCA ITRW ASR). pp. 181-188.
    • Hirsh, H.G., Pearce, D., 2000. The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proc. ISCA Tutorial and Research Workshop on Automatic Speech Recognition (ISCA ITRW ASR). pp. 181-188.
  • 13
    • 0013026333 scopus 로고    scopus 로고
    • ARDOCK, an auto-regressive model analyzer
    • A publication of The Institute of Statistical Mathematics
    • Ishiguro M., Kato H., and Akaike H. ARDOCK, an auto-regressive model analyzer. Computer Science Monographs Vol. 30 (1999), A publication of The Institute of Statistical Mathematics
    • (1999) Computer Science Monographs , vol.30
    • Ishiguro, M.1    Kato, H.2    Akaike, H.3
  • 14
    • 33947649580 scopus 로고    scopus 로고
    • Ishizuka, K., Kato, H., 2006. A feature for voice activity detection derived from speech analysis with the exponential autoregressive model. In: Proc. ICASSP, Vol. 1. pp. 789-792.
    • Ishizuka, K., Kato, H., 2006. A feature for voice activity detection derived from speech analysis with the exponential autoregressive model. In: Proc. ICASSP, Vol. 1. pp. 789-792.
  • 15
    • 44149097319 scopus 로고    scopus 로고
    • ITU-T Recommendation G.729, Annex B, 1996. A silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70.
    • ITU-T Recommendation G.729, Annex B, 1996. A silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70.
  • 16
    • 0028461861 scopus 로고
    • A robust algorithm for word boundary detection in the presence of noise
    • Junqua J.-C., Mak B., and Reaves B. A robust algorithm for word boundary detection in the presence of noise. IEEE Trans. Speech Audio Process. 2 (1994) 406-412
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 406-412
    • Junqua, J.-C.1    Mak, B.2    Reaves, B.3
  • 17
    • 0037401288 scopus 로고    scopus 로고
    • Towards improving speech detection robustness for speech recognition in adverse conditions
    • Karray L., and Martin A. Towards improving speech detection robustness for speech recognition in adverse conditions. Speech Commun. 40 (2003) 261-276
    • (2003) Speech Commun. , vol.40 , pp. 261-276
    • Karray, L.1    Martin, A.2
  • 18
    • 84950428388 scopus 로고
    • A smoothness priors-state space modeling of time series with trend and seasonality
    • Kitagawa G., and Gersch W. A smoothness priors-state space modeling of time series with trend and seasonality. J. Amer. Statist. Assoc. 79 (1984) 378-389
    • (1984) J. Amer. Statist. Assoc. , vol.79 , pp. 378-389
    • Kitagawa, G.1    Gersch, W.2
  • 19
    • 33745218538 scopus 로고    scopus 로고
    • Kristjansson, T., Deligne, S., Olsen, P., 2005. Voicing features for robust speech detection. In: Proc. Interspeech. pp. 369-372.
    • Kristjansson, T., Deligne, S., Olsen, P., 2005. Voicing features for robust speech detection. In: Proc. Interspeech. pp. 369-372.
  • 20
    • 0029290274 scopus 로고
    • Study of voice activity detector and its influence on a noise reduction system
    • Le Bouquin-Jeannès R., and Faucon G. Study of voice activity detector and its influence on a noise reduction system. Speech Commun. 16 (1995) 245-254
    • (1995) Speech Commun. , vol.16 , pp. 245-254
    • Le Bouquin-Jeannès, R.1    Faucon, G.2
  • 21
    • 17244365395 scopus 로고    scopus 로고
    • Asymptotic theory for ARCH models: LAN and residual empirical processes
    • Lee S., and Taniguchi M. Asymptotic theory for ARCH models: LAN and residual empirical processes. Statist. Sinica 15 (2005) 215-234
    • (2005) Statist. Sinica , vol.15 , pp. 215-234
    • Lee, S.1    Taniguchi, M.2
  • 22
    • 27644475276 scopus 로고    scopus 로고
    • An improved voice activity detection using higher order statistics
    • Li K., Swamy M.N.S., and Ahmad M.O. An improved voice activity detection using higher order statistics. IEEE Trans. Speech Audio Process. 13 (2005) 965-974
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , pp. 965-974
    • Li, K.1    Swamy, M.N.S.2    Ahmad, M.O.3
  • 23
    • 1142277400 scopus 로고    scopus 로고
    • Adaptive estimators and tests of stationary and nonstationary short- and long-memory ARFIMA-GARCH models
    • Ling S. Adaptive estimators and tests of stationary and nonstationary short- and long-memory ARFIMA-GARCH models. J. Amer. Statist. Assoc. 98 (2003) 955-967
    • (2003) J. Amer. Statist. Assoc. , vol.98 , pp. 955-967
    • Ling, S.1
  • 24
    • 0037847450 scopus 로고    scopus 로고
    • On adaptive estimation in nonstationary ARMA models with GARCH errors
    • Ling S., and McAleer M. On adaptive estimation in nonstationary ARMA models with GARCH errors. Ann. Statist. 31 (2003) 642-674
    • (2003) Ann. Statist. , vol.31 , pp. 642-674
    • Ling, S.1    McAleer, M.2
  • 25
    • 19944407731 scopus 로고
    • On a measure of lack of fit in time series models
    • Ljung G.M., and Box G.E.P. On a measure of lack of fit in time series models. Biometrica 68 (1978) 189-196
    • (1978) Biometrica , vol.68 , pp. 189-196
    • Ljung, G.M.1    Box, G.E.P.2
  • 26
    • 0036476655 scopus 로고    scopus 로고
    • Speech pause detection for noise spectrum estimation by tracking power envelope dynamics
    • Marzinzik M., and Kollmeier B. Speech pause detection for noise spectrum estimation by tracking power envelope dynamics. IEEE Trans. Speech Audio Process. 10 (2002) 109-118
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , pp. 109-118
    • Marzinzik, M.1    Kollmeier, B.2
  • 27
    • 44149095941 scopus 로고    scopus 로고
    • Nakamura, A., Matsunaga, S., Shimizu, T., Tonomura, M., Sagisaka, Y., 1998. Japanese speech database for robust speech recognition. In: Proc. ICSLP.
    • Nakamura, A., Matsunaga, S., Shimizu, T., Tonomura, M., Sagisaka, Y., 1998. Japanese speech database for robust speech recognition. In: Proc. ICSLP.
  • 29
    • 0035274536 scopus 로고    scopus 로고
    • Robust voice activity detection using higher-order statistics in the LPC residual domain
    • Nemer E., Goubran R., and Mahmoud S. Robust voice activity detection using higher-order statistics in the LPC residual domain. IEEE Trans. Speech Audio Process. 9 (2001) 217-231
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 217-231
    • Nemer, E.1    Goubran, R.2    Mahmoud, S.3
  • 30
    • 0016470107 scopus 로고
    • An algorithm for determining the endpoints of isolated utterances
    • Rabiner L.R., and Sambur M.R. An algorithm for determining the endpoints of isolated utterances. Bell Syst. Tech. J. 54 (1975) 297-315
    • (1975) Bell Syst. Tech. J. , vol.54 , pp. 297-315
    • Rabiner, L.R.1    Sambur, M.R.2
  • 31
    • 1842476689 scopus 로고    scopus 로고
    • Efficient voice activity detection algorithm using long-term speech information
    • Ramirez J., Segura J.C., Benitex C., de la Torre A., and Rubio A. Efficient voice activity detection algorithm using long-term speech information. Speech Commun. 42 (2004) 271-287
    • (2004) Speech Commun. , vol.42 , pp. 271-287
    • Ramirez, J.1    Segura, J.C.2    Benitex, C.3    de la Torre, A.4    Rubio, A.5
  • 32
    • 23344452899 scopus 로고    scopus 로고
    • Statistical voice activity detection using a multiple observation likelihood ratio test
    • Ramírez J., and Segura J.C. Statistical voice activity detection using a multiple observation likelihood ratio test. IEEE Signal Process. Lett. 12 10 (2005) 689-692
    • (2005) IEEE Signal Process. Lett. , vol.12 , Issue.10 , pp. 689-692
    • Ramírez, J.1    Segura, J.C.2
  • 33
    • 44149111518 scopus 로고    scopus 로고
    • Shen, J.-L., Hung, J.-W., Lee, L.-S., 1998. Robust entropy-based endpoint detection for speech recognition in noisy environments. In: Proc. ICSLP.
    • Shen, J.-L., Hung, J.-W., Lee, L.-S., 1998. Robust entropy-based endpoint detection for speech recognition in noisy environments. In: Proc. ICSLP.
  • 34
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activity detection
    • Sohn J., Kim N.S., and Sung W. A statistical model-based voice activity detection. IEEE Signal Process. Lett. 6 1 (1999) 1-3
    • (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 35
    • 84889333873 scopus 로고    scopus 로고
    • Srinivasan, K., Gersho, A., 1993. Voice activity detection for cellular networks. In: Proc. IEEE Workshop on Speech Coding for Telecommunications. pp. 85-86.
    • Srinivasan, K., Gersho, A., 1993. Voice activity detection for cellular networks. In: Proc. IEEE Workshop on Speech Coding for Telecommunications. pp. 85-86.
  • 36
    • 44149083948 scopus 로고    scopus 로고
    • A soft voice activity detection using GARCH filter and variance Gamma distribution
    • Tahmasbi R., and Rezaei S. A soft voice activity detection using GARCH filter and variance Gamma distribution. IEEE Trans. Audio Speech Lang. Process. 15 (2007) 1129-1134
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , pp. 1129-1134
    • Tahmasbi, R.1    Rezaei, S.2
  • 37
    • 0026907622 scopus 로고
    • Voice activity detection using a periodicity measure
    • Tucker R. Voice activity detection using a periodicity measure. IEE Proc.-I 139 (1992) 377-380
    • (1992) IEE Proc.-I , vol.139 , pp. 377-380
    • Tucker, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.