SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 50, Issue 6, 2008, Pages 476-486

Voice activity detection based on adjustable linear prediction and GARCH models

(3) Solvang, Hiroko Kato a,b Ishizuka, Kentaro c Fujimoto, Masakiyo c

a OSLO UNIVERSITY HOSPITAL (Norway)

b UNIVERSITY OF OSLO (Norway)

c Nippon Telegraph and Telephone Corporation (Japan)

Author keywords

AR GARCH model; Kalman filter; Linear prediction; State space representation; Voice activity detection

Indexed keywords

ALGORITHMS; KALMAN FILTERS; LINEAR ACCELERATORS; SIGNAL TO NOISE RATIO;

LINEAR PREDICTION; STATE SPACE REPRESENTATION; VOICE ACTIVITY DETECTION (VAD);

SPEECH RECOGNITION;

EID: 44149094560 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2008.02.003 Document Type: Article

Times cited : (15)

References (38)

1
- 33646805431
- Abdolahi, M., Amindavar, H., 2005. GARCH coefficients as feature for speech recognition in Persian isolated digit. In: Proc. ICASSP, Vol. 1. pp. 957-960.
- Abdolahi, M., Amindavar, H., 2005. GARCH coefficients as feature for speech recognition in Persian isolated digit. In: Proc. ICASSP, Vol. 1. pp. 957-960.

2
- 44149118113
- Abramson, A., Cohen, I., 2006. Markov-switching GARCH model and application to speech enhancement in subbands. In: Proc. IWAENC. pp. 1-4.
- Abramson, A., Cohen, I., 2006. Markov-switching GARCH model and application to speech enhancement in subbands. In: Proc. IWAENC. pp. 1-4.

3
- 0016355478
- A new look at the statistical model identification
- Akaike H. A new look at the statistical model identification. IEEE Trans. Automat. Contr. AC-19 (1974) 716-723
- (1974) IEEE Trans. Automat. Contr. , vol.AC-19 , pp. 716-723
- Akaike, H.¹

4
- 84986734428
- Seasonal adjustment by a Bayesian modeling
- Akaike H. Seasonal adjustment by a Bayesian modeling. J. Time Ser. Anal. 1 1 (1980) 1-13
- (1980) J. Time Ser. Anal. , vol.1 , Issue.1 , pp. 1-13
- Akaike, H.¹

5
- 0004093046
- Prentice Hall, New Jersey
- Anderson B.D.O., and Moore J.B. Optimal Filtering (1979), Prentice Hall, New Jersey
- (1979) Optimal Filtering
- Anderson, B.D.O.¹ Moore, J.B.²

6
- 0141702200
- Basu, S., 2003. A linked-HMM model for robust voicing and speech detection. In: Proc. ICASSP, Vol. 1. pp. 816-819.
- Basu, S., 2003. A linked-HMM model for robust voicing and speech detection. In: Proc. ICASSP, Vol. 1. pp. 816-819.

7
- 42449156579
- Generalized autoregressive conditional heteroskedasticity
- Bollerslev T. Generalized autoregressive conditional heteroskedasticity. J. Econometrics 51 (1986) 307-327
- (1986) J. Econometrics , vol.51 , pp. 307-327
- Bollerslev, T.¹

8
- 33745187132
- Cohen, I., 2005. Supergaussian GARCH models for speech signals. In: Proc. Interspeech. pp. 2053-2056.
- Cohen, I., 2005. Supergaussian GARCH models for speech signals. In: Proc. Interspeech. pp. 2053-2056.

9
- 0000051984
- Autoregressive conditional heteroskedasticity with estimates of the variance of UK inflation
- Engle R.F. Autoregressive conditional heteroskedasticity with estimates of the variance of UK inflation. Econometrica 50 (1982) 987-1008
- (1982) Econometrica , vol.50 , pp. 987-1008
- Engle, R.F.¹

10
- 44149118554
- ETSI EN 301 708, 1999. Digital cellular telecommunications systems (Phase 2+); Voice Activity Detector (VAD) for Adaptive Multi-Rate (AMR) speech traffic channels; General description (GSM 06.94 version 7.1.1 Release 1998), V7.1.1.
- ETSI EN 301 708, 1999. Digital cellular telecommunications systems (Phase 2+); Voice Activity Detector (VAD) for Adaptive Multi-Rate (AMR) speech traffic channels; General description (GSM 06.94 version 7.1.1 Release 1998), V7.1.1.

11
- 44149107520
- ETSI ES 202 050, 2005. Speech Processing, Transmission and Quality Aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms. V1.1.4.
- ETSI ES 202 050, 2005. Speech Processing, Transmission and Quality Aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms. V1.1.4.

12
- 44149101886
- Hirsh, H.G., Pearce, D., 2000. The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proc. ISCA Tutorial and Research Workshop on Automatic Speech Recognition (ISCA ITRW ASR). pp. 181-188.
- Hirsh, H.G., Pearce, D., 2000. The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proc. ISCA Tutorial and Research Workshop on Automatic Speech Recognition (ISCA ITRW ASR). pp. 181-188.

13
- 0013026333
- ARDOCK, an auto-regressive model analyzer
- A publication of The Institute of Statistical Mathematics
- Ishiguro M., Kato H., and Akaike H. ARDOCK, an auto-regressive model analyzer. Computer Science Monographs Vol. 30 (1999), A publication of The Institute of Statistical Mathematics
- (1999) Computer Science Monographs , vol.30
- Ishiguro, M.¹ Kato, H.² Akaike, H.³

14
- 33947649580
- Ishizuka, K., Kato, H., 2006. A feature for voice activity detection derived from speech analysis with the exponential autoregressive model. In: Proc. ICASSP, Vol. 1. pp. 789-792.
- Ishizuka, K., Kato, H., 2006. A feature for voice activity detection derived from speech analysis with the exponential autoregressive model. In: Proc. ICASSP, Vol. 1. pp. 789-792.

15
- 44149097319
- ITU-T Recommendation G.729, Annex B, 1996. A silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70.
- ITU-T Recommendation G.729, Annex B, 1996. A silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70.

16
- 0028461861
- A robust algorithm for word boundary detection in the presence of noise
- Junqua J.-C., Mak B., and Reaves B. A robust algorithm for word boundary detection in the presence of noise. IEEE Trans. Speech Audio Process. 2 (1994) 406-412
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 406-412
- Junqua, J.-C.¹ Mak, B.² Reaves, B.³

17
- 0037401288
- Towards improving speech detection robustness for speech recognition in adverse conditions
- Karray L., and Martin A. Towards improving speech detection robustness for speech recognition in adverse conditions. Speech Commun. 40 (2003) 261-276
- (2003) Speech Commun. , vol.40 , pp. 261-276
- Karray, L.¹ Martin, A.²

18
- 84950428388
- A smoothness priors-state space modeling of time series with trend and seasonality
- Kitagawa G., and Gersch W. A smoothness priors-state space modeling of time series with trend and seasonality. J. Amer. Statist. Assoc. 79 (1984) 378-389
- (1984) J. Amer. Statist. Assoc. , vol.79 , pp. 378-389
- Kitagawa, G.¹ Gersch, W.²

19
- 33745218538
- Kristjansson, T., Deligne, S., Olsen, P., 2005. Voicing features for robust speech detection. In: Proc. Interspeech. pp. 369-372.
- Kristjansson, T., Deligne, S., Olsen, P., 2005. Voicing features for robust speech detection. In: Proc. Interspeech. pp. 369-372.

20
- 0029290274
- Study of voice activity detector and its influence on a noise reduction system
- Le Bouquin-Jeannès R., and Faucon G. Study of voice activity detector and its influence on a noise reduction system. Speech Commun. 16 (1995) 245-254
- (1995) Speech Commun. , vol.16 , pp. 245-254
- Le Bouquin-Jeannès, R.¹ Faucon, G.²

21
- 17244365395
- Asymptotic theory for ARCH models: LAN and residual empirical processes
- Lee S., and Taniguchi M. Asymptotic theory for ARCH models: LAN and residual empirical processes. Statist. Sinica 15 (2005) 215-234
- (2005) Statist. Sinica , vol.15 , pp. 215-234
- Lee, S.¹ Taniguchi, M.²

22
- 27644475276
- An improved voice activity detection using higher order statistics
- Li K., Swamy M.N.S., and Ahmad M.O. An improved voice activity detection using higher order statistics. IEEE Trans. Speech Audio Process. 13 (2005) 965-974
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , pp. 965-974
- Li, K.¹ Swamy, M.N.S.² Ahmad, M.O.³

23
- 1142277400
- Adaptive estimators and tests of stationary and nonstationary short- and long-memory ARFIMA-GARCH models
- Ling S. Adaptive estimators and tests of stationary and nonstationary short- and long-memory ARFIMA-GARCH models. J. Amer. Statist. Assoc. 98 (2003) 955-967
- (2003) J. Amer. Statist. Assoc. , vol.98 , pp. 955-967
- Ling, S.¹

24
- 0037847450
- On adaptive estimation in nonstationary ARMA models with GARCH errors
- Ling S., and McAleer M. On adaptive estimation in nonstationary ARMA models with GARCH errors. Ann. Statist. 31 (2003) 642-674
- (2003) Ann. Statist. , vol.31 , pp. 642-674
- Ling, S.¹ McAleer, M.²

25
- 19944407731
- On a measure of lack of fit in time series models
- Ljung G.M., and Box G.E.P. On a measure of lack of fit in time series models. Biometrica 68 (1978) 189-196
- (1978) Biometrica , vol.68 , pp. 189-196
- Ljung, G.M.¹ Box, G.E.P.²

26
- 0036476655
- Speech pause detection for noise spectrum estimation by tracking power envelope dynamics
- Marzinzik M., and Kollmeier B. Speech pause detection for noise spectrum estimation by tracking power envelope dynamics. IEEE Trans. Speech Audio Process. 10 (2002) 109-118
- (2002) IEEE Trans. Speech Audio Process. , vol.10 , pp. 109-118
- Marzinzik, M.¹ Kollmeier, B.²

27
- 44149095941
- Nakamura, A., Matsunaga, S., Shimizu, T., Tonomura, M., Sagisaka, Y., 1998. Japanese speech database for robust speech recognition. In: Proc. ICSLP.
- Nakamura, A., Matsunaga, S., Shimizu, T., Tonomura, M., Sagisaka, Y., 1998. Japanese speech database for robust speech recognition. In: Proc. ICSLP.

28
- 24144494616
- AURORA-2J: an evaluation framework for Japanese noisy speech recognition
- Nakamura S., Takeda K., Yamamoto K., Yamada T., Kuroiwa S., Kitaoka N., Nishiura T., Sasou A., Mizumachi M., Miyajima C., Fujimoto M., and Endo T. AURORA-2J: an evaluation framework for Japanese noisy speech recognition. IEICE Trans. Inform. Syst. E88-D (2005) 535-544
- (2005) IEICE Trans. Inform. Syst. , vol.E88-D , pp. 535-544
- Nakamura, S.¹ Takeda, K.² Yamamoto, K.³ Yamada, T.⁴ Kuroiwa, S.⁵ Kitaoka, N.⁶ Nishiura, T.⁷ Sasou, A.⁸ Mizumachi, M.⁹ Miyajima, C.¹⁰ Fujimoto, M.¹¹ Endo, T.¹²

29
- 0035274536
- Robust voice activity detection using higher-order statistics in the LPC residual domain
- Nemer E., Goubran R., and Mahmoud S. Robust voice activity detection using higher-order statistics in the LPC residual domain. IEEE Trans. Speech Audio Process. 9 (2001) 217-231
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 217-231
- Nemer, E.¹ Goubran, R.² Mahmoud, S.³

30
- 0016470107
- An algorithm for determining the endpoints of isolated utterances
- Rabiner L.R., and Sambur M.R. An algorithm for determining the endpoints of isolated utterances. Bell Syst. Tech. J. 54 (1975) 297-315
- (1975) Bell Syst. Tech. J. , vol.54 , pp. 297-315
- Rabiner, L.R.¹ Sambur, M.R.²

31
- 1842476689
- Efficient voice activity detection algorithm using long-term speech information
- Ramirez J., Segura J.C., Benitex C., de la Torre A., and Rubio A. Efficient voice activity detection algorithm using long-term speech information. Speech Commun. 42 (2004) 271-287
- (2004) Speech Commun. , vol.42 , pp. 271-287
- Ramirez, J.¹ Segura, J.C.² Benitex, C.³ de la Torre, A.⁴ Rubio, A.⁵

32
- 23344452899
- Statistical voice activity detection using a multiple observation likelihood ratio test
- Ramírez J., and Segura J.C. Statistical voice activity detection using a multiple observation likelihood ratio test. IEEE Signal Process. Lett. 12 10 (2005) 689-692
- (2005) IEEE Signal Process. Lett. , vol.12 , Issue.10 , pp. 689-692
- Ramírez, J.¹ Segura, J.C.²

33
- 44149111518
- Shen, J.-L., Hung, J.-W., Lee, L.-S., 1998. Robust entropy-based endpoint detection for speech recognition in noisy environments. In: Proc. ICSLP.
- Shen, J.-L., Hung, J.-W., Lee, L.-S., 1998. Robust entropy-based endpoint detection for speech recognition in noisy environments. In: Proc. ICSLP.

34
- 0032762471
- A statistical model-based voice activity detection
- Sohn J., Kim N.S., and Sung W. A statistical model-based voice activity detection. IEEE Signal Process. Lett. 6 1 (1999) 1-3
- (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
- Sohn, J.¹ Kim, N.S.² Sung, W.³

35
- 84889333873
- Srinivasan, K., Gersho, A., 1993. Voice activity detection for cellular networks. In: Proc. IEEE Workshop on Speech Coding for Telecommunications. pp. 85-86.
- Srinivasan, K., Gersho, A., 1993. Voice activity detection for cellular networks. In: Proc. IEEE Workshop on Speech Coding for Telecommunications. pp. 85-86.

36
- 44149083948
- A soft voice activity detection using GARCH filter and variance Gamma distribution
- Tahmasbi R., and Rezaei S. A soft voice activity detection using GARCH filter and variance Gamma distribution. IEEE Trans. Audio Speech Lang. Process. 15 (2007) 1129-1134
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , pp. 1129-1134
- Tahmasbi, R.¹ Rezaei, S.²

37
- 0026907622
- Voice activity detection using a periodicity measure
- Tucker R. Voice activity detection using a periodicity measure. IEE Proc.-I 139 (1992) 377-380
- (1992) IEE Proc.-I , vol.139 , pp. 377-380
- Tucker, R.¹

38
- 0003751465
- Springer-Verlag, Berlin
- West M., and Harrison P.J. Bayesian Forecasting and Dynamic Models. Springer Series in Statistics (1989), Springer-Verlag, Berlin
- (1989) Springer Series in Statistics
- West, M.¹ Harrison, P.J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.