Volume 54, Issue 6 I, 2006, Pages 1965-1976

Voice activity detection based on multiple statistical models

Author keywords

Discrete cosine transform (DCT); Generalized gamma function; Maximum likelihood

Indexed keywords

ALGORITHMS; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION; PROBABILITY DENSITY FUNCTION; SPEECH PROCESSING; STATISTICAL METHODS

EID: 33744532633     PISSN: 1053587X     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSP.2006.874403     Document Type: Article
Times cited: 213

References (35)
  • 3
    • 84973376206
    • "A study of endpoint detection algorithms in adverse conditions: Incidence on a DTW and HMM recognizer"
    • J. C. Junqua, B. Reaves, and B. Mark, "A study of endpoint detection algorithms in adverse conditions: Incidence on a DTW and HMM recognizer," in Proc. Eurospeech, 1991, pp. 1371-1374.
    • (1991) Proc. Eurospeech , pp. 1371-1374
    • Junqua, J.C.1    Reaves, B.2    Mark, B.3
  • 4
    • 0027713501
    • "Robust voice activity detection using cepstral feature"
    • J. A. Haigh and J. S. Mason, "Robust voice activity detection using cepstral feature," in Proc. IEEE TELCON, China, 1993, pp. 321-324.
    • (1993) Proc. IEEE TELCON , pp. 321-324
    • Haigh, J.A.1    Mason, J.S.2
  • 5
    • 0030192187
    • "Robust speech pulse-detection using adaptive noise modeling"
    • N. B. Yoma, F. McIness, and M. Jack, "Robust speech pulse-detection using adaptive noise modeling," Electron. Lett., vol. 32, pp. 1350-1352, Jul. 1996.
    • (1996) Electron. Lett. , vol.32 , pp. 1350-1352
    • Yoma, N.B.1    McIness, F.2    Jack, M.3
  • 6
    • 0026907622
    • "Voice activity detection using a periodicity measure"
    • R. Tucker, "Voice activity detection using a periodicity measure," Proc Inst. Electr. Eng., vol. 139, pp. 377-380, Aug. 1992.
    • (1992) Proc Inst. Electr. Eng. , vol.139 , pp. 377-380
    • Tucker, R.1
  • 7
    • 0032308777
    • "A robust voice activity detector for wireless communications using soft computing"
    • F. Beritelli, S. Casale, and A. Cavallaro, "A robust voice activity detector for wireless communications using soft computing," IEEE J. Sel. Areas Commun., vol. 16, pp. 1818-1829, Dec. 1998.
    • (1998) IEEE J. Sel. Areas Commun. , vol.16 , pp. 1818-1829
    • Beritelli, F.1    Casale, S.2    Cavallaro, A.3
  • 8
    • 0021645331
    • "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator"
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. 32, no. 6, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 9
    • 0006420828
    • "A silence compression scheme for G.729 optimized for terminals conforming to ITU-T V.70"
    • ITU, "A silence compression scheme for G.729 optimized for terminals conforming to ITU-T V.70," ITU-T Rec. G.729, Annex B, 1996.
    • (1996)
  • 10
    • 0141697573
    • "Selectable mode vocoder service option for wideband spread spectrum communication systems"
    • 3GPP2, "Selectable mode vocoder service option for wideband spread spectrum communication systems," 3GPP2 C.S0030-0 ver. 1.0, 2001.
    • (2001)
  • 11
    • 33744527841
    • "Speech coders: Silence compression scheme"
    • ITU, "Speech coders: Silence compression scheme," ITU-T Rec. G.723.1, Annex A, 1996.
    • (1996)
  • 12
    • 0442317753
    • "Voice activity detector (VAD) for adaptive multi-rate (AMR) speech traffic channels"
    • ETSI, "Voice activity detector (VAD) for adaptive multi-rate (AMR) speech traffic channels," ETSI EN 301 708 v7.1.1, Dec. 1999.
    • (1999)
  • 13
    • 0031636164
    • "A voice activity detector employing soft decision based noise spectrum adaptation"
    • J. Sohn and W. Sung, "A voice activity detector employing soft decision based noise spectrum adaptation," Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., pp. 365-368, 1998.
    • (1998) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 365-368
    • Sohn, J.1    Sung, W.2
  • 14
    • 0032762471
    • "A statistical model-based voice activity detection"
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, Jan. 1999.
    • (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 15
    • 0035481845
    • "Analysis and improvement of a statistical model-based voice activity detector"
    • Y. D. Cho and A. Kondoz, "Analysis and improvement of a statistical model-based voice activity detector," IEEE Signal Process. Lett., vol. 8, pp. 276-278, Oct. 2001.
    • (2001) IEEE Signal Process. Lett. , vol.8 , pp. 276-278
    • Cho, Y.D.1    Kondoz, A.2
  • 16
    • 85008053840
    • "Spectral enhancement based on global soft decision"
    • N. S. Kim and J.-H. Chang, "Spectral enhancement based on global soft decision," IEEE Signal Process. Lett., vol. 7, pp. 108-110, May 2000.
    • (2000) IEEE Signal Process. Lett. , vol.7 , pp. 108-110
    • Kim, N.S.1    Chang, J.-H.2
  • 17
    • 0035445888
    • "Speech enhancement: New approaches to soft decision"
    • J.-H. Chang and N. S. Kim, "Speech enhancement: New approaches to soft decision," IEICE Trans. Inf. Syst., vol. 27, no. E84-D, pp. 1231-1240, Sep. 2001.
    • (2001) IEICE Trans. Inf. Syst. , vol.27 , pp. 1231-1240
    • Chang, J.-H.1    Kim, N.S.2
  • 18
    • 84961827168
    • "Speech enhancement using warped discrete cosine transform"
    • J.-H. Chang and N. S. Kim, "Speech enhancement using warped discrete cosine transform," in Proc. IEEE Speech Coding Workshop, Tsukuba, Japan, Oct. 2002, pp. 175-177.
    • (2002) Proc. IEEE Speech Coding Workshop , pp. 175-177
    • Chang, J.-H.1    Kim, N.S.2
  • 19
    • 0037417326
    • "Voice activity detection based on complex Laplacian model"
    • J.-H. Chang and N. S. Kim, "Voice activity detection based on complex Laplacian model," Electron. Lett., vol. 39, no. 7, pp. 632-634, Apr. 2003.
    • (2003) Electron. Lett. , vol.39 , Issue.7 , pp. 632-634
    • Chang, J.-H.1    Kim, N.S.2
  • 20
    • 0035500783
    • "Speech enhancement for nonstationary noise environments"
    • I. Cohen and B. Berdugo, "Speech enhancement for nonstationary noise environments," Signal Process., vol. 81, pp. 2403-2418, Nov. 2001.
    • (2001) Signal Process. , vol.81 , pp. 2403-2418
    • Cohen, I.1    Berdugo, B.2
  • 21
    • 0036226165
    • "Noise estimation by minima controlled recursive averaging for robust speech enhancement"
    • I. Cohen and B. Berdugo, "Noise estimation by minima controlled recursive averaging for robust speech enhancement," IEEE Signal Process. Lett., vol. 9, pp. 12-15, Jan. 2002.
    • (2002) IEEE Signal Process. Lett. , vol.9 , pp. 12-15
    • Cohen, I.1    Berdugo, B.2
  • 22
    • 0036543522
    • "Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator"
    • I. Cohen, "Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator," IEEE Signal Process. Lett., vol. 9, pp. 113-116, Apr. 2002.
    • (2002) IEEE Signal Process. Lett. , vol.9 , pp. 113-116
    • Cohen, I.1
  • 23
    • 0034228994
    • "Voice activity detection in nonstationary noise"
    • S. G. Tanyer and H. Özer, "Voice activity detection in nonstationary noise," IEEE Trans. Speech Audio Process., vol. 8, pp. 478-482, Jul. 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , pp. 478-482
    • Tanyer, S.G.1    Özer, H.2
  • 24
    • 0035274536
    • "Robust voice activity detection using higher-order statistics in the LPC Residual domain"
    • E. Nemer, R. Goubran, and S. Mahmoud, "Robust voice activity detection using higher-order statistics in the LPC Residual domain," IEEE Trans. Speech Audio Process., vol. 9, pp. 217-231, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 217-231
    • Nemer, E.1    Goubran, R.2    Mahmoud, S.3
  • 25
    • 4243315328
    • "Enhanced variable rate codec, speech service option 3 for wideband spectrum digital systems"
    • "Enhanced variable rate codec, speech service option 3 for wideband spectrum digital systems," TIA/EIA/IS-127, 1996.
    • (1996)
  • 27
    • 0035373032
    • "Order statistics in goodness-of-fit testing"
    • A. G. Glen, L. M. Leemis, and D. R. Barr, "Order statistics in goodness-of-fit testing," IEEE Trans. Reliab., vol. 50, pp. 209-213, Jun. 2001.
    • (2001) IEEE Trans. Reliab. , vol.50 , pp. 209-213
    • Glen, A.G.1    Leemis, L.M.2    Barr, D.R.3
  • 28
    • 0020766544
    • "Distributions of the two dimensional DCT coefficients for images"
    • R. C. Reininger and J. D. Gibson, "Distributions of the two dimensional DCT coefficients for images," IEEE Trans. Commun., vol. COM-31, no. 6, pp. 835-839, Jun. 1983.
    • (1983) IEEE Trans. Commun. , Issue.6 , pp. 835-839
    • Reininger, R.C.1    Gibson, J.D.2
  • 29
    • 0019009880
    • "Speech enhancement using a soft-decision noise suppression filter"
    • R. J. McAulay and M. L. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, pp. 137-145, Apr. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , pp. 137-145
    • McAulay, R.J.1    Malpass, M.L.2
  • 30
    • 0028413241
    • "Elimination of musical noise phenomenon with the Ephraim and Malah noise suppressor"
    • O. Cappé, "Elimination of musical noise phenomenon with the Ephraim and Malah noise suppressor," IEEE Trans. Speech Audio Process., vol. 2, pp. 345-349, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 345-349
    • Cappé, O.1
  • 31
    • 0036296949
    • "Speech enhancement using MMSE short time spectral estimation with gamma distributed speech priors"
    • R. Martin, "Speech enhancement using MMSE short time spectral estimation with gamma distributed speech priors," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 1, Orlando, FL, May 2002, pp. 1253-1256.
    • (2002) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 1253-1256
    • Martin, R.1
  • 32
    • 0038610905
    • "Speech probability distribution"
    • S. Gazor and W. Zhang, "Speech probability distribution," IEEE Signal Process. Lett., vol. 10, pp. 204-207, Jul. 2003.
    • (2003) IEEE Signal Process. Lett. , vol.10 , pp. 204-207
    • Gazor, S.1    Zhang, W.2
  • 33
    • 0035396555
    • "Noise power spectral density estimation based on optimal smoothing and minimum statistics"
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, pp. 504-512, Jul. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 504-512
    • Martin, R.1
  • 34
    • 0027623210
    • "Assessment for automatic speech recognition, II - NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems"
    • A. Varga and H. J. M. Steeneken, "Assessment for automatic speech recognition, II - NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Commun., vol. 12, pp. 247-251, Jul. 1993.
    • (1993) Speech Commun. , vol.12 , pp. 247-251
    • Varga, A.1    Steeneken, H.J.M.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.