메뉴 건너뛰기




Volumn 16, Issue 8, 2008, Pages 1565-1578

Jointly gaussian pdf-based likelihood ratio test for voice activity detection

Author keywords

Generalized complex gaussian (gcg) probability distribution function; Robust speech recognition; Voice activity detection (vad)

Indexed keywords

CLASSIFICATION ERRORS; COMPLEX GAUSSIAN; CORRELATED OBSERVATIONS; DETECTION PERFORMANCE; GAUSSIAN PDF; GAUSSIAN PROBABILITY DISTRIBUTIONS; GENERALIZED COMPLEX GAUSSIAN (GCG) PROBABILITY DISTRIBUTION FUNCTION; LIKELIHOOD RATIO TESTS; LOW DIMENSIONAL; NOISY ENVIRONMENT; OBSERVATION MODEL; REAL-TIME APPLICATION; ROBUST SPEECH RECOGNITION; SPEECH DETECTION; SPEECH RECOGNITION PERFORMANCE; SPEECH RECOGNITION SYSTEMS; SPEECH/NONSPEECH DETECTION; VOICE ACTIVITY DETECTION; VOICE ACTIVITY DETECTION (VAD); VOICE ACTIVITY DETECTORS;

EID: 70350433096     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2004293     Document Type: Article
Times cited : (21)

References (40)
  • 1
    • 0031238211 scopus 로고    scopus 로고
    • ITU-T recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications
    • A. Benyassine, E. Shlomot, H. Su, D. Massaloux, C. Lamblin, and J. Petit, ", vol., no, Sep.
    • A. Benyassine, E. Shlomot, H. Su, D. Massaloux, C. Lamblin, and J. Petit, "ITU-T recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications, " IEEE Commun. Mag., vol. 35, no. 9, pp. 64-73, Sep. 1997.
    • (1997) IEEE Commun. Mag. , vol.35 , Issue.9 , pp. 64-73
  • 2
    • 79851495972 scopus 로고    scopus 로고
    • A silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70
    • "A silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70, " ITU, ITU-T Rec. G.729-Annex B, 1996.
    • (1996) ITU, ITU-T Rec. G.729-Annex B
  • 3
    • 70350505232 scopus 로고    scopus 로고
    • Voice activity detector (VAD) for adaptive multi-rate (AMR) speech traffic channels
    • "Voice activity detector (VAD) for adaptive multi-rate (AMR) speech traffic channels, " ETSI, ETSI EN 301 708 Rec., 1999.
    • (1999) ETSI, ETSI EN 301 708 Rec
  • 4
    • 85032751786 scopus 로고    scopus 로고
    • The 1974 origins of VoIP, Speech processing, transmission and quality aspects (STQ); distributed speech recognition; advanced front-end feature extraction algorithm; compression algorithms
    • R. M. Gray, "The 1974 origins of VoIP, " IEEE Signal Process. Mag., vol. 22, no. 4, pp. 87-90, Jul. 2005.
    • (2002) ETSI, ETSI ES 202 050 Rec, IEEE Signal Process. Mag. , vol.22 , Issue.4 , pp. 87-90
    • Gray, R.M.1
  • 5
    • 33746373488 scopus 로고    scopus 로고
    • Measurement of the effects of temporal clipping on speech quality
    • Aug
    • L. Ding, A. Radwan, M. El-Hennawey, and R. Goubran, "Measurement of the effects of temporal clipping on speech quality, " IEEE Trans. Instrum. Meas., vol. 55, no. 4, pp. 1197-1203, Aug. 2005.
    • (2005) IEEE Trans. Instrum. Meas. , vol.55 , Issue.4 , pp. 1197-1203
    • Ding, L.1    Radwan, A.2    El-Hennawey, M.3    Goubran, R.4
  • 7
    • 33746612447 scopus 로고    scopus 로고
    • Voice quality prediction models and their application in VoIP networks
    • Aug
    • L. Sun and E. Ifeachor, "Voice quality prediction models and their application in VoIP networks, " IEEE Trans. Multimedia, vol. 8, no. 4, pp. 809-820, Aug. 2006.
    • (2006) IEEE Trans. Multimedia , vol.8 , Issue.4 , pp. 809-820
    • Sun, L.1    Ifeachor, E.2
  • 8
    • 0029290274 scopus 로고
    • Study of a voice activity detector and its influence on a noise reduction system
    • R. L. Bouquin-Jeannes and G. Faucon, "Study of a voice activity detector and its influence on a noise reduction system, " Speech Commun., vol. 16, pp. 245-254, 1995.
    • (1995) Speech Commun. , vol.16 , pp. 245-254
    • Bouquin-Jeannes, R.L.1    Faucon, G.2
  • 10
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr.
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 11
    • 0028769421 scopus 로고
    • Proposal of a voice activity detector for noise reduction
    • R. L. Bouquin-Jeannes and G. Faucon, "Proposal of a voice activity detector for noise reduction, " Electron. Lett., vol. 30, no. 12, pp. 930-932, 1994.
    • (1994) Electron. Lett. , vol.30 , Issue.12 , pp. 930-932
    • Bouquin-Jeannes, R.L.1    Faucon, G.2
  • 13
    • 50449102573 scopus 로고    scopus 로고
    • Environmental sniffing: Noise knowledge estimation for robust speech systems
    • Feb
    • M. Akbacak and J. H. L. Hansen, "Environmental sniffing: Noise knowledge estimation for robust speech systems, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 465-477, Feb. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 465-477
    • Akbacak, M.1    Hansen, J.H.L.2
  • 14
    • 0033903480 scopus 로고    scopus 로고
    • Robust voice activity detection algorithm for estimating noise spectrum
    • 2000
    • K.Woo, T. Yang, K. Park, and C. Lee, "Robust voice activity detection algorithm for estimating noise spectrum, " Electron. Lett., vol. 36, no. 2, pp. 180-181, 2000.
    • Electron. Lett. , vol.36 , Issue.2 , pp. 180-181
    • Woo, K.1    Yang, T.2    Park, K.3    Lee, C.4
  • 15
    • 0036508040 scopus 로고    scopus 로고
    • Robust endpoint detection and energy normalization for real-time speech and speaker recognition
    • Mar.
    • Q. Li, J. Zheng, A. Tsai, and Q. Zhou, "Robust endpoint detection and energy normalization for real-time speech and speaker recognition, " IEEE Trans. Speech Audio Process., vol. 10, no. 3, pp. 146-157, Mar. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.3 , pp. 146-157
    • Li, Q.1    Zheng, J.2    Tsai, A.3    Zhou, Q.4
  • 16
    • 0036476655 scopus 로고    scopus 로고
    • Speech pause detection for noise spectrum estimation by tracking power envelope dynamics
    • Feb
    • M. Marzinzik and B. Kollmeier, "Speech pause detection for noise spectrum estimation by tracking power envelope dynamics, " IEEE Trans. Speech Audio Processing, vol. 10, no. 2, pp. 341-351, Feb. 2002.
    • (2002) IEEE Trans. Speech Audio Processing , vol.10 , Issue.2 , pp. 341-351
    • Marzinzik, M.1    Kollmeier, B.2
  • 17
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activity detection
    • Jan.
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 16, no. 1, pp. 1-3, Jan. 1999.
    • (1999) IEEE Signal Process. Lett. , vol.16 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 18
    • 85026719883 scopus 로고    scopus 로고
    • Robust energy normalization using speech/nonspeech discriminator for German connected digit recognition
    • Budapest, Hungary, Sep.
    • R. Chengalvarayan, "Robust energy normalization using speech/nonspeech discriminator for German connected digit recognition, " in Proc. Eurospeech, Budapest, Hungary, Sep. 1999, pp. 61-64.
    • (1999) Proc. Eurospeech , pp. 61-64
    • Chengalvarayan, R.1    Tucker, R.2
  • 19
    • 0026907622 scopus 로고
    • Voice activity detection using a periodicity measure
    • , "Voice activity detection using a periodicity measure, " IEE Proc. Commun., Speech, Vision, vol. 139, no. 4, pp. 377-380, 1992.
    • (1992) IEE Proc. Commun., Speech, Vision , vol.139 , Issue.4 , pp. 377-380
  • 21
    • 33746363506 scopus 로고    scopus 로고
    • Speech/non-speech discrimination based on contextual information integrated bispectrum LRT
    • Aug
    • J. Ramírez, J. M. Górriz, J. C. Segura, C. G. Puntonet, and A. Rubio, "Speech/non-speech discrimination based on contextual information integrated bispectrum LRT, " IEEE Signal Process. Lett., vol. 13, no. 8, pp. 497-500, Aug. 2006.
    • (2006) IEEE Signal Process. Lett. , vol.13 , Issue.8 , pp. 497-500
    • Ramírez, J.1    Górriz, J.M.2    Segura, J.C.3    Puntonet, C.G.4    Rubio, A.5
  • 22
    • 0034228994 scopus 로고    scopus 로고
    • Voice activity detection in nonstationary noise
    • Jul.
    • S. G. Tanyer and H.Özer, "Voice activity detection in nonstationary noise, " IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 478-482, Jul. 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.4 , pp. 478-482
    • Tanyer, S.G.1    Özer, H.2
  • 23
    • 10944225892 scopus 로고    scopus 로고
    • Voice activity detector employing generalised Gaussian distribution
    • J.-H. Chang, J. W. Shin, and N. S. Kim, "Voice activity detector employing generalised Gaussian distribution, " Electron. Lett., vol. 40, no. 24, pp. 1561-1563, 2004.
    • (2004) Electron. Lett. , vol.40 , Issue.24 , pp. 1561-1563
    • Chang, J.-H.1    Shin, J.W.2    Kim, N.S.3
  • 24
    • 23344452899 scopus 로고    scopus 로고
    • Statistical voice activity detection using a multiple observation likelihood ratio test
    • Oct
    • J. Ramírez, J. C. Segura, C. Benítez, L. García, and A. Rubio, "Statistical voice activity detection using a multiple observation likelihood ratio test, " IEEE Signal Process. Lett., vol. 12, no. 10, pp. 837-844, Oct. 2001.
    • (2001) IEEE Signal Process. Lett. , vol.12 , Issue.10 , pp. 837-844
    • Ramírez, J.1    Segura, J.C.2    Benítez, C.3    García, L.4    Rubio, A.5
  • 25
    • 33745759906 scopus 로고    scopus 로고
    • An effective cluster-based model for robust speech detection and speech recognition in noisy environments
    • J. M.Górriz, J. Ramírez, J. C. Segura, and C. G. Puntonet, "An effective cluster-based model for robust speech detection and speech recognition in noisy environments, " J. Acoust. Soc. Amer., vol. 120, no. 470, pp. 470-481, 2006.
    • (2006) J. Acoust. Soc. Amer. , vol.120 , Issue.470 , pp. 470-481
    • M.Górriz, J.1    Ramírez, J.2    Segura, J.C.3    Puntonet, C.G.4
  • 26
    • 33751423044 scopus 로고    scopus 로고
    • Hard c-means clustering for voice activity detection
    • J. M. Górriz, J. Ramírez, E. W. Lang, and C. G. Puntonet, "Hard c-means clustering for voice activity detection, " Speech Commun., vol. 44, pp. 1638-1649, 2006.
    • (2006) Speech Commun. , vol.44 , pp. 1638-1649
    • Górriz, J.M.1    Ramírez, J.2    Lang, E.W.3    Puntonet, C.G.4
  • 27
    • 23344432506 scopus 로고    scopus 로고
    • An improvedMO-LRT VAD based on a bispectra Gaussian model
    • J. M. Górriz, J. Ramirez, J. C. Segura, and C. G. Puntonet, "An improvedMO-LRT VAD based on a bispectra Gaussian model, " Electron. Lett., vol. 41, no. 15, pp. 877-879, 2005.
    • (2005) Electron. Lett. , vol.41 , Issue.15 , pp. 877-879
    • Górriz, J.M.1    Ramirez, J.2    Segura, J.C.3    Puntonet, C.G.4
  • 30
    • 0031582149 scopus 로고    scopus 로고
    • The analytic inversion of any finite symmetric tridiagonal matrix
    • H. Yamani and M. Abdelmonem, "The analytic inversion of any finite symmetric tridiagonal matrix, " J. Phys. A: Math. Gen., vol. 30, pp. 2889-2893, 1997.
    • (1997) J. Phys. A: Math. Gen , vol.30 , pp. 2889-2893
    • Yamani, H.1    Abdelmonem, M.2
  • 34
    • 0029290274 scopus 로고
    • Study of a voice activity detector and its influence on a noise reduction system
    • R. L. Bouquin-Jeannes and G. Faucon, "Study of a voice activity detector and its influence on a noise reduction system, " Speech Commun., vol. 16, pp. 245-254, 1995.
    • (1995) Speech Commun. , vol.16 , pp. 245-254
    • Bouquin-Jeannes, R.L.1    Faucon, G.2
  • 35
    • 1842476689 scopus 로고    scopus 로고
    • Efficient voice activity detection algorithms using long-term speech information
    • J. Ramírez, J. C. Segura, M. C. Benítez, A. d. l. Torre, and A. Rubio, "Efficient voice activity detection algorithms using long-term speech information, " Speech Commun., vol. 42, no. 3-4, pp. 271-287, 2004.
    • (2004) Speech Commun , vol.42 , Issue.3-4 , pp. 271-287
    • Ramírez, J.1    Segura, J.C.2    Benítez, M.C.3    Torre, A.D.L.4    Rubio, A.5
  • 36
    • 0037401288 scopus 로고    scopus 로고
    • Towards improving speech detection robustness for speech recognition in adverse environments
    • no
    • L. Karray and A. Martin, "Towards improving speech detection robustness for speech recognition in adverse environments, " Speech Commun., no. 3, pp. 261-276, 2003.
    • (2003) Speech Commun , Issue.3 , pp. 261-276
    • Karray, L.1    Martin, A.2
  • 37
    • 26844492797 scopus 로고    scopus 로고
    • Speech processing, transmission, and quality aspects (STQ); distributed speech recognition; front-end feature extraction algorithm; compression algorithms
    • ETSI
    • "Speech processing, transmission, and quality aspects (STQ); distributed speech recognition; front-end feature extraction algorithm; compression algorithms, " ETSI, ETSI ES 201 108 Rec., 2000.
    • (2000) ETSI ES 201 108 Rec.
  • 39
    • 27744483317 scopus 로고    scopus 로고
    • An effective subband OSF-based VAD with noise reduction for robust speech recognition
    • Nov
    • J. Ramírez, J. C. Segura, C. Benítez, A. d. l. Torre, and A. Rubio, "An effective subband OSF-based VAD with noise reduction for robust speech recognition, " IEEE Trans. Speech Audio Process., vol. 13, no. 6, pp. 1119-1129, Nov. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.6 , pp. 1119-1129
    • Ramírez, J.1    Segura, J.C.2    Benítez, C.3    Torre, A.D.L.4    Rubio, A.5
  • 40
    • 0038669544 scopus 로고    scopus 로고
    • The AURORA experimental framework for the performance evaluation of speech recognition systems under noise conditions
    • Paris, France, Sep., CD-ROM
    • H. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluation of speech recognition systems under noise conditions, " in Proc. ISCA ITRW ASR2000 Automatic Speech Recognition: Challenges for the Next Millennium, Paris, France, Sep. 2000, CD-ROM.
    • (2000) Proc. ISCA ITRW ASR2000 Automatic Speech Recognition: Challenges for The Next Millennium
    • Hirsch, H.1    Pearce, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.