Volume 13, Issue 6, 2005, Pages 1119-1129

An effective subband OSF-based VAD with noise reduction for robust speech recognition

Author keywords

Noise reduction; Robust speech recognition; Speech nonspeech detection; Subband order statistics filters

Indexed keywords

NOISY ENVIRONMENT; ROBUST SPEECH RECOGNITION; SPEECH/NONSPEECH DETECTION; SUBBAND ORDER STATISTICS FILTERS;

EID: 27744483317     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.853212     Document Type: Article
Times cited : (90)

References (37)
  • 1
    • 0037401288
    • Toward improving speech detection robustness for speech recognition in adverse environments
    • L. Karray and A. Martin, "Toward improving speech detection robustness for speech recognition in adverse environments," Speech Commun., no. 3, pp. 261-276, 2003.
    • (2003) Speech Commun. , Issue.3 , pp. 261-276
    • Karray, L.1    Martin, A.2
  • 2
    • 85009230767
    • A new adaptive long-term spectral estimation voice activity detector
    • Geneva, Switzerland, Sep.
    • J. Ramírez, J. C. Segura, M. C. Benítez, A. de la Torre, and A. Rubio, "A new adaptive long-term spectral estimation voice activity detector," in Proc. EUROSPEECH 2003, Geneva, Switzerland, Sep. 2003, pp. 3041-3044.
    • (2003) Proc. EUROSPEECH 2003 , pp. 3041-3044
    • Ramírez, J.1    Segura, J.C.2    Benítez, M.C.3    De La Torre, A.4    Rubio, A.5
  • 6
    • 0037224018
    • Noise reduction and echo cancellation front-end for speech codecs
    • Jan.
    • F. Basbug, K. Swaminathan, and S. Nandkumar, "Noise reduction and echo cancellation front-end for speech codecs," IEEE Trans. Speech Audio Processing, vol. 11, no. 1, pp. 1-13, Jan. 2003.
    • (2003) IEEE Trans. Speech Audio Processing , vol.11 , Issue.1 , pp. 1-13
    • Basbug, F.1    Swaminathan, K.2    Nandkumar, S.3
  • 7
    • 0036649285
    • A psychoacoustic approach to combined acoustic echo cancellation and noise reduction
    • S. Gustafsson, R. Martin, P. Jax, and P. Vary, "A psychoacoustic approach to combined acoustic echo cancellation and noise reduction," IEEE Trans. Speech Audio Processing, vol. 10, no. 5, pp. 245-256, 2002.
    • (2002) IEEE Trans. Speech Audio Processing , vol.10 , Issue.5 , pp. 245-256
    • Gustafsson, S.1    Martin, R.2    Jax, P.3    Vary, P.4
  • 8
    • 0032762471
    • A statistical model-based voice activity detection
    • Jan.
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, Jan. 1999.
    • (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 9
    • 0035481845
    • Analysis and improvement of a statistical model-based voice activity detector
    • Aug.
    • Y. D. Cho and A. Kondoz, "Analysis and improvement of a statistical model-based voice activity detector," IEEE Signal Process. Lett., vol. 8, no. 10, pp. 276-278, Aug. 2001.
    • (2001) IEEE Signal Process. Lett. , vol.8 , Issue.10 , pp. 276-278
    • Cho, Y.D.1    Kondoz, A.2
  • 10
    • 0042863279
    • A soft voice activity detector based on a Laplacian-Gaussian model
    • Sep.
    • S. Gazor and W. Zhang, "A soft voice activity detector based on a Laplacian-Gaussian model," IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 498-505, Sep. 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.5 , pp. 498-505
    • Gazor, S.1    Zhang, W.2
  • 11
    • 85009183774
    • Use of a CSP-based voice activity detector for distant-talking ASR
    • Geneva, Switzerland, Sep.
    • L. Armani, M. Matassoni, M. Omologo, and P. Svaizer, "Use of a CSP-based voice activity detector for distant-talking ASR," in Proc. EUROSPEECH 2003, Geneva, Switzerland, Sep. 2003, pp. 501-504.
    • (2003) Proc. EUROSPEECH 2003 , pp. 501-504
    • Armani, L.1    Matassoni, M.2    Omologo, M.3    Svaizer, P.4
  • 12
    • 0029290274
    • Study of a voice activity detector and its influence on a noise reduction system
    • R. L. Bouquin-Jeannes and G. Faucon, "Study of a voice activity detector and its influence on a noise reduction system," Speech Commun., vol. 16, pp. 245-254, 1995.
    • (1995) Speech Commun. , vol.16 , pp. 245-254
    • Bouquin-Jeannes, R.L.1    Faucon, G.2
  • 13
    • 0033903480
    • Robust voice activity detection algorithm for estimating noise spectrum
    • K. Woo, T. Yang, K. Park, and C. Lee, "Robust voice activity detection algorithm for estimating noise spectrum," Electron. Lett., vol. 36, no. 2, pp. 180-181, 2000.
    • (2000) Electron. Lett. , vol.36 , Issue.2 , pp. 180-181
    • Woo, K.1    Yang, T.2    Park, K.3    Lee, C.4
  • 14
    • 0036508040
    • Robust endpoint detection and energy normalization for real-time speech and speaker recognition
    • May
    • Q. Li, J. Zheng, A. Tsai, and Q. Zhou, "Robust endpoint detection and energy normalization for real-time speech and speaker recognition," IEEE Trans. Speech Audio Process., vol. 10, no. 3, pp. 146-157, May 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.3 , pp. 146-157
    • Li, Q.1    Zheng, J.2    Tsai, A.3    Zhou, Q.4
  • 15
    • 0036476655
    • Speech pause detection for noise spectrum estimation by tracking power envelope dynamics
    • Nov.
    • M. Marzinzik and B. Kollmeier, "Speech pause detection for noise spectrum estimation by tracking power envelope dynamics," IEEE Trans. Speech Audio Process., vol. 10, no. 6, pp. 341-351, Nov. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.6 , pp. 341-351
    • Marzinzik, M.1    Kollmeier, B.2
  • 16
    • 85026719883
    • Robust energy normalization using speech/non-speech discriminator for German connected digit recognition
    • Budapest, Hungary, Sep.
    • R. Chengalvarayan, "Robust energy normalization using speech/non-speech discriminator for German connected digit recognition," in Proc. EUROSPEECH 1999, Budapest, Hungary, Sep. 1999, pp. 61-64.
    • (1999) Proc. EUROSPEECH 1999 , pp. 61-64
    • Chengalvarayan, R.1
  • 17
    • 0026907622
    • Voice activity detection using a periodicity measure
    • R. Tucker, "Voice activity detection using a periodicity measure," Proc. Inst. Elect. Eng., vol. 139, no. 4, pp. 377-380, 1992.
    • (1992) Proc. Inst. Elect. Eng. , vol.139 , Issue.4 , pp. 377-380
    • Tucker, R.1
  • 18
    • 0035274536
    • Robust voice activity detection using higher-order statistics in the lpc residual domain
    • May
    • E. Nemer, R. Goubran, and S. Mahmoud, "Robust voice activity detection using higher-order statistics in the lpc residual domain," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 217-231, May 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 217-231
    • Nemer, E.1    Goubran, R.2    Mahmoud, S.3
  • 19
    • 0034228994
    • Voice activity detection in nonstationary noise
    • Jul.
    • S. G. Tanyer and H. Özer, "Voice activity detection in nonstationary noise," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 478-482, Jul. 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.4 , pp. 478-482
    • Tanyer, S.G.1    Özer, H.2
  • 21
    • 1842476689
    • Efficient voice activity detection algorithms using long-term speech information
    • _, "Efficient voice activity detection algorithms using long-term speech information," Speech Commun., vol. 42, no. 3-4, pp. 271-287, 2004.
    • (2004) Speech Commun. , vol.42 , Issue.3-4 , pp. 271-287
  • 22
    • 0022808786
    • A computational approach to edge detection
    • J. Canny, "A computational approach to edge detection," IEEE Trans. Pattern Anal. Machine Intell., vol. PAMI-8, pp. 679-698, 1986.
    • (1986) IEEE Trans. Pattern Anal. Machine Intell. , vol.PAMI-8 , pp. 679-698
    • Canny, J.1
  • 23
    • 0028375881
    • Multilevel nonlinear filters for edge detection and noise suppression
    • Feb.
    • H. Hwang and R. Haddad, "Multilevel nonlinear filters for edge detection and noise suppression," IEEE Trans. Signal Process., vol. 42, no. 2, pp. 249-258, Feb. 1994.
    • (1994) IEEE Trans. Signal Process. , vol.42 , Issue.2 , pp. 249-258
    • Hwang, H.1    Haddad, R.2
  • 24
    • 0037230192
    • An efficient method for L-filter design
    • Jan.
    • R. Öten and R. J. P. de Figueiredo, "An efficient method for L-filter design," IEEE Trans. Signal Process., vol. 51, no. 1, pp. 193-203, Jan. 2003.
    • (2003) IEEE Trans. Signal Process. , vol.51 , Issue.1 , pp. 193-203
    • Öten, R.1    De Figueiredo, R.J.P.2
  • 26
    • 0019071184
    • Nonparametric rank-order statistics applied to robust voiced-unvoiced-silence classification
    • May
    • B. V. Cox and L. K. Timothy, "Nonparametric rank-order statistics applied to robust voiced-unvoiced-silence classification," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 5, pp. 550-561, May 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.5 , pp. 550-561
    • Cox, B.V.1    Timothy, L.K.2
  • 28
    • 0026222410
    • Center weighted median filters and their applications to image enhancement
    • Sep.
    • S. Ko and Y. Lee, "Center weighted median filters and their applications to image enhancement," IEEE Trans. Circuits Syst., vol. 38, no. 9, pp. 984-993, Sep. 1991.
    • (1991) IEEE Trans. Circuits Syst. , vol.38 , Issue.9 , pp. 984-993
    • Ko, S.1    Lee, Y.2
  • 29
    • 0027191717
    • Application of adaptive order statistic filters in digital image/image sequence filtering
    • I. Pitas and A. V. Pitas, "Application of adaptive order statistic filters in digital image/image sequence filtering," in Proc. IEEE Int. Symp. Circuits and Systems (ISCAS), vol. 2, 1993, pp. 327-330.
    • (1993) Proc. IEEE Int. Symp. Circuits and Systems (ISCAS) , vol.2 , pp. 327-330
    • Pitas, I.1    Pitas, A.V.2
  • 33
    • 0031238211
    • ITU-T Recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications
    • A. Benyassine, E. Shlomot, H. Su, D. Massaloux, C. Lamblin, and J. Petit, "ITU-T Recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications," IEEE Commun. Mag., vol. 35, no. 9, pp. 64-73, 1997.
    • (1997) IEEE Commun. Mag. , vol.35 , Issue.9 , pp. 64-73
    • Benyassine, A.1    Shlomot, E.2    Su, H.3    Massaloux, D.4    Lamblin, C.5    Petit, J.6


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.