메뉴 건너뛰기




Volumn , Issue , 2009, Pages

Using artificial neural network For robust voice activity detection under adverse conditions

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL NEURAL NETWORK; DEVELOPED MODEL; EMPIRICAL RESULTS; HARSH ENVIRONMENT; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MODEL-BASED; NEURAL NETWORK CLASSIFIER; NEURAL NETWORK TRAINING; NOISY SPEECH; OPTIMIZATION PROCEDURES; RECENT STATE; RELIABLE MODELS; VOICE ACTIVITY DETECTION;

EID: 71049181730     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/RIVF.2009.5174662     Document Type: Conference Paper
Times cited : (20)

References (20)
  • 1
    • 0031238211 scopus 로고    scopus 로고
    • ITU-T recommendation G.729 annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications
    • A. Benyassine, E. Shlomot, H.-Y. Su, D. Massaloux, C. Lamblin, and J.-P. Petit, "ITU-T Recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications," IEEE Communications Magazine, vol. 35, no. 9, pp. 64-73, 1997.
    • (1997) IEEE Communications Magazine , vol.35 , Issue.9 , pp. 64-73
    • Benyassine, A.1    Shlomot, E.2    Su, H.-Y.3    Massaloux, D.4    Lamblin, C.5    Petit, J.-P.6
  • 2
    • 0041360463 scopus 로고    scopus 로고
    • Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    • I. Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," IEEE Trans. on Speech and Audio Processing, vol. 11, no. 5, pp. 466-475, 2003.
    • (2003) IEEE Trans. on Speech and Audio Processing , vol.11 , Issue.5 , pp. 466-475
    • Cohen, I.1
  • 4
    • 38149039412 scopus 로고    scopus 로고
    • chapter Speaker Segmentation for Air Traffic Control, Springer
    • M. Neffe, T. V. Pham, H. Hering, and G. Kubin, Speaker Classification II, LNCS, vol. 4441, chapter Speaker Segmentation for Air Traffic Control, pp. 177-191, Springer, 2007.
    • (2007) Speaker Classification II, LNCS , vol.4441 , pp. 177-191
    • Neffe, M.1    Pham, T.V.2    Hering, H.3    Kubin, G.4
  • 5
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Processing Letters, vol. 6, no. 1, pp. 1-3, 1999.
    • (1999) IEEE Signal Processing Letters , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 6
    • 0042863279 scopus 로고    scopus 로고
    • A soft voice activity detector based on a laplacian-gaussian model
    • S. Gazor and W. Zhang, "A soft voice activity detector based on a Laplacian-Gaussian model," IEEE Trans. on Speech and Audio Processing, vol. 11, no. 5, pp. 498-505, 2003.
    • (2003) IEEE Trans. on Speech and Audio Processing , vol.11 , Issue.5 , pp. 498-505
    • Gazor, S.1    Zhang, W.2
  • 7
    • 33744532633 scopus 로고    scopus 로고
    • Voice activity detection based on multiple statistical models
    • J.H. Chang, N.S. Kim, and S.K. Mitra, "Voice activity detection based on multiple statistical models," IEEE Trans. on Signal Processing, vol. 54, no. 6, pp. 1965-1976, 2006.
    • (2006) IEEE Trans. on Signal Processing , vol.54 , Issue.6 , pp. 1965-1976
    • Chang, J.H.1    Kim, N.S.2    Mitra, S.K.3
  • 8
    • 34249676923 scopus 로고    scopus 로고
    • Robust voice activity detection using perceptual wavelet-packet transform and Teager energy operator
    • S.H. Chen, H.T. Wu, Y. Chang, and T. K. Truong, "Robust voice activity detection using perceptual wavelet-packet transform and Teager energy operator," Pattern Recognition Letters, vol. 28, no. 11, pp. 1327-1332,2007.
    • (2007) Pattern Recognition Letters , vol.28 , Issue.11 , pp. 1327-1332
    • Chen, S.H.1    Wu, H.T.2    Chang, Y.3    Truong, T.K.4
  • 10
    • 66149186195 scopus 로고    scopus 로고
    • Voice activity detection based on conditional MAP criterion
    • J. W. Shin, H. J. Kwon, S. H. Jin, and N. S. Kim, "Voice activity detection based on conditional MAP criterion," Signal Processing Letters, vol. 15, pp. 257-260, 2008.
    • (2008) Signal Processing Letters , vol.15 , pp. 257-260
    • Shin, J.W.1    Kwon, H.J.2    Jin, S.H.3    Kim, N.S.4
  • 12
    • 84867193135 scopus 로고    scopus 로고
    • Voice activity detection algorithms using subband power distance feature for noisy environments
    • Brisbane, Australia
    • T. V. Pham, M. Stadtschnitzer, F. Pernkopf, and G. Kubin, "Voice activity detection algorithms using subband power distance feature for noisy environments," in Proc. Interspeech, Brisbane, Australia, 2008.
    • (2008) Proc. Interspeech
    • Pham, T.V.1    Stadtschnitzer, M.2    Pernkopf, F.3    Kubin, G.4
  • 14
    • 38749086536 scopus 로고    scopus 로고
    • Voice/nonvoice classification using reliable fundamental frequency estimator for voice activated powered wheelchair control
    • Soo-Young Suk, Hyun-Yeol Chung, and Hiroaki Kojima, "Voice/nonvoice classification using reliable fundamental frequency estimator for voice activated powered wheelchair control," Lecture Notes in Computer Science, vol. 4523/2007, pp. 347-357, 2007.
    • (2007) Lecture Notes in Computer Science , vol.4523 , Issue.2007 , pp. 347-357
    • Suk, S.-Y.1    Chung, H.-Y.2    Kojima, H.3
  • 16
    • 71049141519 scopus 로고    scopus 로고
    • The Rice University, "Noisex-92 database,"
    • The Rice University, "Noisex-92 database," http://spib.rice. edu/spib/.
  • 17
    • 44949128271 scopus 로고    scopus 로고
    • Evaluation of objective measures for speech enhancement
    • Philadelphia, PA
    • Y. Hu and P. Loizou, "Evaluation of objective measures for speech enhancement," in Proceedings of INTERSPEECH-2006, Philadelphia, PA, 2006.
    • (2006) Proceedings of INTERSPEECH-2006
    • Hu, Y.1    Loizou, P.2
  • 18
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition
    • Aug
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition," Trans. Acoust., Speech, Signal Processing, Vol. 28, pp. 357-366, Aug. 1980.
    • (1980) Trans. Acoust., Speech, Signal Processing , vol.28 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 19
    • 69049105093 scopus 로고    scopus 로고
    • A brief description of the levenberg-marquardt algorithm implemened
    • Foundation for Research and Technology, Hellas
    • M. I. A. Lourakis, "A brief description of the Levenberg-Marquardt algorithm implemened," Tech. Rep., Foundation for Research and Technology, Hellas, 2007.
    • (2007) Tech. Rep.
    • Lourakis, M.I.A.1
  • 20
    • 71049188723 scopus 로고    scopus 로고
    • Strategic Targeted Research Project in the 6th Frame Program of the European Union, FP6-511587
    • "Services for NOmadic Workers (snow)," Strategic Targeted Research Project in the 6th Frame Program of the European Union, FP6-511587.
    • Services for NOmadic Workers (snow)


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.