메뉴 건너뛰기




Volumn , Issue , 2008, Pages 1293-1296

Phone-duration-dependent long-term dynamic features for a stochastic model-based voice activity detection

Author keywords

Average phoneme duration; Dynamic feature; Long term temporal information; Voice activity detection

Indexed keywords

AUTOMATIC SPEECH RECOGNITION SYSTEM; AVERAGE PHONEME DURATION; CEPSTRUM; CONVENTIONAL METHODS; DYNAMIC FEATURE; DYNAMIC FEATURES; ERROR REDUCTION; FEATURE PARAMETERS; LONG TERM DYNAMICS; LOW SNR; NOISE ROBUSTNESS; TEMPORAL INFORMATION; VOICE ACTIVITY DETECTION;

EID: 84867218934     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (5)

References (17)
  • 1
    • 0008808539 scopus 로고    scopus 로고
    • A comparative study of speech detection methods
    • S. V. Gerven and F. Xie, "A comparative study of speech detection methods," Proc. Eurospeech '97, vol. III, pp.1095-1098 (1997).
    • (1997) Proc. Eurospeech '97 , vol.3 , pp. 1095-1098
    • Gerven, S.V.1    Xie, F.2
  • 2
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim, W. Sung, "A statistical model-based voice activity detection," IEEE Signal Processing Letters, Vol. 6, pp. 1-3 (1999).
    • (1999) IEEE Signal Processing Letters , vol.6 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 3
    • 0035481845 scopus 로고    scopus 로고
    • Analysis and improvement of a statistical model-based voice activity detector
    • Y. D. Cho and A. Kondoz, "Analysis and improvement of a statistical model-based voice activity detector," IEEE Signal Processing Letters, Vol. 8, No. 10, pp.276-278 (2001).
    • (2001) IEEE Signal Processing Letters , vol.8 , Issue.10 , pp. 276-278
    • Cho, Y.D.1    Kondoz, A.2
  • 4
    • 0034854659 scopus 로고    scopus 로고
    • Robust speech / nonspeech detection using LDA applied to MFCC
    • A. Martin, D. Charlet, and M. Manuuary, "Robust speech / nonspeech detection using LDA applied to MFCC" Proc. ICASSP '01, vol. I, pp.237-240 (2001).
    • (2001) Proc. ICASSP '01 , vol.1 , pp. 237-240
    • Martin, A.1    Charlet, D.2    Manuuary, M.3
  • 5
    • 27644475276 scopus 로고    scopus 로고
    • An improved voice activity detection using higher order statistics
    • K. Li, M. N. S. Swamy, and M. O. Ahmad, "An improved voice activity detection using higher order statistics," IEEE Trans. Speech and Audio Processing, Vol. 13, No. 5, pp. 965-974 (2005).
    • (2005) IEEE Trans. Speech and Audio Processing , vol.13 , Issue.5 , pp. 965-974
    • Li, K.1    Swamy, M.N.S.2    Ahmad, M.O.3
  • 6
    • 33947627138 scopus 로고    scopus 로고
    • Robust endpoint detection for speech recognition based on discriminative feature extraction
    • K. Yamamoto, F. Jabloun, K. Reinhard, and A. Kawamura, "Robust endpoint detection for speech recognition based on discriminative feature extraction," Proc. ICASSP '06, vol. I, pp.805-808 (2006).
    • (2006) Proc. ICASSP '06 , vol.1 , pp. 805-808
    • Yamamoto, K.1    Jabloun, F.2    Reinhard, K.3    Kawamura, A.4
  • 7
    • 0032658253 scopus 로고    scopus 로고
    • TRAPS - Classifiers of Temporal Patterns
    • H. Hermansky and S. Sharma, "TRAPS - Classifiers of Temporal Patterns", Proc. ICASSP '99, Vol. I, pp. 289-292 (1999).
    • (1999) Proc. ICASSP '99 , vol.1 , pp. 289-292
    • Hermansky, H.1    Sharma, S.2
  • 8
    • 27144509179 scopus 로고    scopus 로고
    • Learning long-term temporal features in LVCSR using neural networks
    • B. Chen, Q. Zhu, and N. Morgan, "Learning long-term temporal features in LVCSR using neural networks", Proc. ICSLP, pp. 612-615 (2004).
    • (2004) Proc. ICSLP , pp. 612-615
    • Chen, B.1    Zhu, Q.2    Morgan, N.3
  • 9
    • 1842476689 scopus 로고    scopus 로고
    • Efficient voice activity detection algorithms using long-term speech information
    • J. Ramirez, J. C. Segura, C. Benitez, A. Torre, and A. Rubio, "Efficient voice activity detection algorithms using long-term speech information," Speech Communication, Vol. 42, pp. 271-287 (2004).
    • (2004) Speech Communication , vol.42 , pp. 271-287
    • Ramirez, J.1    Segura, J.C.2    Benitez, C.3    Torre, A.4    Rubio, A.5
  • 10
    • 0038694713 scopus 로고    scopus 로고
    • The analysis of speech in different temporal integration windows: Cerebral lateralization as asymmetric sampling in time
    • D. Poeppel, "The analysis of speech in different temporal integration windows: cerebral lateralization as asymmetric sampling in time," Speech Communication, Vol. 41, pp. 245-255 (2003).
    • (2003) Speech Communication , vol.41 , pp. 245-255
    • Poeppel, D.1
  • 11
    • 0027957839 scopus 로고
    • Effect of temporal envelope smearing on speech perception
    • R. Drullman, J. M. Festen, and R. Plomp, "Effect of temporal envelope smearing on speech perception," J. Acoust. Soc. Amer., Vol. 95, pp. 1053-1064 (1994).
    • (1994) J. Acoust. Soc. Amer. , vol.95 , pp. 1053-1064
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 12
    • 0028287770 scopus 로고
    • Effect of reducing slow temporal modulations on speech perception
    • R. Drullman, J. M. Festen, and R. Plomp, "Effect of reducing slow temporal modulations on speech perception," J. Acoust. Soc. Amer., Vol. 95, pp. 2670-2680 (1994).
    • (1994) J. Acoust. Soc. Amer. , vol.95 , pp. 2670-2680
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 13
    • 84856269531 scopus 로고    scopus 로고
    • Desired characteristics of modulation spectrum for robust automatic speech recognition
    • N. Kanedera, H. Hermansky, and T. Arai, "Desired characteristics of modulation spectrum for robust automatic speech recognition," Proc. ICASSP'98, pp.613-616 (1998).
    • (1998) Proc. ICASSP'98 , pp. 613-616
    • Kanedera, N.1    Hermansky, H.2    Arai, T.3
  • 14
    • 84867218137 scopus 로고    scopus 로고
    • Short- and Long-term Dynamic Features for Robust Speech Recognition
    • T. Fukuda, O. Ichikawa, and M. Nishimura, "Short- and Long-term Dynamic Features for Robust Speech Recognition," Interspeech 2008.
    • Interspeech 2008
    • Fukuda, T.1    Ichikawa, O.2    Nishimura, M.3
  • 15
    • 51449094649 scopus 로고    scopus 로고
    • Development of evaluation framework for voice activity detection under noisy environment
    • 2006-SLP-63, in Japanese
    • N. Kitaoka et. al, "Development of evaluation framework for voice activity detection under noisy environment," IPSJ sig. technical reports, 2006-SLP-63, pp. 1-6 (2006), in Japanese.
    • (2006) IPSJ Sig. Technical Reports , pp. 1-6
    • Kitaoka, N.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.