Volume , Issue , 2014, Pages 1558-1562

Improving the speech activity detection for the DARPA RATS phase-3 evaluation

Author keywords

Bottleneck features; Neural network; Speech activity detection

Indexed keywords

NEURAL NETWORKS; RATS; SPEECH; SPEECH COMMUNICATION

EID: 84910088867     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (10)
  • 1
    • 84910038669
    • Developing a speech activity detection system for the DARPA RATS program
    • Ng, T., Zhang, B., et al., "Developing a Speech Activity Detection System for the DARPA RATS Program", ICASSP, 2012.
    • (2012) ICASSP
    • Ng, T.1    Zhang, B.2
  • 2
    • 34047272330
    • Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations
    • May
    • Mesgarani, N., Slaney, M. and Shamma, S., "Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations", IEEE Trans. on Audio, Speech and Language Processing, vol. 14, pp. 920-930, May 2006.
    • (2006) IEEE Trans. on Audio, Speech and Language Processing , vol.14 , pp. 920-930
    • Mesgarani, N.1    Slaney, M.2    Shamma, S.3
  • 4
    • 0032137627
    • Channel normalization techniques for automatic speech recognition over the telephone
    • De Veth, J. and Boves, L., "Channel normalization techniques for automatic speech recognition over the telephone", Speech Communication, vol. 25, pp. 149-164, 1998.
    • (1998) Speech Communication , vol.25 , pp. 149-164
    • De Veth, J.1    Boves, L.2
  • 5
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Gales, M., "Maximum Likelihood Linear Transformations for HMM-based Speech Recognition", Computer Speech and Language, vol. 12, pp. 75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , pp. 75-98
    • Gales, M.1
  • 6
    • 33646777278
    • A generalization of linear discriminant analysis in maximum likelihood framework
    • Kumar, N. and Andreou, A., "A generalization of linear discriminant analysis in maximum likelihood framework," Johns Hopkins University, Tech. Rep., 1996.
    • (1996) Johns Hopkins University, Tech. Rep.
    • Kumar, N.1    Andreou, A.2
  • 8
    • 85032751458
    • Deep neural networks for acoustic modeling in speech recognition
    • November
    • Hinton, G., Deng, L., et al., "Deep Neural Networks for Acoustic Modeling in Speech Recognition", IEEE Signal Processing Magazine, vol. 29, November 2012.
    • (2012) IEEE Signal Processing Magazine , vol.29
    • Hinton, G.1    Deng, L.2
  • 10
    • 84906222432
    • The IBM speech activity detection system for the DARPA RATS program
    • Saon, G., Thomas, S., Soltau, H., et al., "The IBM Speech Activity Detection System for the DARPA RATS Program", Interspeech, 2013.
    • (2013) Interspeech
    • Saon, G.1    Thomas, S.2    Soltau, H.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.