메뉴 건너뛰기




Volumn , Issue , 2013, Pages 3512-3516

Reverberant speech recognition based on denoising autoencoder

Author keywords

CENSREC 4; Denoising autoencoder; Distant talking speech recognition; Restricted boltzmann machine; Reverberant speech recognition

Indexed keywords

EXPERIMENTS; IMPULSE RESPONSE; LEARNING SYSTEMS; SPEECH RECOGNITION; STATISTICAL TESTS;

EID: 84906237188     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (93)

References (15)
  • 1
    • 85032750883 scopus 로고    scopus 로고
    • Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors
    • IEEE
    • K. Kumatani, J. McDonough, and B. Raj, "Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors, " Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 127-140, 2012.
    • (2012) Signal Processing Magazine , vol.29 , Issue.6 , pp. 127-140
    • Kumatani, K.1    McDonough, J.2    Raj, B.3
  • 2
    • 65249167097 scopus 로고    scopus 로고
    • Suppression of late reverberation effect on speech signal using long- 3515 term multiple-step linear prediction
    • may
    • K. Kinoshita, M. Delcroix, T. Nakatani, and M. Miyoshi, "Suppression of late reverberation effect on speech signal using long- 3515 term multiple-step linear prediction, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 17, no. 4, pp. 534 -545, may 2009.
    • (2009) Audio, Speech, and Language Processing, IEEE Transactions on , vol.17 , Issue.4 , pp. 534-545
    • Kinoshita, K.1    Delcroix, M.2    Nakatani, T.3    Miyoshi, M.4
  • 3
    • 77956752049 scopus 로고    scopus 로고
    • Dynamic features in the linear-logarithmic hybrid domain for automatic speech recognition in a reverberant environment
    • O. Ichikawa, T. Fukuda, and M. Nishimura, "Dynamic features in the linear-logarithmic hybrid domain for automatic speech recognition in a reverberant environment, " Selected Topics in Signal Processing, IEEE Journal of, vol. 4, pp. 816-823, 2010.
    • (2010) Selected Topics in Signal Processing, IEEE Journal of , vol.4 , pp. 816-823
    • Ichikawa, O.1    Fukuda, T.2    Nishimura, M.3
  • 4
    • 33746600649 scopus 로고    scopus 로고
    • Reducing the dimensionality of data with neural networks
    • G. Hinton and R. Salakhutdinov, "Reducing the dimensionality of data with neural networks, " Science, vol. 313, no. 5786, pp. 504- 507, 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.1    Salakhutdinov, R.2
  • 6
    • 79551480483 scopus 로고    scopus 로고
    • Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
    • Dec
    • P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P. A. Manzagol, "Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion, " J. Mach. Learn. Res., vol. 11, pp. 3371-3408, Dec. 2010.
    • (2010) J. Mach. Learn. Res. , vol.11 , pp. 3371-3408
    • Vincent, P.1    Larochelle, H.2    Lajoie, I.3    Bengio, Y.4    Manzagol, P.A.5
  • 11
    • 68049138790 scopus 로고    scopus 로고
    • Training products of experts by minimizing contrastive divergence
    • G. Hinton, "Training products of experts by minimizing contrastive divergence, " Neural Computation, vol. 14, p. 2002, 2000.
    • (2000) Neural Computation , vol.14 , pp. 2002
    • Hinton, G.1
  • 13
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, " IEEE Transaction on Acoustic Speech and Singal Processing, vol. 28, no. 4, pp. 357-366, 1980.
    • (1980) IEEE Transaction on Acoustic Speech and Singal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 15
    • 44849131087 scopus 로고    scopus 로고
    • The titech large vocabulary wfst speech recognition system
    • P. R. Dixon, D. A. Caseiro, T. Oonishi, and S. Furui, "The titech large vocabulary wfst speech recognition system, " in Proc. IEEE ASRU, 2007, pp. 443-448.
    • (2007) Proc. IEEE ASRU , pp. 443-448
    • Dixon, P.R.1    Caseiro, D.A.2    Oonishi, T.3    Furui, S.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.