메뉴 건너뛰기




Volumn , Issue , 2013, Pages 297-302

Learning filter banks within a deep neural network framework

Author keywords

[No Author keywords available]

Indexed keywords

BROADCAST NEWS; CROSS ENTROPY; DEEP NEURAL NETWORKS; LEARNING APPROACH; LEARNING FILTERS; MEL-FILTER BANKS; SPEECH PRODUCTION; WORD ERROR RATE;

EID: 84893688455     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2013.6707746     Document Type: Conference Paper
Times cited : (151)

References (21)
  • 2
    • 84867711674 scopus 로고    scopus 로고
    • Learning invariant feature hierarchies
    • vol. 7583 of Lecture Notes in Computer Science, Springer
    • Y. LeCun, "Learning Invariant Feature Hierarchies, " in European Conference on Computer Vision (ECCV). 2012, vol. 7583 of Lecture Notes in Computer Science, pp. 496-505, Springer.
    • (2012) European Conference on Computer Vision (ECCV) , pp. 496-505
    • Lecun, Y.1
  • 3
    • 84867585919 scopus 로고    scopus 로고
    • Understanding how deep belief networks perform acoustic modelling
    • A. Mohamed, G. Hinton, and G. Penn, "Understanding how Deep Belief Networks Perform Acoustic Modelling, " in ICASSP, 2012.
    • (2012) ICASSP
    • Mohamed, A.1    Hinton, G.2    Penn, G.3
  • 7
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S. Davis and P. Mermelstein, "Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences, " IEEE Transacations on Acoustics, Speech and Signal Processing, vol. 28, no. 4, pp. 357 - 366, 1980. (Pubitemid 11464930)
    • (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 8
    • 79953288449 scopus 로고    scopus 로고
    • Data driven design of filter bank for speech recognition
    • Text, Speech and Dialogue
    • L. Burget and H. Hěrmansk̀y, "Data Driven Design of Filter Bank for Speech Recognition, " in Text, Speech and Dialogue. Springer, 2001, pp. 299-304. (Pubitemid 33329242)
    • (2001) Lecture Notes in Computer Science , Issue.2166 , pp. 299-304
    • Burget, L.1    Hermansky, H.2
  • 9
    • 64849109603 scopus 로고    scopus 로고
    • Data-driven filter-bank-based feature extraction for speech recognition
    • Y. Suh and H. Kim, "Data-Driven Filter-Bank-based Feature Extraction for Speech Recognition, " in Proc. SPECOM, 2004.
    • (2004) Proc. SPECOM
    • Suh, Y.1    Kim, H.2
  • 10
    • 0000551146 scopus 로고
    • A discriminative filter bank model for speech recognition
    • A. Biem, E. Mcdermott, and S. Katagiri, "A Discriminative Filter Bank Model For Speech Recognition, " in Proc. ICASSP, 1995.
    • (1995) Proc. ICASSP
    • Biem, A.1    Mcdermott, E.2    Katagiri, S.3
  • 12
    • 70349213445 scopus 로고    scopus 로고
    • Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
    • B. Kingsbury, "Lattice-Based Optimization of Sequence Classification Criteria for Neural-Network Acoustic Modeling, " in Proc. ICASSP, 2009.
    • (2009) Proc. ICASSP
    • Kingsbury, B.1
  • 14
    • 79951796005 scopus 로고    scopus 로고
    • The ibm attila speech recognition toolkit
    • H. Soltau, G. Saon, and B. Kingsbury, "The IBM Attila Speech Recognition Toolkit, " in Proc. SLT, 2010.
    • (2010) Proc. SLT
    • Soltau, H.1    Saon, G.2    Kingsbury, B.3
  • 15
    • 84867605836 scopus 로고    scopus 로고
    • Applying convolutional neural network concepts to hybrid NN-HMM model for speech recognition
    • O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying Convolutional Neural Network Concepts to Hybrid NN-HMM Model for Speech Recognition, " in Proc. ICASSP, 2012.
    • (2012) Proc. ICASSP
    • Abdel-Hamid, O.1    Mohamed, A.2    Jiang, H.3    Penn, G.4
  • 18
    • 84890545163 scopus 로고    scopus 로고
    • A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion
    • L. Deng, O. Abdel-Hamid, and D. Yu, "A Deep Convolutional Neural Network using Heterogeneous Pooling for Trading Acoustic Invariance with Phonetic Confusion, " in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Deng, L.1    Abdel-Hamid, O.2    Yu, D.3
  • 19
    • 0029747183 scopus 로고    scopus 로고
    • Speaker normalization using efficient frequency warping procedures
    • L. Lee and R. C. Rose, "Speaker Normalization using Efficient Frequency Warping Procedures, " in Proc. ICASSP, 1996.
    • (1996) Proc. ICASSP
    • Lee, L.1    Rose, R.C.2
  • 21
    • 0028496580 scopus 로고
    • Weight smoothing to improve network generalization
    • J. Jean and Jin Wang, "Weight Smoothing to Improve Network Generalization, " Neural Networks, IEEE Transactions on, vol. 5, no. 5, pp. 752-763, 1994.
    • (1994) Neural Networks, IEEE Transactions on , vol.5 , Issue.5 , pp. 752-763
    • Jean, J.1    Wang, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.