메뉴 건너뛰기




Volumn , Issue , 2012, Pages 107-112

Improving large vocabulary continuous speech recognition by combining GMM-based and reservoir-based acoustic modeling

Author keywords

continuous speech recognition; reservoir computing; tandem acoustic modeling

Indexed keywords

ACOUSTIC MODELING; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; PHONEME RECOGNITION; RECOGNITION ACCURACY; RESERVOIR COMPUTING; WORD ERROR RATE REDUCTIONS;

EID: 84874271357     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SLT.2012.6424206     Document Type: Conference Paper
Times cited : (3)

References (19)
  • 2
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional hmm systems
    • H. Hermansky, D.P.W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional hmm systems," in Proc. of ICASSP, 2000, pp. 1635-1638.
    • (2000) Proc. of ICASSP , pp. 1635-1638
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 3
    • 84865704330 scopus 로고    scopus 로고
    • A bottom-up stepwise knowledgeintegration approach to large vocabulary continuous speech recognition using weighted finite state machines
    • Sabato Marco Siniscalchi, Torbjørn Svendsen, and Chin-Hui Lee, "A bottom-up stepwise knowledgeintegration approach to large vocabulary continuous speech recognition using weighted finite state machines," in Proc. of INTERSPEECH, 2011, pp. 901-904.
    • (2011) Proc. of INTERSPEECH , pp. 901-904
    • Siniscalchi, S.M.1    Svendsen, T.2    Lee, C.3
  • 4
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • Frank Seide, Gang Li, and Dong Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. of INTERSPEECH, 2011, pp. 437-440.
    • (2011) Proc. of INTERSPEECH , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 5
    • 84055222005 scopus 로고    scopus 로고
    • Contextdependent pre-trained deep neural networks for largevocabulary speech recognition
    • G.E. Dahl, Dong Yu, Li Deng, and A. Acero, "Contextdependent pre-trained deep neural networks for largevocabulary speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 6
    • 84865768819 scopus 로고    scopus 로고
    • Deep convex net: A scalable architecture for speech pattern classification
    • Li Deng and Dong Yu, "Deep convex net: A scalable architecture for speech pattern classification," in Proc. of INTERSPEECH, 2011, pp. 2285-2288.
    • (2011) Proc. of INTERSPEECH , pp. 2285-2288
    • Deng, L.1    Yu, D.2
  • 7
    • 84867614591 scopus 로고    scopus 로고
    • Scalable stacking and learning for building deep architectures
    • Li Deng, Dong Yu, and John Platt, "Scalable stacking and learning for building deep architectures," in Proc. of ICASSP, 2012, pp. 2133-2136.
    • (2012) Proc. of ICASSP , pp. 2133-2136
    • Deng, L.1    Yu, D.2    Platt, J.3
  • 8
    • 84867606917 scopus 로고    scopus 로고
    • A deep architecture with bilinear modeling of hidden representations: Applications to phonetic recognition
    • Brian Hutchinson, Li Deng, and Dong Yu, "A deep architecture with bilinear modeling of hidden representations: applications to phonetic recognition," in Proc. of ICASSP, 2012, pp. 4805-4808.
    • (2012) Proc. of ICASSP , pp. 4805-4808
    • Hutchinson, B.1    Deng, L.2    Yu, D.3
  • 11
    • 0025503558 scopus 로고
    • Backpropagation through time: What it does and how to do it
    • oct
    • P.J. Werbos, "Backpropagation through time: what it does and how to do it," Proceedings of the IEEE, vol. 78, no. 10, pp. 1550-1560, oct 1990.
    • (1990) Proceedings of the IEEE , vol.78 , Issue.10 , pp. 1550-1560
    • Werbos, P.J.1
  • 13
    • 84878591993 scopus 로고    scopus 로고
    • Continuous digit recognition in noise: Reservoirs can do an excellent job
    • Azarakhsh Jalalvand, Fabian Triefenbach, and Jean-Pierre Martens, "Continuous digit recognition in noise: Reservoirs can do an excellent job," in Proc. of INTERSPEECH, 2012.
    • (2012) Proc. of INTERSPEECH
    • Jalalvand, A.1    Triefenbach, F.2    Martens, J.3
  • 15
    • 84871612455 scopus 로고    scopus 로고
    • Optimal feature sub-space selection based on discriminant analysis
    • Kris Demuynck, Jacques Duchateau, and Dirk Van Compernolle, "Optimal feature sub-space selection based on discriminant analysis," in Proc. of EUROSPEECH, 1999, pp. 1311-1314.
    • (1999) Proc. of EUROSPEECH , pp. 1311-1314
    • Demuynck, K.1    Duchateau, J.2    Van Compernolle, D.3
  • 16
    • 84865734256 scopus 로고    scopus 로고
    • Analysis and comparison of recent mlp features for lvcsr systems
    • Fabio Valente, Mathew Magimai-Doss, and Wen Wang, "Analysis and comparison of recent mlp features for lvcsr systems," in Proc. of INTERSPEECH, 2011, pp. 1245-1248.
    • (2011) Proc. of INTERSPEECH , pp. 1245-1248
    • Valente, F.1    Magimai-Doss, M.2    Wang, W.3
  • 19
    • 33646759445 scopus 로고    scopus 로고
    • Pronunciation variation modeling for ASR: Large improvements are possible but small ones are likely
    • Cheng Yang, Jean-Pierre Martens, Pol Ghesquiere, and Dirk Van Compernolle, "Pronunciation Variation Modeling for ASR: Large Improvements are possible but small ones are likely," in Proc. of ITRWon PMLA, 2002, pp. 123-128.
    • (2002) Proc. of ITRWon PMLA , pp. 123-128
    • Yang, C.1    Martens, J.2    Ghesquiere, P.3    Van Compernolle, D.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.