2013, Pages 6744-6748

Revisiting hybrid and GMM-HMM system combination techniques

Author keywords

deep neural networks; hybrid; system combination; tandem; TED

Indexed keywords

DEEP NEURAL NETWORKS; HYBRID; SYSTEM COMBINATION; TANDEM; TED

EID: 84890492591     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6638967     Document Type: Conference Paper
Times cited: 59

References (37)
  • 1
    • 0030638031
    • A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
    • JG Fiscus, "A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)," in Proc. IEEE ASRU, 1997, pp. 347-352
    • (1997) Proc. IEEE ASRU , pp. 347-352
    • Fiscus, J.G.1
  • 2
    • 85061808589
    • Explicit word error minimization in n-best list rescoring
    • A Stolcke, Y Konig, and M Weintraub, "Explicit word error minimization in n-best list rescoring," in EUROSPEECH, 1997
    • (1997) EUROSPEECH
    • Stolcke, A.1    Konig, Y.2    Weintraub, M.3
  • 3
    • 0034296009
    • Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
    • L. Mangu, E. Brill, and A. Stolcke, "Finding consensus in speech recognition: word error minimization and other applications of confusion networks," Computer Speech and Language, vol. 14, no. 4, pp. 373-400, 2000
    • (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 373-400
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 5
    • 44949249226
    • Generating complementary systems for speech recognition
    • C Breslin and MJF Gales, "Generating complementary systems for speech recognition," in INTERSPEECH, 2006
    • (2006) INTERSPEECH
    • Breslin, C.1    Gales, M.2
  • 6
    • 58149202339
    • Directed decision trees for generating complementary systems
    • C. Breslin and M. J. F. Gales, "Directed decision trees for generating complementary systems," Speech Communication, vol. 51, no. 3, pp. 284-295, 2009
    • (2009) Speech Communication , vol.51 , Issue.3 , pp. 284-295
    • Breslin, C.1    Gales, M.J.F.2
  • 10
    • 0033709098
    • Tandem connectionist feature extraction for conventional HMM systems
    • H Hermansky, DPW Ellis, and S Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. IEEE ICASSP, 2000
    • (2000) Proc. IEEE ICASSP
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 13
    • 84055222005
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • GE Dahl, D Yu, L Deng, and A Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech & Language Processing, vol. 20, no. 1, pp. 30-42, 2012
    • (2012) IEEE Transactions on Audio, Speech & Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 14
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • MJF Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, no. 2, pp. 75-98, April 1998
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.1
  • 16
    • 0028204660
    • Combining TDNN and HMM in a hybrid system for improved continuous-speech recognition
    • C. Dugast, L. Devillers, and X. Aubert, "Combining TDNN and HMM in a hybrid system for improved continuous-speech recognition," Speech and Audio Processing, IEEE Transactions on, vol. 2, no. 1, pp. 217-223, Jan. 1994
    • (1994) Speech and Audio Processing, IEEE Transactions on , vol.2 , Issue.1 , pp. 217-223
    • Dugast, C.1    Devillers, L.2    Aubert, X.3
  • 18
    • 0002384092
    • Large vocabulary continuous speech recognition using a hybrid connectionist/HMM system
    • M. Hochberg, S. Renals, T. Robinson, and D. Kershaw, "Large vocabulary continuous speech recognition using a hybrid connectionist/HMM system," in Proc. ICSLP, Yokohama, 1994
    • (1994) Proc. ICSLP, Yokohama
    • Hochberg, M.1    Renals, S.2    Robinson, T.3    Kershaw, D.4
  • 19
    • 0028288775
    • A hybrid segmental neural net/hidden Markov model system for continuous speech recognition
    • G. Zavaliagkos, Y. Zhao, R. Schwartz, and J. Makhoul, "A hybrid segmental neural net/hidden Markov model system for continuous speech recognition," Speech and Audio Processing, IEEE Transactions on, vol. 2, no. 1, pp. 151-160, Jan. 1994
    • (1994) Speech and Audio Processing, IEEE Transactions on , vol.2 , Issue.1 , pp. 151-160
    • Zavaliagkos, G.1    Zhao, Y.2    Schwartz, R.3    Makhoul, J.4
  • 20
    • 0029732695
    • Multilayer perceptrons for state-dependent weightings of HMM likelihoods
    • Y. J. Chung and C. K. Un, "Multilayer perceptrons for state-dependent weightings of HMM likelihoods," Speech Communication, vol. 18, no. 1, pp. 79-89, 1996
    • (1996) Speech Communication , vol.18 , Issue.1 , pp. 79-89
    • Chung, Y.J.1    Un, C.K.2
  • 22
    • 84878539964
    • Application of pretrained deep neural networks to large vocabulary speech recognition
    • N Jaitly, P Nguyen, A Senior, and V Vanhoucke, "Application of pretrained deep neural networks to large vocabulary speech recognition," in Interspeech, 2012
    • (2012) Interspeech
    • Jaitly, N.1    Nguyen, P.2    Senior, A.3    Vanhoucke, V.4
  • 23
    • 79959814724
    • Scarf: A segmental conditional random field toolkit for speech recognition
    • G Zweig and P Nguyen, "Scarf: A segmental conditional random field toolkit for speech recognition," in Interspeech, 2010, pp. 2858-2861
    • (2010) Interspeech , pp. 2858-2861
    • Zweig, G.1    Nguyen, P.2
  • 24
    • 0034825241
    • Multi-stream adaptive evidence combination for noise robust ASR
    • A Morris, A Hagen, H Glotin, and H Bourlard, "Multi-stream adaptive evidence combination for noise robust ASR," Speech Communication, vol. 34, no. 1-2, pp. 25-40, 2001
    • (2001) Speech Communication , vol.34 , Issue.1-2 , pp. 25-40
    • Morris, A.1    Hagen, A.2    Glotin, H.3    Bourlard, H.4
  • 25
    • 79953250475
    • Minimum Bayes risk decoding and system combination based on a recursion for edit distance
    • H Xu, D Povey, L Mangu, and J Zhu, "Minimum Bayes risk decoding and system combination based on a recursion for edit distance," Computer Speech and Language, vol. 25, no. 4, pp. 802-828, October 2011
    • (2011) Computer Speech and Language , vol.25 , Issue.4 , pp. 802-828
    • Xu, H.1    Povey, D.2    Mangu, L.3    Zhu, J.4
  • 35
    • 84858976070
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F Seide, G Li, X Chen, and D Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. IEEE ASRU, 2011
    • (2011) Proc. IEEE ASRU
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.