메뉴 건너뛰기




Volumn , Issue , 2014, Pages 5597-5601

Standalone training of context-dependent deep neural network acoustic models

Author keywords

[No Author keywords available]

Indexed keywords

HIDDEN MARKOV MODELS; ITERATIVE METHODS; SPEECH RECOGNITION;

EID: 84905222971     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854674     Document Type: Conference Paper
Times cited : (25)

References (22)
  • 3
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • Florence, Italy, Sep
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech'11, Florence, Italy, Sep. 2011, pp. 437-440.
    • (2011) Proc. Interspeech'11 , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 4
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • Jan
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, Jan. 2012.
    • (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 6
    • 84890492030 scopus 로고    scopus 로고
    • An investigation of deep neural networks for noise robust speech recognition
    • Vancouver, Canada
    • M. L. Seltzer, D. Yu, and Y.-Q. Wang, "An investigation of deep neural networks for noise robust speech recognition," in Proc. ICASSP'13, Vancouver, Canada, 2013, pp. 7398-7402.
    • (2013) Proc. ICASSP'13 , pp. 7398-7402
    • Seltzer, M.L.1    Yu, D.2    Wang, Y.-Q.3
  • 7
    • 84890474716 scopus 로고    scopus 로고
    • Deep neural network features and semi-supervised training for low resource speech recognition
    • Vancouver, Canada
    • S. Thomas, M. L. Seltzer, K. Church, and H. Hermansky, "Deep neural network features and semi-supervised training for low resource speech recognition," in Proc. ICASSP'13, Vancouver, Canada, 2013, pp. 6704-6708.
    • (2013) Proc. ICASSP'13 , pp. 6704-6708
    • Thomas, S.1    Seltzer, M.L.2    Church, K.3    Hermansky, H.4
  • 8
    • 84890537527 scopus 로고    scopus 로고
    • Multi-level adaptive networks in tandem and hybrid ASR systems
    • Vancouver, Canada
    • P. Bell, P. Swietojanski, and S. Renals, "Multi-level adaptive networks in tandem and hybrid ASR systems," in Proc. ICASSP'13, Vancouver, Canada, 2013, pp. 7947-7951.
    • (2013) Proc. ICASSP'13 , pp. 7947-7951
    • Bell, P.1    Swietojanski, P.2    Renals, S.3
  • 9
    • 84893668957 scopus 로고    scopus 로고
    • Investigation of multilingual deep neural networks for spoken term detection
    • Olomouc, Czech Republic
    • K. M. Knill, M. J. F. Gales, S. P. Rath, P. C. Woodland, C. Zhang, and S.-X. Zhang, "Investigation of multilingual deep neural networks for spoken term detection," in Proc. ASRU'13, Olomouc, Czech Republic, 2013, pp. 138-143.
    • (2013) Proc. ASRU'13 , pp. 138-143
    • Knill, K.M.1    Gales, M.J.F.2    Rath, S.P.3    Woodland, P.C.4    Zhang, C.5    Zhang, S.-X.6
  • 10
    • 0030648426 scopus 로고    scopus 로고
    • Speech recognition using neural networks with forward-backward probability generated targets
    • Munich, Germany
    • Y.-H. Yan, M. Fanty, and R. Cole, "Speech recognition using neural networks with forward-backward probability generated targets," in Proc. ICASSP'97, Munich, Germany, 1997, pp. 3241-3244.
    • (1997) Proc. ICASSP'97 , pp. 3241-3244
    • Yan, Y.-H.1    Fanty, M.2    Cole, R.3
  • 11
    • 0002144369 scopus 로고
    • Tree-based state tying for high accuracy acoustic modelling
    • Plainsboro, NJ, USA
    • S. J. Young, J. J. Odell, and P. C. Woodland, "Tree-based state tying for high accuracy acoustic modelling," in Proc. Human Language Technology Workshop, Plainsboro, NJ, USA, 1994, pp. 307-312.
    • (1994) Proc. Human Language Technology Workshop , pp. 307-312
    • Young, S.J.1    Odell, J.J.2    Woodland, P.C.3
  • 12
    • 84926060821 scopus 로고
    • Large vocabulary continuous speech recognition using HTK
    • Adelaide, Australia
    • P. C. Woodland, J. J. Odell, V. Valtchev, and S. J. Young, "Large vocabulary continuous speech recognition using HTK," in Proc. ICASSP'94, Adelaide, Australia, 1994, vol. 2, pp. 125-128.
    • (1994) Proc. ICASSP'94 , vol.2 , pp. 125-128
    • Woodland, P.C.1    Odell, J.J.2    Valtchev, V.3    Young, S.J.4
  • 15
    • 0742286348 scopus 로고    scopus 로고
    • Robust combination of neural networks and hidden Markov models for speech recognition
    • Nov
    • E. Trentin and M. Gori, "Robust combination of neural networks and hidden Markov models for speech recognition," IEEE Transactions on Neural Networks, vol. 14, no. 6, pp. 1519-1531, Nov. 2003.
    • (2003) IEEE Transactions on Neural Networks , vol.14 , Issue.6 , pp. 1519-1531
    • Trentin, E.1    Gori, M.2
  • 16
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • Waikoloa, HI, USA
    • F. Seide, G. Li, X. Chen, and Y. Dong, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU'11, Waikoloa, HI, USA, 2011, pp. 24-29.
    • (2011) Proc. ASRU'11 , pp. 24-29
    • Seide, F.1    Li, G.2    Chen, X.3    Dong, Y.4
  • 17
  • 19
    • 0141703325 scopus 로고    scopus 로고
    • Automatic complexity control for HLDA systems
    • Hong Kong, Hong Kong, Apr
    • X.-Y. Liu, M. J. F. Gales, and P. C. Woodland, "Automatic complexity control for HLDA systems," in Proc. ICASSP'03, Hong Kong, Hong Kong, Apr. 2003, vol. 1, pp. 132-135.
    • (2003) Proc. ICASSP'03 , vol.1 , pp. 132-135
    • Liu, X.-Y.1    Gales, M.J.F.2    Woodland, P.C.3
  • 20
    • 0036296863 scopus 로고    scopus 로고
    • Minimum phone error and I-smoothing for improved discriminative training
    • Orlando, FL, USA
    • D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP'02, Orlando, FL, USA, 2002, vol. 1, pp. 105-108.
    • (2002) Proc. ICASSP'02 , vol.1 , pp. 105-108
    • Povey, D.1    Woodland, P.C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.