메뉴 건너뛰기




Volumn 2015-January, Issue , 2015, Pages 3141-3144

The IBM 2015 English conversational telephone speech recognition system

Author keywords

Conversational speech recognition; Convolutional neural networks; Recurrent neural networks

Indexed keywords

CONVOLUTION; MODELING LANGUAGES; NEURAL NETWORKS; RECURRENT NEURAL NETWORKS; SPEECH; SPEECH COMMUNICATION; TELEPHONE SETS;

EID: 84959129849     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (84)

References (30)
  • 1
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F. Seide, G. Li, X. Chien, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU, 2011.
    • (2011) Proc. ASRU
    • Seide, F.1    Li, G.2    Chien, X.3    Yu, D.4
  • 2
    • 0031187171 scopus 로고    scopus 로고
    • Speech recognition by machines and humans
    • R. P. Lippmann, "Speech recognition by machines and humans," Speech communication, vol. 22, no. 1, pp. 1-15, 1997.
    • (1997) Speech Communication , vol.22 , Issue.1 , pp. 1-15
    • Lippmann, R.P.1
  • 13
    • 84905265980 scopus 로고    scopus 로고
    • Joint training of convolutional and non-convolutional neural networks
    • H. Soltau, G. Saon, and T. N. Sainath, "Joint training of convolutional and non-convolutional neural networks," to Proc. ICASSP, 2014.
    • (2014) To Proc. ICASSP
    • Soltau, H.1    Saon, G.2    Sainath, T.N.3
  • 14
    • 84893691530 scopus 로고    scopus 로고
    • Speaker adaptation of neural network acoustic models using i-vectors
    • G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors," in Proc. ASRU, 2013.
    • (2013) Proc. ASRU
    • Saon, G.1    Soltau, H.2    Nahamoo, D.3    Picheny, M.4
  • 15
    • 84878379108 scopus 로고    scopus 로고
    • Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
    • B. Kingsbury, T. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in Proc. Interspeech, 2012.
    • (2012) Proc. Interspeech
    • Kingsbury, B.1    Sainath, T.2    Soltau, H.3
  • 20
    • 84890454527 scopus 로고    scopus 로고
    • Low-rank matrix factorization for deep neural network training with high-dimensional output targets
    • T. Sainath, B. Kingsbury, V. Sindhwani, E. Arisoy, and B. Ramabhadran, "Low-rank matrix factorization for deep neural network training with high-dimensional output targets," in Proc. of ICASSP, 2013.
    • (2013) Proc. of ICASSP
    • Sainath, T.1    Kingsbury, B.2    Sindhwani, V.3    Arisoy, E.4    Ramabhadran, B.5
  • 23
    • 0033329799 scopus 로고    scopus 로고
    • An empirical study of smoothing techniques for language modeling
    • S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Computer Speech & Language, vol. 13, no. 4, pp. 359-393, 1999.
    • (1999) Computer Speech & Language , vol.13 , Issue.4 , pp. 359-393
    • Chen, S.F.1    Goodman, J.2
  • 25
    • 84863387613 scopus 로고    scopus 로고
    • Shrinking exponential language models
    • S. F. Chen, "Shrinking exponential language models," in Proc. NAACL-HLT, 2009, pp. 468-476.
    • (2009) Proc. NAACL-HLT , pp. 468-476
    • Chen, S.F.1
  • 27
    • 85055309630 scopus 로고    scopus 로고
    • Ph.D. dissertation, Johns Hopkins University, Baltimore, MD, USA
    • A. Emami, "A neural syntactic language model," Ph.D. dissertation, Johns Hopkins University, Baltimore, MD, USA, 2006.
    • (2006) A Neural Syntactic Language Model
    • Emami, A.1
  • 28
    • 33847610331 scopus 로고    scopus 로고
    • Continuous space language models
    • H. Schwenk, "Continuous space language models," Computer Speech & Language, vol. 21, no. 3, pp. 492-518, 2007.
    • (2007) Computer Speech & Language , vol.21 , Issue.3 , pp. 492-518
    • Schwenk, H.1
  • 29
    • 44849092930 scopus 로고    scopus 로고
    • Empirical study of neural network language models for Arabic speech recognition
    • A. Emami and L. Mangu, "Empirical study of neural network language models for Arabic speech recognition," in Proc. ASRU, 2007, pp. 147-152.
    • (2007) Proc. ASRU , pp. 147-152
    • Emami, A.1    Mangu, L.2
  • 30


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.