메뉴 건너뛰기




Volumn 2015-January, Issue , 2015, Pages 3581-3585

A general artificial neural network extension for HTK

Author keywords

[No Author keywords available]

Indexed keywords

FEATURE EXTRACTION; HIDDEN MARKOV MODELS; HYBRID SYSTEMS; MARKOV PROCESSES; NEURAL NETWORKS; PROGRAM PROCESSORS; SPEECH COMMUNICATION; SPEECH PROCESSING;

EID: 84959142742     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (36)

References (40)
  • 1
    • 84976225254 scopus 로고    scopus 로고
    • http: //htk. eng. cam. ac. uk
  • 3
    • 0002144369 scopus 로고
    • Tree-basedstate tying for high accuracy acoustic modelling
    • Plainsboro, NJ, USA: MorganKaufman Publishers Inc
    • S. J. Young, J. J. Odell, and P. C. Woodland, "Tree-basedstate tying for high accuracy acoustic modelling, " Proc. HumanLanguage Technology Workshop, Plainsboro, NJ, USA: MorganKaufman Publishers Inc, 1994.
    • (1994) Proc. HumanLanguage Technology Workshop
    • Young, S.J.1    Odell, J.J.2    Woodland, P.C.3
  • 4
    • 0029288633 scopus 로고
    • Maximum likelihood linearregression for speaker adaptation of continuous density hiddenMarkov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linearregression for speaker adaptation of continuous density hiddenMarkov models, " Computer Speech & Language, Vol. 9, No. 2, pp. 171-185, 1995.
    • (1995) Computer Speech & Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 5
    • 0036296863 scopus 로고    scopus 로고
    • Minimum phone errorand I-smoothing for improved discriminative training
    • Orland o, FL, USA
    • D. Povey and P. C. Woodland, "Minimum phone errorand I-smoothing for improved discriminative training, " Proc. ICASSP'02, Orland o, FL, USA, 2002.
    • (2002) Proc. ICASSP'02
    • Povey, D.1    Woodland, P.C.2
  • 7
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineeringin context-dependent deep neural networks for conversationalspeech transcription
    • Waikoloa, HI, USA
    • F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineeringin context-dependent deep neural networks for conversationalspeech transcription, " Proc. ASRU'11, Waikoloa, HI, USA, 2011.
    • (2011) Proc. ASRU'11
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 8
    • 85032751458 scopus 로고    scopus 로고
    • Deep neural networks foracoustic modeling in speech recognition
    • Nov.
    • G. E. Hinton, L. Deng, D. Yu et al., "Deep neural networks foracoustic modeling in speech recognition, " IEEE Signal ProcessingMagazine, pp. 2-17, Nov. 2012.
    • (2012) IEEE Signal ProcessingMagazine , pp. 2-17
    • Hinton, G.E.1    Deng, L.2    Yu, D.3
  • 9
    • 33746600649 scopus 로고    scopus 로고
    • Reducing the dimensionalityof data with neural networks
    • Jul
    • G. E. Hinton and R. Salakhutdinov, "Reducing the dimensionalityof data with neural networks, " Science, vol. 313, no. 5786, pp. 504-507, Jul 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.2
  • 11
    • 84878379108 scopus 로고    scopus 로고
    • Scalable minimumBayes risk training of deep neural network acoustic models usingdistributed Hessian-free optimization
    • Portland, OR, USA
    • B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimumBayes risk training of deep neural network acoustic models usingdistributed Hessian-free optimization, " Proc. Interspeech'12, Portland, OR, USA, 2012.
    • (2012) Proc. Interspeech'12
    • Kingsbury, B.1    Sainath, T.N.2    Soltau, H.3
  • 12
    • 84910072497 scopus 로고    scopus 로고
    • Unfoldedrecurrent neural networks for speech recognition
    • Singapore
    • G. Saon, H. Soltau, A. Emami, and M. Picheny, "Unfoldedrecurrent neural networks for speech recognition, " Proc. Interspeech'14, Singapore, 2014.
    • (2014) Proc. Interspeech'14
    • Saon, G.1    Soltau, H.2    Emami, A.3    Picheny, M.4
  • 13
    • 84906225757 scopus 로고    scopus 로고
    • A scalable approach to usingDNN-derived features in GMM-HMM based acoustic modelingfor LVCSR
    • Lyon, France
    • Z.-J. Yan, Q. Huo, and J. Xu, "A scalable approach to usingDNN-derived features in GMM-HMM based acoustic modelingfor LVCSR, " Proc. Interspeech'13, Lyon, France, 2013.
    • (2013) Proc. Interspeech'13
    • Yan, Z.-J.1    Huo, Q.2    Xu, J.3
  • 15
    • 84910067710 scopus 로고    scopus 로고
    • Efficient GPU-based training of recurrent neural networklanguage models using spliced sentence bunch
    • Singapore
    • X. Chen, Y.-Q. Wang, X.-Y. Liu, M. J. F. Gales, and P. C. Woodland, "Efficient GPU-based training of recurrent neural networklanguage models using spliced sentence bunch, " Proc. Interspeech'14, Singapore, 2014.
    • (2014) Proc. Interspeech'14
    • Chen, X.1    Wang, Y.-Q.2    Liu, X.-Y.3    Gales, M.J.F.4    Woodland, P.C.5
  • 17
    • 84905222840 scopus 로고    scopus 로고
    • RASR/NN: The RWTH neural network toolkit for speech recognition
    • Florence, Italy
    • S. Wiesler, A. Richard, P. Golik, R. Schlüter, and H. Ney, "RASR/NN: The RWTH neural network toolkit for speech recognition, "Proc. ICASSP'14, Florence, Italy, 2014.
    • (2014) Proc. ICASSP'14
    • Wiesler, S.1    Richard, A.2    Golik, P.3    Schlüter, R.4    Ney, H.5
  • 18
    • 84893712779 scopus 로고    scopus 로고
    • D. Johnson, "Quicknet, " http: //www1. icsi. berkeley. edu/speech/qn. html.
    • Quicknet
    • Johnson, D.1
  • 19
    • 84959109976 scopus 로고    scopus 로고
    • The Cambridge university 2014 BOLT conversationaltelephone mand arin Chinese LVCSR system for speechtranslation
    • Dresden, Germany
    • X.-Y. Liu, F. Flego, L.-L. Wang, C. Zhang, M. J. F. Gales, and P. C. Woodland, "The Cambridge University 2014 BOLT conversationaltelephone Mand arin Chinese LVCSR system for speechtranslation, " Proc. Interspeech'15, Dresden, Germany, 2015.
    • (2015) Proc. Interspeech'15
    • Liu, X.-Y.1    Flego, F.2    Wang, L.-L.3    Zhang, C.4    Gales, M.J.F.5    Woodland, P.C.6
  • 20
    • 84959166110 scopus 로고    scopus 로고
    • Joint decoding of tand em and hybrid systemsfor improved keyword spotting on low resource languages
    • Dresden, Germany
    • H.-P. Wang, A. Ragni, M. J. F. Gales, K. M. Knill, P. C. Woodland, and C. Zhang, "Joint decoding of tand em and hybrid systemsfor improved keyword spotting on low resource languages, " Proc. Interspeech'15, Dresden, Germany, 2015.
    • (2015) Proc. Interspeech'15
    • Wang, H.-P.1    Ragni, A.2    Gales, M.J.F.3    Knill, K.M.4    Woodland, P.C.5    Zhang, C.6
  • 21
    • 84890543852 scopus 로고    scopus 로고
    • Error back propagation forsequence training of context-dependent deep networks for conversationalspeech transcription
    • Vancouver, Canada
    • H. Su, G. Li, D. Yu, and F. Seide, "Error back propagation forsequence training of context-dependent deep networks for conversationalspeech transcription, " Proc. ICASSP'13, Vancouver, Canada, 2013.
    • (2013) Proc. ICASSP'13
    • Su, H.1    Li, G.2    Yu, D.3    Seide, F.4
  • 23
    • 84983119674 scopus 로고    scopus 로고
    • Learning hidden unit contributionsfor unsupervised speaker adaptation of neural networkacoustic models
    • Lake Tahoe, USA, Dec.
    • P. Swietojanski and S. Renals, "Learning hidden unit contributionsfor unsupervised speaker adaptation of neural networkacoustic models, " Proc. IWSLT'14, Lake Tahoe, USA, Dec. 2014.
    • (2014) Proc. IWSLT'14
    • Swietojanski, P.1    Renals, S.2
  • 24
    • 84890542079 scopus 로고    scopus 로고
    • KL-divergence regularizeddeep neural network adaptation for improved large vocabularyspeech recognition
    • Vancouver, Canada
    • D. Yu, K. Yao, H. Su, G. Li, and F. Seide, "KL-divergence regularizeddeep neural network adaptation for improved large vocabularyspeech recognition, " Proc. ICASSP'13, Vancouver, Canada, 2013.
    • (2013) Proc. ICASSP'13
    • Yu, D.1    Yao, K.2    Su, H.3    Li, G.4    Seide, F.5
  • 25
    • 84959174678 scopus 로고    scopus 로고
    • Parameterised sigmoid and ReLUhidden activation functions for DNN acoustic modelling
    • Dresden, Germany
    • C. Zhang and P. C. Woodland, "Parameterised sigmoid and ReLUhidden activation functions for DNN acoustic modelling, " Proc. Interspeech'15, Dresden, Germany, 2015.
    • (2015) Proc. Interspeech'15
    • Zhang, C.1    Woodland, P.C.2
  • 26
    • 84893691530 scopus 로고    scopus 로고
    • Speaker adaptationof neural network acoustic models using i-vectors
    • Olomouc, Czech Republic
    • G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptationof neural network acoustic models using i-vectors, " Proc. ASRU'13, Olomouc, Czech Republic, 2013.
    • (2013) Proc. ASRU'13
    • Saon, G.1    Soltau, H.2    Nahamoo, D.3    Picheny, M.4
  • 31
    • 84905284804 scopus 로고    scopus 로고
    • Fine context, low-rank, softplus deep neuralnetworks for mobile speech recognition
    • Florence, Italy
    • A. Senior and X. Lei, "Fine context, low-rank, softplus deep neuralnetworks for mobile speech recognition, " Proc. ICASSP'14, Florence, Italy, 2014.
    • (2014) Proc. ICASSP'14
    • Senior, A.1    Lei, X.2
  • 33
    • 80052250414 scopus 로고    scopus 로고
    • Adaptive subgradient methodsfor online learning and stochastic optimization
    • J. Duchi, E. Hazan, and Y. Singer, "Adaptive subgradient methodsfor online learning and stochastic optimization, " The Journal ofMachine Learning Research, vol. 12, pp. 2121-2159, 2011.
    • (2011) The Journal OfMachine Learning Research , vol.12 , pp. 2121-2159
    • Duchi, J.1    Hazan, E.2    Singer, Y.3
  • 34
    • 84910072353 scopus 로고    scopus 로고
    • Asynchronous stochastic optimization for sequence trainingof deep neural networks: Towards big data
    • Singapore
    • E. McDermott, G. Heigold, P. J. Moreno, A. Senior, and M. Bacchiani, "Asynchronous stochastic optimization for sequence trainingof deep neural networks: Towards big data, " Proc. Interspeech'14, Singapore, 2014.
    • (2014) Proc. Interspeech'14
    • McDermott, E.1    Heigold, G.2    Moreno, P.J.3    Senior, A.4    Bacchiani, M.5
  • 36
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden Markovmodels
    • M. J. F. Gales, "Semi-tied covariance matrices for hidden Markovmodels, " IEEE Transactions on Speech and Audio Processing, vol. 7, no. 3, pp. 272-281, 1999.
    • (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , Issue.3 , pp. 272-281
    • Gales, M.J.F.1
  • 37
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations forHMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations forHMM-based speech recognition, " Computer speech & language, vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Computer Speech & Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 38
    • 84867614591 scopus 로고    scopus 로고
    • Scalable stacking and learningfor building deep architectures
    • Kyoto, Japan
    • L. Deng, D. Yu, and J. Platt, "Scalable stacking and learningfor building deep architectures, " Proc. ICASSP'12, Kyoto, Japan, 2012.
    • (2012) Proc. ICASSP'12
    • Deng, L.1    Yu, D.2    Platt, J.3
  • 39
    • 84905222971 scopus 로고    scopus 로고
    • Stand alone training ofcontext-dependent deep neural network acoustic models
    • Florence, Italy
    • C. Zhang and P. C. Woodland, "Stand alone training ofcontext-dependent deep neural network acoustic models, " Proc. ICASSP'14, Florence, Italy, 2014.
    • (2014) Proc. ICASSP'14
    • Zhang, C.1    Woodland, P.C.2
  • 40
    • 84890492591 scopus 로고    scopus 로고
    • Revisiting hybridand GMM-HMM system combination techniques
    • Vancouver, Canada
    • P. Swietojanski, A. Ghoshal, and S. Renals, "Revisiting hybridand GMM-HMM system combination techniques, " Proc. ICASSP'13, Vancouver, Canada, 2013.
    • (2013) Proc. ICASSP'13
    • Swietojanski, P.1    Ghoshal, A.2    Renals, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.