메뉴 건너뛰기




Volumn 22, Issue 11, 2014, Pages 1660-1669

Regression-based context-dependent modeling of deep neural networks for speech recognition

Author keywords

Articulatory features; context dependent modeling; deep neural network; logistic regression

Indexed keywords

DECISION TREES; REGRESSION ANALYSIS; TELEPHONE SETS; TREES (MATHEMATICS);

EID: 84916199887     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASLP.2014.2344855     Document Type: Article
Times cited : (18)

References (31)
  • 1
    • 0030244826 scopus 로고    scopus 로고
    • A review of large-vocabulary continuous-speech recognition
    • Sep
    • S. Young, "A review of large-vocabulary continuous-speech recognition," IEEE Signal Process. Mag., vol. 13, no. 5, pp. 45-57, Sep. 1996.
    • (1996) IEEE Signal Process. Mag , vol.13 , Issue.5 , pp. 45-57
    • Young, S.1
  • 2
    • 0027683813 scopus 로고
    • Shared-distribution hiddenMarkov models for speech recognition
    • Oct
    • M. Hwang and X. Huang, "Shared-distribution hiddenMarkov models for speech recognition," IEEE Trans. Speech Audio Process., vol. 1, no. 4, pp. 414-420, Oct. 1993.
    • (1993) IEEE Trans. Speech Audio Process , vol.1 , Issue.4 , pp. 414-420
    • Hwang, M.1    Huang, X.2
  • 3
    • 0002144369 scopus 로고
    • Tree-based state tying for high accuracy acoustic modelling
    • S. J. Young, J. J. Odell, and P. C.Woodland, "Tree-based state tying for high accuracy acoustic modelling," in Proc. HLT, 1994, pp. 307-312.
    • (1994) Proc. HLT , pp. 307-312
    • Young, S.J.1    Odell, J.J.2    Woodland, P.C.3
  • 4
    • 0034273299 scopus 로고    scopus 로고
    • Robust decision tree state tying for continuous speech recognition
    • Sep
    • W. Reichl and W. Chou, "Robust decision tree state tying for continuous speech recognition," IEEE Trans. Speech Audio Process., vol. 8, no. 5, pp. 555-566, Sep. 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.5 , pp. 555-566
    • Reichl, W.1    Chou, W.2
  • 5
    • 84867620524 scopus 로고    scopus 로고
    • An investigation of tied-mixture GMMbased triphone state clustering
    • G.Wang and K. C. Sim, "An investigation of tied-mixture GMMbased triphone state clustering," in Proc. ICASSP, 2012, pp. 4717-4720.
    • (2012) Proc. ICASSP , pp. 4717-4720
    • Wang, G.1    Sim, K.C.2
  • 6
    • 33746600649 scopus 로고    scopus 로고
    • Reducing the dimensionality of data with neural networks
    • G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 7
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • G. E. Hinton, S. Osindero, and Y.-W. Teh, "A fast learning algorithm for deep belief nets," Neural Comput., vol. 18, no. 7, pp. 1527-1554, 2006.
    • (2006) Neural Comput , vol.18 , Issue.7 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.-W.3
  • 8
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pretrained deep neural networks for large vocabulary speech recognition
    • Jan.
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pretrained deep neural networks for large vocabulary speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 30-42, Jan. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 9
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F. Seide,G.Li,X.Chen, andD.Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU, 2011, pp. 24-29.
    • (2011) Proc. ASRU , pp. 24-29
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 13
    • 0025419316 scopus 로고
    • Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition
    • Apr
    • K. F. Lee, "Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. 38, no. 4, pp. 599-609, Apr. 1990.
    • (1990) IEEE Trans. Acoust., Speech, Signal Process , vol.38 , Issue.4 , pp. 599-609
    • Lee, K.F.1
  • 15
    • 84893699565 scopus 로고    scopus 로고
    • Context-dependent modelling of deep neural network using logistic regression
    • G.Wang and K. C. Sim, "Context-dependent modelling of deep neural network using logistic regression," in Proc. IEEE Workshop Autom. Speech Recogn. Understand., 2013, pp. 338-343.
    • (2013) Proc IEEE Workshop Autom. Speech Recogn. Understand , pp. 338-343
    • Wang, G.1    Sim, K.C.2
  • 16
    • 84916235883 scopus 로고
    • Multiple codebook semi-continuous hidden Markov models for speaker-independent continuous speech recognition
    • X. Huang, H. Hon, and K. Lee, "Multiple codebook semi-continuous hidden Markov models for speaker-independent continuous speech recognition," Carnegie Mellon Univ., Computer Science Dept., Tech. Rep. v. 89-136, 1989.
    • (1989) Carnegie Mellon Univ., Computer Science Dept., Tech. Rep , vol.89 , Issue.136
    • Huang, X.1    Hon, H.2    Lee, K.3
  • 18
    • 79955538498 scopus 로고    scopus 로고
    • Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis
    • K. Yu, H. Zen, F.Mairesse, and S. Young, "Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis," Speech Commun., vol. 53, no. 6, pp. 914-923, 2011.
    • (2011) Speech Commun , vol.53 , Issue.6 , pp. 914-923
    • Yu, K.1    Zen, H.2    Mairesse, F.3    Young, S.4
  • 19
    • 0013344078 scopus 로고    scopus 로고
    • Training products of experts by minimizing contrastive divergence
    • G. E. Hinton, "Training products of experts by minimizing contrastive divergence," Neural Comput., vol. 14, no. 8, pp. 1771-1800, 2002.
    • (2002) Neural Comput , vol.14 , Issue.8 , pp. 1771-1800
    • Hinton, G.E.1
  • 20
    • 77949342620 scopus 로고    scopus 로고
    • Discriminative product-of-expert acoustic mapping for cross-lingual phone recognition
    • K. C. Sim, "Discriminative product-of-expert acoustic mapping for cross-lingual phone recognition," in Proc. ASRU, 2009, pp. 546-551.
    • (2009) Proc. ASRU , pp. 546-551
    • Sim, K.C.1
  • 21
    • 77949394249 scopus 로고    scopus 로고
    • Phoneme recognition based on long temporal context. Brno, Czech Republic: Brno Univ. of Technology
    • P. Schwarz, Phoneme recognition based on long temporal context. Brno, Czech Republic: Brno Univ. of Technology, Faculty of Inf. Technol., 2008.
    • (2008) Faculty of Inf. Technol
    • Schwarz, P.1
  • 23
    • 84875405186 scopus 로고    scopus 로고
    • Exploiting deep neural networks for detection-based speech recognition
    • Apr
    • S. M. Siniscalchi, D. Yu, L. Deng, and C.-H. Lee, "Exploiting deep neural networks for detection-based speech recognition," Neurocomput., vol. 106, pp. 148-157, Apr. 2013.
    • (2013) Neurocomput , vol.106 , pp. 148-157
    • Siniscalchi, S.M.1    Yu, D.2    Deng, L.3    Lee, C.-H.4
  • 24
    • 0028234947 scopus 로고
    • A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features
    • L. Deng and D. X. Sun, "A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features," J. Acoust. Soc. Amer., vol. 95, no. 5, pp. 2702-2719, 1994.
    • (1994) J. Acoust. Soc. Amer , vol.95 , Issue.5 , pp. 2702-2719
    • Deng, L.1    Sun, D.X.2
  • 26
    • 60749097551 scopus 로고    scopus 로고
    • Cambridge, U.K.: Cambridge Univ. Engineering Dept
    • S. J. Y. et al., The HTK Book, version 3.4. Cambridge, U.K.: Cambridge Univ. Engineering Dept., 2009.
    • (2009) The HTK Book, Version 3 , vol.4
  • 28
    • 0031222490 scopus 로고    scopus 로고
    • MMIE training of large vocabulary recognition systems
    • V. Valtchev, J. J. Odell, P. C. Woodland, and S. J. Young, "MMIE training of large vocabulary recognition systems," Speech Commun., vol. 22, no. 4, pp. 303-314, 1997.
    • (1997) Speech Commun , vol.22 , Issue.4 , pp. 303-314
    • Valtchev, V.1    Odell, J.J.2    Woodland, P.C.3    Young, S.J.4
  • 30
    • 84255177123 scopus 로고    scopus 로고
    • Deep and wide: Multiple layers in automatic speech recognition
    • Jan.
    • N. Morgan, "Deep and wide: Multiple layers in automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 7-13, Jan. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.1 , pp. 7-13
    • Morgan, N.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.