메뉴 건너뛰기




Volumn 15, Issue 3, 2001, Pages 233-255

Applying dynamic context into MLP/HMM speech recognition system

Author keywords

[No Author keywords available]

Indexed keywords

MARKOV PROCESSES; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION; SIGNAL TO NOISE RATIO; VECTORS;

EID: 0035412937     PISSN: 08852308     EISSN: None     Source Type: Journal    
DOI: 10.1006/csla.2001.0167     Document Type: Article
Times cited : (2)

References (42)
  • 3
    • 0027695851 scopus 로고
    • Continuous speech recognition by connectionist statistical methods
    • Bourlard, H. & Morgan, N. (1993). Continuous speech recognition by connectionist statistical methods. IEEE Transactions on Neural Networks, 4, 893-909.
    • (1993) IEEE Transactions on Neural Networks , vol.4 , pp. 893-909
    • Bourlard, H.1    Morgan, N.2
  • 5
    • 0000767590 scopus 로고    scopus 로고
    • Discriminant-function-based minimum recognition error rate pattern-recognition approach to speech recognition
    • Chou, W. (2000). Discriminant-function-based minimum recognition error rate pattern-recognition approach to speech recognition. Proceedings of the IEEE, 88, 1201-1223.
    • (2000) Proceedings of the IEEE , vol.88 , pp. 1201-1223
    • Chou, W.1
  • 8
    • 0028516022 scopus 로고
    • Speech recognition using hidden Markov models with polynomial regression function as nonstationary states
    • Deng, L., Askmanovic, M., Sun, X. & Wu, C. (1994). Speech recognition using hidden Markov models with polynomial regression function as nonstationary states. IEEE Transactions on Speech and Audio Processing, 2, 507-520.
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 507-520
    • Deng, L.1    Askmanovic, M.2    Sun, X.3    Wu, C.4
  • 11
    • 0022667694 scopus 로고
    • Speaker independent isolated word recognition using dynamic features of speech spectrum
    • Furui, S. (1986). Speaker independent isolated word recognition using dynamic features of speech spectrum. IEEE Transactions on Acoustics, Speech and Signal Processing, ASSP-34, 52-59.
    • (1986) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.ASSP-34 , pp. 52-59
    • Furui, S.1
  • 12
    • 0026203445 scopus 로고
    • Isolated-utterance speech recognition using hidden Markov models with bounded state durations
    • Gu, H., Tseng, C. & Lee, L. (1991). Isolated-utterance speech recognition using hidden Markov models with bounded state durations. IEEE Transactions on Signal Processing, 39, 1743-1752.
    • (1991) IEEE Transactions on Signal Processing , vol.39 , pp. 1743-1752
    • Gu, H.1    Tseng, C.2    Lee, L.3
  • 14
    • 0026944057 scopus 로고
    • A combined self-organizing feature map and multilayer perceptron for isolated word recognition
    • Huang, Z. & Kuh, A. (1992). A combined self-organizing feature map and multilayer perceptron for isolated word recognition. IEEE Transactions on Signal Processing, 40, 2651-2657.
    • (1992) IEEE Transactions on Signal Processing , vol.40 , pp. 2651-2657
    • Huang, Z.1    Kuh, A.2
  • 24
    • 0025557399 scopus 로고
    • Using self-organizing maps and multi-layered feed-forward nets to obtain phonemic transcription of spoken utterances
    • Kokkonen, M. & Torkkola, K. (1990). Using self-organizing maps and multi-layered feed-forward nets to obtain phonemic transcription of spoken utterances. Speech Communication, 9, 541-549.
    • (1990) Speech Communication , vol.9 , pp. 541-549
    • Kokkonen, M.1    Torkkola, K.2
  • 25
    • 0033556867 scopus 로고    scopus 로고
    • Hidden neural networks
    • Krogh, A. & Riis, S. (1999). Hidden neural networks. Neural Computation, 11, 541-563.
    • (1999) Neural Computation , vol.11 , pp. 541-563
    • Krogh, A.1    Riis, S.2
  • 27
    • 0029308753 scopus 로고
    • Neural networks for statistical recognition of continuous speech
    • Morgan, N. & Bourland, H. (1995). Neural networks for statistical recognition of continuous speech. Proceedings of the IEEE, 83, 741-770.
    • (1995) Proceedings of the IEEE , vol.83 , pp. 741-770
    • Morgan, N.1    Bourland, H.2
  • 30
    • 0012315045 scopus 로고    scopus 로고
    • From HMMs to segment models: Stochastic modelling for CSR
    • (C.-H. Lee, F. Soong and K. Paliwal, eds); Kluwer Academic Publishers, Norwell, MA, USA
    • Ostendorf, M. (1996). From HMMs to segment models: stochastic modelling for CSR. In Automatic Speech and Speaker Recognition (C.-H. Lee, F. Soong and K. Paliwal, eds), pp. 185-210. Kluwer Academic Publishers, Norwell, MA, USA.
    • (1996) Automatic Speech and Speaker Recognition , pp. 185-210
    • Ostendorf, M.1
  • 32
    • 0024610919 scopus 로고
    • Tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner, L. Tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77, 257-286.
    • (1989) Proceedings of the IEEE , vol.77 , pp. 257-286
    • Rabiner, L.1
  • 33
    • 0001595997 scopus 로고
    • Neural network classifiers estimate Bayesian a posteriori probabilities
    • Richard, M. & Lippmann, R. (1991). Neural network classifiers estimate Bayesian a posteriori probabilities. Neural Computation, 3, 461-483.
    • (1991) Neural Computation , vol.3 , pp. 461-483
    • Richard, M.1    Lippmann, R.2
  • 35
    • 0001592322 scopus 로고    scopus 로고
    • The use of recurrent neural networks in continuous speech recognition
    • (C.-H. Lee, F. Soong and K. Paliwal, eds); Kluwer Academic Publishers, Norwell, MA, USA
    • Robinson, T., Hochberg, M. & Renals, S. (1996). The use of recurrent neural networks in continuous speech recognition. In Automatic Speech and Speaker Recognition (C.-H. Lee, F. Soong and K. Paliwal, eds), pp. 233-258. Kluwer Academic Publishers, Norwell, MA, USA.
    • (1996) Automatic Speech and Speaker Recognition , pp. 233-258
    • Robinson, T.1    Hochberg, M.2    Renals, S.3
  • 37
  • 38
    • 85156213225 scopus 로고    scopus 로고
    • Forward-backward retraining of recurrent neural networks
    • (D. Touretzky, M. Mozer and M. Hasselmo, eds); The MIT Press, Massachusetts, USA
    • Senior, A. & Robinson, T. (1996). Forward-backward retraining of recurrent neural networks. In Advances in Neural Information Processing Systems 8 (D. Touretzky, M. Mozer and M. Hasselmo, eds), pp. 743-749. The MIT Press, Massachusetts, USA
    • (1996) Advances in Neural Information Processing Systems 8 , pp. 743-749
    • Senior, A.1    Robinson, T.2
  • 39
    • 0004080016 scopus 로고
    • Speech recognition using neural networks
    • PhD dissertation, School of Computer Science, Carnegie Mellon University, CMU-CS-95-142, Pittsburgh, Pennsylvania, USA
    • Tebelskis, J. (1995). Speech recognition using neural networks. PhD dissertation, School of Computer Science, Carnegie Mellon University, CMU-CS-95-142, Pittsburgh, Pennsylvania, USA.
    • (1995)
    • Tebelskis, J.1
  • 40
    • 0032141206 scopus 로고    scopus 로고
    • Cepstral domain segmental feature vector normalization for noise robust speech recognition
    • Viikki, O. & Laurila, K. (1998). Cepstral domain segmental feature vector normalization for noise robust speech recognition. Speech Communication, 25, 133-147.
    • (1998) Speech Communication , vol.25 , pp. 133-147
    • Viikki, O.1    Laurila, K.2
  • 41
  • 42
    • 0003901486 scopus 로고
    • Token passing: A simple conceptual model for connected speech recognition systems
    • Technical Report, University of Cambridge, Department of Engineering, Cambridge, England
    • Young, S., Russell, N. & Thornton, J. (1989). Token Passing: A Simple Conceptual Model for Connected Speech Recognition Systems. Technical Report, University of Cambridge, Department of Engineering, Cambridge, England.
    • (1989)
    • Young, S.1    Russell, N.2    Thornton, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.