SCOPUS Information Search Platform

Computer Speech and Language

Volume 15, Issue 3, 2001, Pages 233-255

Applying dynamic context into MLP/HMM speech recognition system

(1) Salmela, Petri a

a TAMPERE UNIVERSITY OF TECHNOLOGY (Finland)

Author keywords

[No Author keywords available]

Indexed keywords

MARKOV PROCESSES; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION; SIGNAL TO NOISE RATIO; VECTORS;

FEATURE VECTORS; HIDDEN MARKOV MODELS; LINEAR DISCRIMINANT ANALYSIS;

SPEECH RECOGNITION;

EID: 0035412937 PISSN: 08852308 EISSN: None Source Type: Journal
DOI: 10.1006/csla.2001.0167 Document Type: Article

Times cited : (2)

References (42)

1
- 0003487601
- Oxford University Press, New York, USA
- Bishop, C. (1996). Neural Networks for Pattern Recognition. Oxford University Press, New York, USA.
- (1996) Neural Networks for Pattern Recognition
- Bishop, C.¹

2
- 0025547193
- Links between Markov models and multilayer perceptrons
- Bourlard, H. & Wellekens, C. (1990). Links between Markov models and multilayer perceptrons. IEEE Transactions on Pattern Analysis and Machine Intelligence, 12, 1167-1178.
- (1990) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.12 , pp. 1167-1178
- Bourlard, H.¹ Wellekens, C.²

3
- 0027695851
- Continuous speech recognition by connectionist statistical methods
- Bourlard, H. & Morgan, N. (1993). Continuous speech recognition by connectionist statistical methods. IEEE Transactions on Neural Networks, 4, 893-909.
- (1993) IEEE Transactions on Neural Networks , vol.4 , pp. 893-909
- Bourlard, H.¹ Morgan, N.²

4
- 85079089741
- Optimizing recognition and rejection performance in wordspotting systems
- Bourlard, H., D'hoore, B. & Boite, J. (1994). Optimizing recognition and rejection performance in wordspotting systems. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Adelaide, Australia, volume 1, pp. 373-376.
- (1994) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Adelaide, Australia , vol.1 , pp. 373-376
- Bourlard, H.¹ D'Hoore, B.² Boite, J.³

5
- 0000767590
- Discriminant-function-based minimum recognition error rate pattern-recognition approach to speech recognition
- Chou, W. (2000). Discriminant-function-based minimum recognition error rate pattern-recognition approach to speech recognition. Proceedings of the IEEE, 88, 1201-1223.
- (2000) Proceedings of the IEEE , vol.88 , pp. 1201-1223
- Chou, W.¹

6
- 84892178304
- Transcribing broadcast news with the 1997 Abbot system
- Cook, G. & Robinson, T. (1998). Transcribing broadcast news with the 1997 Abbot system. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Seattle, USA, volume 2, pp. 917-920.
- (1998) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Seattle, USA , vol.2 , pp. 917-920
- Cook, G.¹ Robinson, T.²

7
- 0003424145
- Macmillan Publishing Company, New York, USA
- Deller, J., Proakis, G. & Hansen, J. (1993). Discrete-Time Processing of Speech Signals. Macmillan Publishing Company, New York, USA.
- (1993) Discrete-Time Processing of Speech Signals
- Deller, J.¹ Proakis, G.² Hansen, J.³

8
- 0028516022
- Speech recognition using hidden Markov models with polynomial regression function as nonstationary states
- Deng, L., Aksmanovic, M., Sun, X. & Wu, C. (1994). Speech recognition using hidden Markov models with polynomial regression function as nonstationary states. IEEE Transactions on Speech and Audio Processing, 2, 507-520.
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 507-520
- Deng, L.¹ Aksmanovic, M.² Sun, X.³ Wu, C.⁴

9
- 0028204660
- Combining TDNN and HMM in a hybrid system for improved continuous-speech recognition
- Dugast, C., Devillers, L. & Aubert, X. (1994). Combining TDNN and HMM in a hybrid system for improved continuous-speech recognition. IEEE Transactions on Speech and Audio Processing, 2, 217-223.
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 217-223
- Dugast, C.¹ Devillers, L.² Aubert, X.³

10
- 0030685510
- Context modeling in a hybrid HMM-neural net speech recognition system
- Franco, H., Weintraub, M. & Cohen, M. (1997). Context modeling in a hybrid HMM-neural net speech recognition system. Proceedings of the IEEE International Conference on Neural Networks, Houston, Texas, USA, pp. 2089-2092.
- (1997) Proceedings of the IEEE International Conference on Neural Networks, Houston, Texas, USA , pp. 2089-2092
- Franco, H.¹ Weintraub, M.² Cohen, M.³

11
- 0022667694
- Speaker independent isolated word recognition using dynamic features of speech spectrum
- Furui, S. (1986). Speaker independent isolated word recognition using dynamic features of speech spectrum. IEEE Transactions on Acoustics, Speech and Signal Processing, ASSP-34, 52-59.
- (1986) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.ASSP-34 , pp. 52-59
- Furui, S.¹

12
- 0026203445
- Isolated-utterance speech recognition using hidden Markov models with bounded state durations
- Gu, H., Tseng, C. & Lee, L. (1991). Isolated-utterance speech recognition using hidden Markov models with bounded state durations. IEEE Transactions on Signal Processing, 39, 1743-1752.
- (1991) IEEE Transactions on Signal Processing , vol.39 , pp. 1743-1752
- Gu, H.¹ Tseng, C.² Lee, L.³

13
- 0027239233
- Improvements in connected digit recognition using linear discriminant analysis and mixture densities
- Haeb-Umbach, R., Geller, D. & Ney, H. (1993). Improvements in connected digit recognition using linear discriminant analysis and mixture densities. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Minneapolis, MN, USA, pp. 239-242.
- (1993) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Minneapolis, MN, USA , pp. 239-242
- Haeb-Umbach, R.¹ Geller, D.² Ney, H.³

14
- 0026944057
- A combined self-organizing feature map and multilayer perceptron for isolated word recognition
- Huang, Z. & Kuh, A. (1992). A combined self-organizing feature map and multilayer perceptron for isolated word recognition. IEEE Transactions on Signal Processing, 40, 2651-2657.
- (1992) IEEE Transactions on Signal Processing , vol.40 , pp. 2651-2657
- Huang, Z.¹ Kuh, A.²

15
- 0012314213
- Optimal adaptive garbage modelling in speech recognition
- Iso-Sipilä, J., Laurila, K. & Haavisto, P. (1996). Optimal adaptive garbage modelling in speech recognition. Proceedings of IEEE Nordic Signal Processing Symposium, Espoo, Finland, pp. 107-110.
- (1996) Proceedings of IEEE Nordic Signal Processing Symposium, Espoo, Finland , pp. 107-110
- Iso-Sipilä, J.¹ Laurila, K.² Haavisto, P.³

16
- 0003786003
- The MIT Press, Massachusetts, USA
- Jelinek, F. (1998). Statistical Methods for Speech Recognition. The MIT Press, Massachusetts, USA.
- (1998) Statistical Methods for Speech Recognition
- Jelinek, F.¹

17
- 0003675176
- Prentice Hall, New Jersey, USA
- Johnson, R. & Wichern, D. (1992). Applied Multivariate Statistical Analysis. Prentice Hall, New Jersey, USA.
- (1992) Applied Multivariate Statistical Analysis
- Johnson, R.¹ Wichern, D.²

18
- 0031139839
- Minimum error rate methods for speech recognition
- Juang, B.-H., Chou, W. & Lee, C.-H. (1997). Minimum error rate methods for speech recognition. IEEE Transactions on Speech and Audio Processing, 5, 257-265.
- (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , pp. 257-265
- Juang, B.-H.¹ Chou, W.² Lee, C.-H.³

19
- 0025567349
- Time-delayed self-organizing maps
- Kangas, J. (1990). Time-delayed self-organizing maps. Proceedings of the International Joint Conference on Neural Networks, San Diego, California, USA, pp. 331-336.
- (1990) Proceedings of the International Joint Conference on Neural Networks, San Diego, California, USA , pp. 331-336
- Kangas, J.¹

20
- 0012268434
- Time-dependent self-organizing maps for speech recognition
- Kangas, J. (1991). Time-dependent self-organizing maps for speech recognition. Proceedings of the 1991 International Conference on Artificial Neural Networks, Espoo, Finland, pp. 1591-1594.
- (1991) Proceedings of the 1991 International Conference on Artificial Neural Networks, Espoo, Finland , pp. 1591-1594
- Kangas, J.¹

21
- 0026271562
- New discriminative training algorithms based on the generalized probabilistic descent method
- Katagiri, S., Lee, C.-H. & Juang, B.-H. (1991). New discriminative training algorithms based on the generalized probabilistic descent method. Proceedings of the 1991 IEEE Workshop on Neural Networks for Signal Processing, Princeton, New Jersey, USA, pp. 299-308.
- (1991) Proceedings of the 1991 IEEE Workshop on Neural Networks for Signal Processing, Princeton, New Jersey, USA , pp. 299-308
- Katagiri, S.¹ Lee, C.-H.² Juang, B.-H.³

22
- 0024881459
- Investigation of phonemic context in speech using self-organizing feature maps
- Kepuska, V. & Gowdy, J. (1989). Investigation of phonemic context in speech using self-organizing feature maps. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Glasgow, Scotland, pp. 504-507.
- (1989) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Glasgow, Scotland , pp. 504-507
- Kepuska, V.¹ Gowdy, J.²

23
- 0003410791
- Springer-Verlag, New York, USA
- Kohonen, T. (2001). Self-Organizing Maps. 3rd edition. Springer-Verlag, New York, USA.
- (2001) Self-Organizing Maps. 3rd Edition
- Kohonen, T.¹

24
- 0025557399
- Using self-organizing maps and multi-layered feed-forward nets to obtain phonemic transcription of spoken utterances
- Kokkonen, M. & Torkkola, K. (1990). Using self-organizing maps and multi-layered feed-forward nets to obtain phonemic transcription of spoken utterances. Speech Communication, 9, 541-549.
- (1990) Speech Communication , vol.9 , pp. 541-549
- Kokkonen, M.¹ Torkkola, K.²

25
- 0033556867
- Hidden neural networks
- Krogh, A. & Riis, S. (1999). Hidden neural networks. Neural Computation, 11, 541-563.
- (1999) Neural Computation , vol.11 , pp. 541-563
- Krogh, A.¹ Riis, S.²

26
- 0030701379
- Noise robust speech recognition with state duration constraints
- Laurila, K. (1997). Noise robust speech recognition with state duration constraints. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Munich, Germany, pp. 871-874.
- (1997) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Munich, Germany , pp. 871-874
- Laurila, K.¹

27
- 0029308753
- Neural networks for statistical recognition of continuous speech
- Morgan, N. & Bourlard, H. (1995). Neural networks for statistical recognition of continuous speech. Proceedings of the IEEE, 83, 741-770.
- (1995) Proceedings of the IEEE , vol.83 , pp. 741-770
- Morgan, N.¹ Bourlard, H.²

28
- 0842348476
- Reducing errors by increasing the error rate: MLP acoustic modelling for broadcast news transcription
- Morgan, N., Ellis, D., Fosler-Lussier, E., Janin, A. & Kingsbury, B. (1999). Reducing errors by increasing the error rate: MLP acoustic modelling for broadcast news transcription. Proceedings of the DARPA Broadcast News Workshop, Herndon, Virginia.
- (1999) Proceedings of the DARPA Broadcast News Workshop, Herndon, Virginia
- Morgan, N.¹ Ellis, D.² Fosler-Lussier, E.³ Janin, A.⁴ Kingsbury, B.⁵

29
- 0025536870
- Improving the learning speed of 2-layer neural networks by choosing initial values of the adaptive weights
- Nguyen, D. & Widrow, B. (1990). Improving the learning speed of 2-layer neural networks by choosing initial values of the adaptive weights. Proceedings of International Joint Conference of Neural Networks, San Diego, California, USA, pp. 21-26.
- (1990) Proceedings of International Joint Conference of Neural Networks, San Diego, California, USA , pp. 21-26
- Nguyen, D.¹ Widrow, B.²

30
- 0012315045
- From HMMs to segment models: Stochastic modelling for CSR
- (C.-H. Lee, F. Soong and K. Paliwal, eds); Kluwer Academic Publishers, Norwell, MA, USA
- Ostendorf, M. (1996). From HMMs to segment models: stochastic modelling for CSR. In Automatic Speech and Speaker Recognition (C.-H. Lee, F. Soong and K. Paliwal, eds), pp. 185-210. Kluwer Academic Publishers, Norwell, MA, USA.
- (1996) Automatic Speech and Speaker Recognition , pp. 185-210
- Ostendorf, M.¹

31
- 84912899544
- Integrated phoneme-function word architecture of hidden control neural networks for continuous speech recognition
- Petek, B., Waibel, A. & Tebelskis, J. (1991). Integrated phoneme-function word architecture of hidden control neural networks for continuous speech recognition. Proceedings of the European Conference on Speech Communication and Technology, Genova, Italy, pp. 1407-1410.
- (1991) Proceedings of the European Conference on Speech Communication and Technology, Genova, Italy , pp. 1407-1410
- Petek, B.¹ Waibel, A.² Tebelskis, J.³

32
- 0024610919
- Tutorial on hidden Markov models and selected applications in speech recognition
- Rabiner, L. (1989). Tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77, 257-286.
- (1989) Proceedings of the IEEE , vol.77 , pp. 257-286
- Rabiner, L.¹

33
- 0001595997
- Neural network classifiers estimate Bayesian a posteriori probabilities
- Richard, M. & Lippmann, R. (1991). Neural network classifiers estimate Bayesian a posteriori probabilities. Neural Computation, 3, 461-483.
- (1991) Neural Computation , vol.3 , pp. 461-483
- Richard, M.¹ Lippmann, R.²

34
- 85079097438
- IPA: Improved phone modelling with recurrent neural networks
- Robinson, T., Hochberg, M. & Renals, S. (1994). IPA: improved phone modelling with recurrent neural networks. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Adelaide, Australia, pp. 37-40.
- (1994) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Adelaide, Australia , pp. 37-40
- Robinson, T.¹ Hochberg, M.² Renals, S.³

35
- 0001592322
- The use of recurrent neural networks in continuous speech recognition
- (C.-H. Lee, F. Soong and K. Paliwal, eds); Kluwer Academic Publishers, Norwell, MA, USA
- Robinson, T., Hochberg, M. & Renals, S. (1996). The use of recurrent neural networks in continuous speech recognition. In Automatic Speech and Speaker Recognition (C.-H. Lee, F. Soong and K. Paliwal, eds), pp. 233-258. Kluwer Academic Publishers, Norwell, MA, USA.
- (1996) Automatic Speech and Speaker Recognition , pp. 233-258
- Robinson, T.¹ Hochberg, M.² Renals, S.³

36
- 0012259840
- On string level training in MLP/HMM speech recognition system
- Salmela, P., Laurila, K., Lehtokangas, M. & Saarinen, J. (1999a). On string level training in MLP/HMM speech recognition system. Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, Tokyo, Japan, pp. 165-171.
- (1999) Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, Tokyo, Japan , pp. 165-171
- Salmela, P.¹ Laurila, K.² Lehtokangas, M.³ Saarinen, J.⁴

37
- 0033358230
- Neural network based digit recognition system for voice dialling in noisy environments
- Salmela, P., Lehtokangas, M. & Saarinen, J. (1999b). Neural network based digit recognition system for voice dialling in noisy environments. International Journal of Information Sciences, 121, 171-199.
- (1999) International Journal of Information Sciences , vol.121 , pp. 171-199
- Salmela, P.¹ Lehtokangas, M.² Saarinen, J.³

38
- 85156213225
- Forward-backward retraining of recurrent neural networks
- (D. Touretzky, M. Mozer and M. Hasselmo, eds); The MIT Press, Massachusetts, USA
- Senior, A. & Robinson, T. (1996). Forward-backward retraining of recurrent neural networks. In Advances in Neural Information Processing Systems 8 (D. Touretzky, M. Mozer and M. Hasselmo, eds), pp. 743-749. The MIT Press, Massachusetts, USA.
- (1996) Advances in Neural Information Processing Systems 8 , pp. 743-749
- Senior, A.¹ Robinson, T.²

39
- 0004080016
- Speech recognition using neural networks
- PhD dissertation, School of Computer Science, Carnegie Mellon University, CMU-CS-95-142, Pittsburgh, Pennsylvania, USA
- Tebelskis, J. (1995). Speech recognition using neural networks. PhD dissertation, School of Computer Science, Carnegie Mellon University, CMU-CS-95-142, Pittsburgh, Pennsylvania, USA.
- (1995)
- Tebelskis, J.¹

40
- 0032141206
- Cepstral domain segmental feature vector normalization for noise robust speech recognition
- Viikki, O. & Laurila, K. (1998). Cepstral domain segmental feature vector normalization for noise robust speech recognition. Speech Communication, 25, 133-147.
- (1998) Speech Communication , vol.25 , pp. 133-147
- Viikki, O.¹ Laurila, K.²

41
- 34250094997
- Accelerating the convergence of the back-propagation method
- Vogl, T., Mangis, J., Rigler, A., Zink, W. & Alkon, D. (1988). Accelerating the convergence of the back-propagation method. Biological Cybernetics, 59, 257-263.
- (1988) Biological Cybernetics , vol.59 , pp. 257-263
- Vogl, T.¹ Mangis, J.² Rigler, A.³ Zink, W.⁴ Alkon, D.⁵

42
- 0003901486
- Token passing: A simple conceptual model for connected speech recognition systems
- Technical Report, University of Cambridge, Department of Engineering, Cambridge, England
- Young, S., Russell, N. & Thornton, J. (1989). Token Passing: A Simple Conceptual Model for Connected Speech Recognition Systems. Technical Report, University of Cambridge, Department of Engineering, Cambridge, England.
- (1989)
- Young, S.¹ Russell, N.² Thornton, J.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.