메뉴 건너뛰기




Volumn 4, Issue 6, 1993, Pages 893-909

Continuous Speech Recognition by Connectionist Statistical Methods

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING SYSTEMS; NEURAL NETWORKS; SPECTRUM ANALYSIS;

EID: 0027695851     PISSN: 10459227     EISSN: 19410093     Source Type: Journal    
DOI: 10.1109/72.286885     Document Type: Article
Times cited : (75)

References (72)
  • 1
    • 0000920843 scopus 로고
    • A theory of adaptive pattern classifiers
    • S. I. Amari, “A theory of adaptive pattern classifiers,” IEEE Trans. Elec. Commun., vol. EC-16, pp. 279–307, 1967.
    • (1967) IEEE Trans. Elec. Commun. , vol.EC-16 , pp. 279-307
    • Amari, S.I.1
  • 3
    • 0020719320 scopus 로고
    • A maximum likelihood approach to continuous speech recognition
    • Mar.
    • L. R. Bahl, F. Jelinek, and R. Mercer, “A maximum likelihood approach to continuous speech recognition,” IEEE Trans. Pattern Anal. Machine Intell., vol. PAMI-5, no. 2, pp. 179–190, Mar. 1983.
    • (1983) IEEE Trans. Pattern Anal. Machine Intell , vol.PAMI-5 , Issue.2 , pp. 179-190
    • Bahl, L.R.1    Jelinek, F.2    Mercer, R.3
  • 4
    • 0001862769 scopus 로고
    • An inequality and associated maximization technique in statistical estimation of probabilistic functions of Markov processes
    • L. E. Baum, “An inequality and associated maximization technique in statistical estimation of probabilistic functions of Markov processes,” Inequalities, vol. 3, pp. 1–8, 1972.
    • (1972) Inequalities , vol.3 , pp. 1-8
    • Baum, L.E.1
  • 5
    • 0042945288 scopus 로고
    • Une approche théorique de l'apprentissage connexioniste: Applications à la reconnaissance de la parole
    • L. Bottou, “Une approche theorique de l’apprentissage connexioniste: Applications a la reconnaissance de la parole,” Ph.D. dissertation, Univ. Paris Sud, Centre d'Orsay, 1991.
    • (1991) Ph.D. dissertation, Univ. Paris Sud, Centre d'Orsay
    • Bottou, L.1
  • 6
    • 0038338085 scopus 로고
    • A continuous speech recognition system embedding MLP into HMM
    • D. Touretzky, Ed. San Mateo, CA: Morgan Kaufmann
    • H. Bourlard and N. Morgan, “A continuous speech recognition system embedding MLP into HMM,” in Advances in Neural Information Processing Systems 2, D. Touretzky, Ed. San Mateo, CA: Morgan Kaufmann, 1990, pp. 186–193.
    • (1990) Advances in Neural Information Processing Systems 2 , pp. 186-193
    • Bourlard, H.1    Morgan, N.2
  • 7
    • 0011916896 scopus 로고
    • Merging multilayer perceptrons and hidden Markov models: Some experiments in continuous speech recognition
    • E. Gelenbe, Ed. Amsterdam, The Netherlands: Elsevier
    • H. Bourlard and N. Morgan, “Merging multilayer perceptrons and hidden Markov models: Some experiments in continuous speech recognition,” in Neural Networks: Advances and Applications, E. Gelenbe, Ed. Amsterdam, The Netherlands: Elsevier, 1991.
    • (1991) Neural Networks: Advances and Applications
    • Bourlard, H.1    Morgan, N.2
  • 9
    • 0001373629 scopus 로고
    • Links between Markov models and multilayer perceptrons
    • D. Touretzky, Ed. San Mateo, CA: Morgan Kaufmann
    • H. Bourlard and C. J. Wellekens, “Links between Markov models and multilayer perceptrons,” in Advances in Neural Information Processing 1, D. Touretzky, Ed. San Mateo, CA: Morgan Kaufmann, 1989, pp. 502–510.
    • (1989) Advances in Neural Information Processing 1 , pp. 502-510
    • Bourlard, H.1    Wellekens, C.J.2
  • 10
    • 0002493084 scopus 로고
    • Speech pattern discrimination and multilayer perceptrons
    • H. Bourlard and C. J. Wellekens, “Speech pattern discrimination and multilayer perceptrons,” Computer, Speech and Language, vol. 3, pp. 1–19, 1989.
    • (1989) Computer, Speech and Language , vol.3 , pp. 1-19
    • Bourlard, H.1    Wellekens, C.J.2
  • 11
    • 0025547193 scopus 로고
    • Links between Markov models and multilayer perceptrons
    • Dec.
    • H. Bourlard and C. J. Wellekens, “Links between Markov models and multilayer perceptrons,” IEEE Trans. Pattern Anal. Machine Intell., vol. 12, no. 12, pp. 1167–1178, Dec. 1990.
    • (1990) IEEE Trans. Pattern Anal. Machine Intell , vol.12 , Issue.12 , pp. 1167-1178
    • Bourlard, H.1    Wellekens, C.J.2
  • 12
    • 0025385598 scopus 로고
    • Alphanets: A recurrent “neural” network architecture with a hidden Markov interpretation
    • Feb.
    • J. Bridle, “Alpha-nets: A recurrent “neural” network architecture with a hidden Markov interpretation,” Speech Commun., vol. 9, no. 1, pp. 83–92, Feb. 1990.
    • (1990) Speech Commun. , vol.9 , Issue.1 , pp. 83-92
    • Bridle, J.1
  • 13
  • 16
    • 0002629270 scopus 로고
    • Maximum likelihood estimation from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, “Maximum likelihood estimation from incomplete data via the EM algorithm,” J. Roy. Statist. Soc., vol. 39(B), pp. 1–38, 1977.
    • (1977) J. Roy. Statist. Soc. , vol.39(B) , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 18
    • 0025594074 scopus 로고
    • Connectionist Viterbi training: A new hybrid method for continuous speech recognition
    • Albuquerque, NM, Apr.
    • M. A. Franzini, K. F. Lee, and A. Waibel, “Connectionist Viterbi training: A new hybrid method for continuous speech recognition,” in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, Albuquerque, NM, Apr. 1990, pp. 425-428.
    • (1990) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 425-428
    • Franzini, M.A.1    Lee, K.F.2    Waibel, A.3
  • 19
    • 0021122328 scopus 로고
    • Ignorance-based systems
    • San Diego, CA
    • A. Gevins and N. Morgan, “Ignorance-based systems,” in IEEE Proc. 1985 Int. Conf.ASSP, vol. 3, San Diego, CA, 1985, pp. 39A5.1-39A5.4.
    • (1985) IEEE Proc. 1985 Int. Conf. ASSP , pp. 39A5.1-39A5.4
    • Gevins, A.1    Morgan, N.2
  • 20
    • 0025671510 scopus 로고
    • A probabilistic approach to the understanding and training of neural network classifiers
    • Albuquerque, NM, Apr.
    • H. Gish, “A probabilistic approach to the understanding and training of neural network classifiers,” in IEEE Proc. 1990 Int. Conf. ASSP, vol. 3, Albuquerque, NM, Apr. 1990, pp. 1361–1364.
    • (1990) IEEE Proc. 1990 Int. Conf. ASSP , vol.3 , pp. 1361-1364
    • Gish, H.1
  • 21
    • 0026385261 scopus 로고
    • Integrating time alignment and neural networks for high performance continuous speech recognition
    • Toronto, Canada
    • P. Haffner, M. Franzini, and A. Waibel, “Integrating time alignment and neural networks for high performance continuous speech recognition,” in IEEE Proc. 1991 Int. Conf. ASSP, vol. 1, Toronto, Canada, 1991, pp. 105–108.
    • (1991) IEEE Proc. 1991 Int. Conf. ASSP , vol.1 , pp. 105-108
    • Haffner, P.1    Franzini, M.2    Waibel, A.3
  • 22
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • Apr.
    • H. Hermansky, “Perceptual linear predictive (PLP) analysis of speech,” J. Acoust. Soc. Amer., vol. 87, no. 4, Apr. 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4
    • Hermansky, H.1
  • 24
    • 0016939124 scopus 로고
    • Continuous speech recognition by statistical methods
    • F. Jelinek, “Continuous speech recognition by statistical methods,” Proc. IEEE, vol. 64, no. 4, pp. 532–555, 1976.
    • (1976) Proc. IEEE , vol.64 , Issue.4 , pp. 532-555
    • Jelinek, F.1
  • 25
    • 0004121079 scopus 로고
    • Serial order: A parallel distributed processing approach
    • M. L. Jordan, “Serial order: A parallel distributed processing approach,” Tech. Rep. 8604, UCSD, 1986.
    • (1986) Tech. Rep. 8604, UCSD
    • Jordan, M.L.1
  • 26
    • 85069971499 scopus 로고    scopus 로고
    • GDNN: A gender-dependent neural network for continuous speech recognition
    • Y. Konig and N. Morgan, “GDNN: A gender-dependent neural network for continuous speech recognition,” in Proc. IJCNN ' 92, pp. 332–337.
    • Proc. IJCNN '92 , pp. 332-337
    • Konig, Y.1    Morgan, N.2
  • 30
    • 0025419316 scopus 로고
    • Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition
    • K. F. Lee, “Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition,” IEEE Trans. Acoust., Speech, Signal Processing, vol. 38, no. 4, pp. 599–609, 1990.
    • (1990) IEEE Trans. Acoust., Speech, Signal Processing , vol.38 , Issue.4 , pp. 599-609
    • Lee, K.F.1
  • 31
    • 84912916870 scopus 로고
    • Speech recognition using hidden control neural network architecture
    • Albuquerque, NM
    • E. Levin, “Speech recognition using hidden control neural network architecture,” in IEEE Proc. 1990 Int. Conf. ASSP, Albuquerque, NM, 1990.
    • (1990) IEEE Proc. 1990 Int. Conf. ASSP
    • Levin, E.1
  • 32
    • 0020734214 scopus 로고
    • An introduction to the theory of application of probabilistic functions on a Markov process to automatic speech recognition
    • Apr.
    • S. E. Levinson, L. R. Rabiner, and M. M. Sondhi, “An introduction to the theory of application of probabilistic functions on a Markov process to automatic speech recognition,” Bell Syst. Tech. J., vol. 62, no. 4, Apr. 1983.
    • (1983) Bell Syst. Tech. J , vol.62 , Issue.4
    • Levinson, S.E.1    Rabiner, L.R.2    Sondhi, M.M.3
  • 33
    • 84912571324 scopus 로고
    • Neural classifiers useful for speech recognition
    • San Diego, CA
    • R. P. Lippmann and B. Gold, “Neural classifiers useful for speech recognition,” in Proc. First Int. Conf. Neural Networks, San Diego, CA, 1987, pp. IV-417.
    • (1987) Proc. First Int. Conf. Neural Networks , pp. IV-417
    • Lippmann, R.P.1    Gold, B.2
  • 34
    • 0023331258 scopus 로고
    • An introduction to computing with neural nets
    • R. P. Lippmann, “An introduction to computing with neural nets,” IEEE ASSP Mag., vol. 3, pp. 4–22, 1987.
    • (1987) IEEE ASSP Mag , vol.3 , pp. 4-22
    • Lippmann, R.P.1
  • 35
    • 84942484648 scopus 로고
    • Continuous speech recognition on the resource management database using connectionist probability estimation
    • Kobe, Japan, Nov.
    • N. Morgan, C. Wooters, H. Bourlard, and M. Cohen, “Continuous speech recognition on the resource management database using connectionist probability estimation,” in Proc. Int. Conf Spoken Language Processing, Kobe, Japan, Nov. 1990, pp. 31.1.1-31.1.4.
    • (1990) Proc. Int. Conf Spoken Language Processing , pp. 31.1.1-31.1.4
    • Morgan, N.1    Wooters, C.2    Bourlard, H.3    Cohen, M.4
  • 36
    • 0025659256 scopus 로고
    • Continuous speech recognition using multilayer perceptrons with hidden Markov models
    • Albuquerque, NM, Apr.
    • N. Morgan and H. Bourlard, “Continuous speech recognition using multilayer perceptrons with hidden Markov models,” in IEEE Proc. 1990 Int. Conf. ASSP, Albuquerque, NM, Apr. 1990, pp. 413-416.
    • (1990) IEEE Proc. 1990 Int. Conf. ASSP , pp. 413-416
    • Morgan, N.1    Bourlard, H.2
  • 38
    • 0026384344 scopus 로고
    • Continuous speech recognition using PLP analysis with multilayer perceptrons
    • Toronto, Canada, May
    • N. Morgan, H. Hermansky, H. Bourlard, P. Kohn, and C. Wooters, “Continuous speech recognition using PLP analysis with multilayer perceptrons,” in IEEE Proc. 1991 Int. Conf ASSP, Toronto, Canada, May 1991, pp. 49–52.
    • (1991) IEEE Proc. 1991 Int. Conf ASSP , pp. 49-52
    • Morgan, N.1    Hermansky, H.2    Bourlard, H.3    Kohn, P.4    Wooters, C.5
  • 40
    • 0023833734 scopus 로고
    • 1000-word speaker-independent continuous-speech recognition using hidden Markov models
    • New York
    • H. Murveit and M. Weintraub, “1000-word speaker-independent continuous-speech recognition using hidden Markov models,” in IEEE Proc. 1988 Int. Conf. ASSP, vol. 1, New York, 1988, pp. 115–118.
    • (1988) IEEE Proc. 1988 Int. Conf. ASSP , vol.1 , pp. 115-118
    • Murveit, H.1    Weintraub, M.2
  • 44
    • 0021406359 scopus 로고
    • The use of one-stage dynamic programming algorithm for connected word recognition
    • H. Ney, “The use of one-stage dynamic programming algorithm for connected word recognition,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-32, pp. 263–272, 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-32 , pp. 263-272
    • Ney, H.1
  • 45
    • 0026382113 scopus 로고
    • Speech recognition in a neural network framework: Discriminative training of Gaussian models and mixture densities as radial basis functions
    • Toronto, Canada
    • H. Ney, “Speech recognition in a neural network framework: Discriminative training of Gaussian models and mixture densities as radial basis functions,” in IEEE Proc. 1991 Int. Conf. ASSP, vol. 1, Toronto, Canada, 1991, pp. 573–576.
    • (1991) IEEE Proc. 1991 Int. Conf. ASSP , vol.1 , pp. 573-576
    • Ney, H.1
  • 46
    • 0023671791 scopus 로고
    • Phoneme modeling using continuous mixture densities
    • New York
    • H. Ney and A. Noll, “Phoneme modeling using continuous mixture densities,” in IEEE Proc. 1990 Int. Conf. ASSP, vol. 1, New York, 1988, pp. 437–440.
    • (1988) IEEE Proc. 1990 Int. Conf. ASSP , vol.1 , pp. 437-440
    • Ney, H.1    Noll, A.2
  • 47
    • 0024899341 scopus 로고
    • How limited training data can allow a neural network classifier to outperform an 'optimal' statistical classifier
    • Glasgow, Scotland
    • L. Niles, H. Silverman, G. Tajchman, and M. Bush, “How limited training data can allow a neural network classifier to outperform an ‘optimal’ statistical classifier,” in IEEE Proc. 1989 Int. Conf. ASSP, vol. 1, Glasgow, Scotland, 1989, pp. 17–20.
    • (1989) IEEE Proc. 1989 Int. Conf. ASSP , vol.1 , pp. 17-20
    • Niles, L.1    Silverman, H.2    Tajchman, G.3    Bush, M.4
  • 48
    • 0025592386 scopus 로고
    • Combining hidden Markov models and neural network classifiers
    • Albuquerque, NM, Apr.
    • L. T. Niles and H. F. Silverman, “Combining hidden Markov models and neural network classifiers,” in IEEE Proc. 1990 Int. Conf ASSP, Albuquerque, NM, Apr. 1990, pp. 417–420.
    • (1990) IEEE Proc. 1990 Int. Conf ASSP , pp. 417-420
    • Niles, L.T.1    Silverman, H.F.2
  • 51
    • 0024899363 scopus 로고
    • The Lincoln robust continuous speech recognizer
    • Glasgow, Scotland, May
    • D. B. Paul, “The Lincoln robust continuous speech recognizer,” in IEEE Proc. 1989 Int. Conf. ASSP, Glasgow, Scotland, May 1989.
    • (1989) IEEE Proc. 1989 Int. Conf. ASSP
    • Paul, D.B.1
  • 52
    • 0026401049 scopus 로고
    • The Lincoln tied-mixture HMM continuous speech recognizer
    • Toronto, Canada
    • D. B. Paul, “The Lincoln tied-mixture HMM continuous speech recognizer,” in IEEE Proc. 1991 Int. Conf. ASSP, Toronto, Canada, 1991, pp. 329–-332.
    • (1991) IEEE Proc. 1991 Int. Conf. ASSP , pp. 329-332
    • Paul, D.B.1
  • 53
    • 0026400228 scopus 로고
    • On the interaction between true source, training, and testing language models
    • Toronto, Canada
    • D. B. Paul, J. K. Baker, and J. M. Baker, “On the interaction between true source, training, and testing language models,” in IEEE Proc. 1991 Int. Conf ASSP, Toronto, Canada, 1991, pp. 569–572.
    • (1991) IEEE Proc. 1991 Int. Conf ASSP , pp. 569-572
    • Paul, D.B.1    Baker, J.K.2    Baker, J.M.3
  • 54
    • 0024135866 scopus 로고
    • Isolated digit recognition experiments using the multilayer perceptron
    • S. M. Peeling and R. K. Moore, “Isolated digit recognition experiments using the multilayer perceptron,” Speech Commun., vol. 7, pp. 403–409, 1988.
    • (1988) Speech Commun. , vol.7 , pp. 403-409
    • Peeling, S.M.1    Moore, R.K.2
  • 55
    • 0025490985 scopus 로고
    • Networks for approximation and learning
    • T. Poggio and F. Girosi, “Networks for approximation and learning,” Proc. IEEE, vol. 78, no. 9, pp. 1481–1497, 1989.
    • (1989) Proc. IEEE , vol.78 , Issue.9 , pp. 1481-1497
    • Poggio, T.1    Girosi, F.2
  • 57
    • 84941603741 scopus 로고
    • Connectionist optimization of tied mixture hidden Markov models
    • R. P. Lippmann, J. E. Moody, and D. S. Touretzky, Eds. San Mateo, CA: Morgan Kaufmann
    • S. Renals, N. Morgan, H. Bourlard, H. Franco, and M. Cohen, “Connectionist optimization of tied mixture hidden Markov models,” in Advances in Neural Information Processing Systems 4, R. P. Lippmann, J. E. Moody, and D. S. Touretzky, Eds. San Mateo, CA: Morgan Kaufmann, 1992.
    • (1992) Advances in Neural Information Processing Systems 4
    • Renals, S.1    Morgan, N.2    Bourlard, H.3    Franco, H.4    Cohen, M.5
  • 59
    • 0000329355 scopus 로고
    • A recurrent error propagation network speech recognition system
    • T. Robinson and F. Fallside, “A recurrent error propagation network speech recognition system,” Comput., Speech, Language, 1991.
    • (1991) Comput., Speech, Language
    • Robinson, T.1    Fallside, F.2
  • 63
    • 0025206332 scopus 로고
    • Probabilistic neural networks
    • D. F. Specht, “Probabilistic neural networks,” Neural Networks, vol. 3, no. 1, pp. 109–118, 1990.
    • (1990) Neural Networks , vol.3 , Issue.1 , pp. 109-118
    • Specht, D.F.1
  • 64
    • 84942395924 scopus 로고
    • A flexible VLSI 60,000 word real-time continuous speech recognition system
    • H. S. Moscovitz, K. Yao, and R. Jain, Eds. New York: IEEE Press
    • A. Sitoelzle, S. Narayanaswamy, P. Schrupp, B. Richards, R. Yu, J. Rabaey, and R. Brodersen, “A flexible VLSI 60,000 word real-time continuous speech recognition system,” in VLSI Signal Processing IV, H. S. Moscovitz, K. Yao, and R. Jain, Eds. New York: IEEE Press, 1991.
    • (1991) VLSI Signal Processing IV
    • Sitoelzle, A.1    Narayanaswamy, S.2    Schrupp, P.3    Richards, B.4    Yu, R.5    Rabaey, J.6    Brodersen, R.7
  • 66
    • 0023548825 scopus 로고
    • Learning phonetic features using connectionist networks: An experiment in speech recognition
    • San Diego, CA
    • R. L. Watrous and L. Shastri, “Learning phonetic features using connectionist networks: An experiment in speech recognition,” in Proc. First Int. Conf. Neural Networks, San Diego, CA, 1987, pp. IV-381-388.
    • (1987) Proc. First Int. Conf. Neural Networks , pp. IV-381-IV-388
    • Watrous, R.L.1    Shastri, L.2
  • 67
    • 0025331805 scopus 로고
    • Complete gradient optimization of a recurrent network applied to ib/,/d/,/g/ discrimination
    • Mar.
    • R. L. Watrous, B. Ladendorf, and G. Kuhn, “Complete gradient optimization of a recurrent network applied to ib/,/d/,/g/ discrimination,” J. Acoust. Soc. Amer., vol. 87, no. 3, pp. 1302–1309, Mar. 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.3 , pp. 1302-1309
    • Watrous, R.L.1    Ladendorf, B.2    Kuhn, G.3
  • 68
    • 0003529238 scopus 로고
    • Beyond regression: New tools for prediction and analysis in the behavioral sciences
    • P. J. Werbos, “Beyond regression: New tools for prediction and analysis in the behavioral sciences,” Ph.D. dissertation, Dep. Appl. Math., Harvard Univ., 1974.
    • (1974) Ph.D. dissertation, Dep. Appl. Math., Harvard Univ
    • Werbos, P.J.1
  • 70
    • 84913475828 scopus 로고
    • Supervised phonetic segmentation with with applications to speech recognition
    • X. L. Aubert, “Supervised phonetic segmentation with with applications to speech recognition,” Proc. European Con & Speech Technology, Edinburgh, (GB), vol 2, pp. 161–164, 1987.
    • (1987) Proc. European Con & Speech Technology, Edinburgh, (GB) , vol.2 , pp. 161-164
    • Aubert, X.L.1
  • 71
    • 0001595997 scopus 로고
    • Neural network classifiers estimate Bayesian a posteriori probabilities
    • M. D. Richard and R. P. Lippman, “Neural network classifiers estimate Bayesian a posteriori probabilities,” Neural Computation, vol. 3, no. 4, pp. 461–483, 1991.
    • (1991) Neural Computation , vol.3 , Issue.4 , pp. 461-483
    • Richard, M.D.1    Lippman, R.P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.