



2003, Pages 149-190

Large vocabulary speech recognition based on statistical methods

Author keywords

[No Author keywords available]

EID: 33645586060     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1201/9780203010525     Document Type: Chapter
Times cited : (10)

References (152)
  • 1
    • 85055771355
    • http://coretex.itc.it
  • 4
    • 0030362995
    • A Compact Model for Speaker Adaptation Training
    • Philadelphia, PA
    • T. Anastasakos, J. McDonough, R. Schwartz and J. Makhoul, A Compact Model for Speaker Adaptation Training, Proc. ICSLP’96, 1137-1140, Philadelphia, PA, October 1996.
    • (1996) Proc. ICSLP’96 , pp. 1137-1140
    • Anastasakos, T.1    McDonough, J.2    Schwartz, R.3    Makhoul, J.4
  • 5
    • 85009118347
    • One Pass Cross Word Decoding for Large Vocabularies Based on a Lexical Tree Search Organization
    • Budapest, Hungary
    • X. Aubert, One Pass Cross Word Decoding for Large Vocabularies Based on a Lexical Tree Search Organization, Proc. ESCA Eurospeech’99, 4:1559-1562, Budapest, Hungary, September 1999.
    • (1999) Proc. ESCA Eurospeech’99 , vol.4 , pp. 1559-1562
    • Aubert, X.1
  • 6
    • 0026382117
    • The Forward-Backward Search Strategy for Real-Time Speech Recognition
    • Toronto, Canada
    • S. Austin, R. Schwartz and P. Placeway, The Forward-Backward Search Strategy for Real-Time Speech Recognition, Proc. IEEE ICASSP-91, 697-700, Toronto, Canada, May 1991.
    • (1991) Proc. IEEE ICASSP-91 , pp. 697-700
    • Austin, S.1    Schwartz, R.2    Placeway, P.3
  • 8
    • 0023725866
    • Acoustic Markov Models used in the Tangora Speech Recognition System
    • New York, NY
    • L.R. Bahl, P. Brown, P. de Souza, R.L. Mercer and M. Picheny, Acoustic Markov Models used in the Tangora Speech Recognition System, Proc. IEEE ICASSP-88, 1:497-500, New York, NY, April 1988.
    • (1988) Proc. IEEE ICASSP-88 , vol.1 , pp. 497-500
    • Bahl, L.R.1    Brown, P.2    De Souza, P.3    Mercer, R.L.4    Picheny, M.5
  • 10
  • 12
    • 0000353178
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • Baum, L.E., T. Petrie, G. Soules, and N. Weiss, A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains, Ann. Math. Stat. 41:164-171, 1970.
    • (1970) Ann. Math. Stat. , vol.41 , pp. 164-171
    • Baum, L.E.1    Petrie, T.2    Soules, G.3    Weiss, N.4
  • 13
    • 0027297381
    • Vector quantization for efficient computation of continuous density likelihoods
    • Minneapolis, MN
    • E. Bocchieri, Vector quantization for efficient computation of continuous density likelihoods, Proc. IEEE ICASSP-93, 2:692-695, Minneapolis, MN, May 1993.
    • (1993) Proc. IEEE ICASSP-93 , vol.2 , pp. 692-695
    • Bocchieri, E.1
  • 15
    • 85135168075
    • Word and acoustic confidence annotation for large vocabulary speech recognition
    • Rhodes, Greece
    • L. Chase, Word and acoustic confidence annotation for large vocabulary speech recognition, Proc. ESCA Eurospeech’97, 815-818, Rhodes, Greece, September 1997.
    • (1997) Proc. ESCA Eurospeech’97 , pp. 815-818
    • Chase, L.1
  • 17
    • 0033329799
    • An empirical study of smoothing techniques for language modeling
    • S.F. Chen and J. Goodman, An empirical study of smoothing techniques for language modeling, Computer Speech & Language, 13(4):359-394, October 1999.
    • (1999) Computer Speech & Language , vol.13 , Issue.4 , pp. 359-394
    • Chen, S.F.1    Goodman, J.2
  • 18
    • 0002595416
    • Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion
    • Lansdowne, VA
    • S.S. Chen and P.S. Gopalakrishnan, Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion, Proc. DARPA Broadcast News Transcription & Understanding Workshop, 127-132, Lansdowne, VA, February 1998.
    • (1998) Proc. DARPA Broadcast News Transcription & Understanding Workshop , pp. 127-132
    • Chen, S.S.1    Gopalakrishnan, P.S.2
  • 20
    • 85118743743
    • Statistical Language Modelling using CMU-Cambridge Toolkit
    • Rhodes, Greece
    • P. Clarkson and R. Rosenfeld, Statistical Language Modelling using CMU-Cambridge Toolkit, Proc. ESCA EuroSpeech’97, 2707-2710, Rhodes, Greece, September 1997.
    • (1997) Proc. ESCA EuroSpeech’97 , pp. 2707-2710
    • Clarkson, P.1    Rosenfeld, R.2
  • 21
    • 0019053271
    • Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences
    • S. Davis and P. Mermelstein, Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences, IEEE Trans. Acoustics, Speech, & Signal Processing, 28(4):357-366, 1980.
    • (1980) IEEE Trans. Acoustics, Speech, & Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 24
    • 33646934294
    • Genones: Optimizing the Degree of Tying in a Large Vocabulary HMM-based Speech Recognizer
    • Adelaide, Australia
    • V. Digalakis and H. Murveit, Genones: Optimizing the Degree of Tying in a Large Vocabulary HMM-based Speech Recognizer, Proc. IEEE ICASSP-94, 1:537-540, Adelaide, Australia, April 1994.
    • (1994) Proc. IEEE ICASSP-94 , vol.1 , pp. 537-540
    • Digalakis, V.1    Murveit, H.2
  • 25
    • 0029375590
    • Speaker adaptation using constrained estimation of Gaussian mixtures
    • V. Digalakis, D. Rtischev and L.G. Neumeyer, Speaker adaptation using constrained estimation of Gaussian mixtures, IEEE Trans. on Speech & Audio, 3(5):357-366, September 1995.
    • (1995) IEEE Trans. on Speech & Audio , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.1    Rtischev, D.2    Neumeyer, L.G.3
  • 27
    • 0141515360
    • Automatic Recognition of Phonetic Patterns in Speech
    • H. Dudley and S. Balashek, Automatic Recognition of Phonetic Patterns in Speech, J. Acoust. Soc. America, 30:721, 1958.
    • (1958) J. Acoust. Soc. America , vol.30 , pp. 721
    • Dudley, H.1    Balashek, S.2
  • 29
    • 0019583902
    • Comparison of speaker recognition methods using statistical features and dynamic features
    • S. Furui, Comparison of speaker recognition methods using statistical features and dynamic features, IEEE Trans. on Acoustics, Speech & Signal Processing, ASSP-29, 342-350, 1981.
    • (1981) IEEE Trans. on Acoustics, Speech & Signal Processing , vol.ASSP-29 , pp. 342-350
    • Furui, S.1
  • 30
    • 85017310148
    • An improved approach to hidden Markov model decomposition of speech and noise
    • San Francisco, CA
    • M.J.F. Gales and S.J. Young, An improved approach to hidden Markov model decomposition of speech and noise, Proc. IEEE ICASSP-92, 233-236, San Francisco, CA, March 1992.
    • (1992) Proc. IEEE ICASSP-92 , pp. 233-236
    • Gales, M.J.F.1    Young, S.J.2
  • 31
    • 0029390135
    • Robust Continuous Speech Recognition using Parallel Model Combination
    • M.J.F. Gales and S.J. Young, Robust Continuous Speech Recognition using Parallel Model Combination, Computer Speech & Language, 9(4):289-307, October 1995.
    • (1995) Computer Speech & Language , vol.9 , Issue.4 , pp. 289-307
    • Gales, M.J.F.1    Young, S.J.2
  • 32
    • 85128364359
    • Cluster Adaptive Training for Speech Recognition
    • Sydney, Australia
    • M.J.F. Gales, Cluster Adaptive Training for Speech Recognition, Proc. ICSLP’98, 1783-1786, Sydney, Australia, November 1998.
    • (1998) Proc. ICSLP’98 , pp. 1783-1786
    • Gales, M.J.F.1
  • 33
    • 0032638856
    • Semi-Tied Covariance Matrices for Hidden Markov Models
    • M.J.F. Gales, Semi-Tied Covariance Matrices for Hidden Markov Models, IEEE Trans. on Speech and Audio, 7(3):273-281, May 1999.
    • (1999) IEEE Trans. on Speech and Audio , vol.7 , Issue.3 , pp. 273-281
    • Gales, M.J.F.1
  • 35
    • 0001790691
    • Spoken Language component of the MASK Kiosk
    • K. Varghese, S. Pfleger (Eds.), Springer-Verlag, 1997. Also in Proc. Human Comfort and Security Workshop, Brussels, Belgium
    • J.L. Gauvain, S. Bennacef, L. Devillers, L. Lamel and R. Rosset, Spoken Language component of the MASK Kiosk, in K. Varghese, S. Pfleger (Eds.), Human Comfort and Security of Information Systems, Springer-Verlag, 1997. Also in Proc. Human Comfort and Security Workshop, Brussels, Belgium, October 1995.
    • (1995) Human Comfort and Security of Information Systems
    • Gauvain, J.L.1    Bennacef, S.2    Devillers, L.3    Lamel, L.4    Rosset, R.5
  • 36
    • 0030374902
    • Speech Recognition for an Information Kiosk
    • Philadelphia, PA
    • J.L. Gauvain, J.J. Gangolf, and L. Lamel, Speech Recognition for an Information Kiosk, Proc. ICSLP’96, 849-852, Philadelphia, PA, October 1996.
    • (1996) Proc. ICSLP’96 , pp. 849-852
    • Gauvain, J.L.1    Gangolf, J.J.2    Lamel, L.3
  • 37
    • 85128356454
    • Partitioning and Transcription of Broadcast News Data
    • Sydney, Australia
    • J.L. Gauvain, L. Lamel and G. Adda, Partitioning and Transcription of Broadcast News Data, Proc. ICSLP’98, 5:1335-1338, Sydney, Australia, December 1998.
    • (1998) Proc. ICSLP’98 , vol.5 , pp. 1335-1338
    • Gauvain, J.L.1    Lamel, L.2    Adda, G.3
  • 38
    • 0028996849
    • Developments in Continuous Speech Dictation using the ARPA WSJ Task
    • Detroit, MI
    • J.L. Gauvain, L.F. Lamel and M. Adda-Decker, Developments in Continuous Speech Dictation using the ARPA WSJ Task, Proc. IEEE ICASSP-95, 65-68, Detroit, MI, May 1995.
    • (1995) Proc. IEEE ICASSP-95 , pp. 65-68
    • Gauvain, J.L.1    Lamel, L.F.2    Adda-Decker, M.3
  • 39
    • 0028419019
    • Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains
    • J.L. Gauvain and C.H. Lee, Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains, IEEE Trans. Speech & Audio Processing, 2(2):291-298, April 1994.
    • (1994) IEEE Trans. Speech & Audio Processing , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.H.2
  • 40
    • 0036567851
    • The LIMSI Broadcast News Transcription System
    • J.L. Gauvain, L. Lamel and G. Adda, The LIMSI Broadcast News Transcription System, Speech Communication, 37(1-2):89-108, May 2002.
    • (2002) Speech Communication , vol.37 , Issue.1-2 , pp. 89-108
    • Gauvain, J.L.1    Lamel, L.2    Adda, G.3
  • 41
    • 3643049373
    • A Rapid Match Algorithm for Continuous Speech Recognition
    • Hidden Valley, PA
    • L. Gillick and R. Roth, A Rapid Match Algorithm for Continuous Speech Recognition, Proc. DARPA Speech & Natural Language Workshop, 170-172, Hidden Valley, PA, June 1990.
    • (1990) Proc. DARPA Speech & Natural Language Workshop , pp. 170-172
    • Gillick, L.1    Roth, R.2
  • 42
    • 0030648371
    • A Probabilistic Approach to Confidence Measure Estimation and Evaluation
    • Munich, Germany
    • L. Gillick, Y. Ito and J. Young, A Probabilistic Approach to Confidence Measure Estimation and Evaluation, Proc. IEEE ICASSP-97, 879-882, Munich, Germany, April 1997.
    • (1997) Proc. IEEE ICASSP-97 , pp. 879-882
    • Gillick, L.1    Ito, Y.2    Young, J.3
  • 43
    • 0032665631
    • Real-time Telephone-based Speech Recognition in the Jupiter Domain
    • Phoenix, AZ
    • J.R. Glass, T.J. Hazen and I. L. Hetherington, Real-time Telephone-based Speech Recognition in the Jupiter Domain, Proc. IEEE ICASSP-99, 1:61-64, Phoenix, AZ, March 1999.
    • (1999) Proc. IEEE ICASSP-99 , vol.1 , pp. 61-64
    • Glass, J.R.1    Hazen, T.J.2    Hetherington, I.L.3
  • 44
    • 85016587886
    • SWITCHBOARD: Telephone Speech Corpus for Research and Development
    • San Francisco, CA
    • J. Godfrey, E. Holliman and J. McDaniel, SWITCHBOARD: Telephone Speech Corpus for Research and Development, Proc. IEEE ICASSP-92, 517-520, San Francisco, CA, March 1992.
    • (1992) Proc. IEEE ICASSP-92 , pp. 517-520
    • Godfrey, J.1    Holliman, E.2    McDaniel, J.3
  • 45
    • 0000803388
    • The Population Frequencies of Species and the Estimation of Population Parameters
    • I.J. Good, The Population Frequencies of Species and the Estimation of Population Parameters, Biometrika, 40(3/4):237-264, 1953.
    • (1953) Biometrika , vol.40 , Issue.3-4 , pp. 237-264
    • Good, I.J.1
  • 46
    • 0028996969
    • A tree search strategy for large-vocabulary continuous speech recognition
    • Detroit, MI
    • P.S. Gopalakrishnan, L.R. Bahl and R.L. Mercer, A tree search strategy for large-vocabulary continuous speech recognition, Proc. IEEE ICASSP-95, 1:572-575, Detroit, MI, May 1995.
    • (1995) Proc. IEEE ICASSP-95 , vol.1 , pp. 572-575
    • Gopalakrishnan, P.S.1    Bahl, L.R.2    Mercer, R.L.3
  • 47
    • 85017287487
    • Linear Discriminant Analysis for Improved Large Vocabulary Continuous Speech Recognition
    • R. Haeb-Umbach and H. Ney, Linear Discriminant Analysis for Improved Large Vocabulary Continuous Speech Recognition, Proc. ICASSP-92, 1:13-16, March 1992.
    • (1992) Proc. ICASSP-92 , vol.1 , pp. 13-16
    • Haeb-Umbach, R.1    Ney, H.2
  • 51
    • 0025041264
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, Perceptual linear predictive (PLP) analysis of speech, J. Acoust. Soc. America, 87(4):1738-1752, 1990.
    • (1990) J. Acoust. Soc. America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 52
    • 0002384092
    • Large vocabulary continuous speech recognition using a hybrid connectionist-HMM system
    • Yokohama, Japan
    • M.M. Hochberg, S.J. Renals, A.J. Robinson and D. Kershaw, Large vocabulary continuous speech recognition using a hybrid connectionist-HMM system, Proc. ICSLP’94, 1499-1502, Yokohama, Japan, September 1994.
    • (1994) Proc. ICSLP’94 , pp. 1499-1502
    • Hochberg, M.M.1    Renals, S.J.2    Robinson, A.J.3    Kershaw, D.4
  • 53
    • 85055768776
    • Chapter 1.3 of the State of the Art in Human Language Technology, (Cole et al, eds.)
    • M.J. Hunt, Signal Representation, Chapter 1.3 of the State of the Art in Human Language Technology, (Cole et al, eds.), 1996. (http://www.cse.ogi.edu/CSLU/HLTsurvey/ch1node2.html)
    • (1996) Signal Representation
    • Hunt, M.J.1
  • 54
    • 85015539783
    • Subphonetic Modeling with Markov States - Senone
    • San Francisco, CA
    • M. Hwang and X. Huang, Subphonetic Modeling with Markov States - Senone, Proc. IEEE ICASSP-92, 1:33-36, San Francisco, CA, March 1992.
    • (1992) Proc. IEEE ICASSP-92 , vol.1 , pp. 33-36
    • Hwang, M.1    Huang, X.2
  • 55
    • 0027153655
    • Predicting Unseen Triphones with Senones
    • Minneapolis, MN
    • M.Y. Hwang, X. Huang and F. Alleva, Predicting Unseen Triphones with Senones, Proc. IEEE ICASSP-93, II:311-314, Minneapolis, MN, April 1993.
    • (1993) Proc. IEEE ICASSP-93 , vol.2 , pp. 311-314
    • Hwang, M.Y.1    Huang, X.2    Alleva, F.3
  • 56
    • 0016939124
    • Continuous Speech Recognition by Statistical Methods
    • F. Jelinek, Continuous Speech Recognition by Statistical Methods, Proc. of the IEEE, 64(4):532-556, April 1976.
    • (1976) Proc. of the IEEE , vol.64 , Issue.4 , pp. 532-556
    • Jelinek, F.1
  • 61
    • 0023312404
    • Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer
    • S.M. Katz, Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer, IEEE Trans. Acoustics, Speech & Signal Processing, ASSP-35(3):400-401, March 1987.
    • (1987) IEEE Trans. Acoustics, Speech & Signal Processing , vol.ASSP-35 , Issue.3 , pp. 400-401
    • Katz, S.M.1
  • 62
    • 85135261720
    • Unsupervised Training of a Speech Recognizer: Recent Experiments
    • Budapest, Hungary
    • T. Kemp and A. Waibel, Unsupervised Training of a Speech Recognizer: Recent Experiments, Proc. ESCA Eurospeech’99, 6:2725-2728, Budapest, Hungary, September 1999.
    • (1999) Proc. ESCA Eurospeech’99 , vol.6 , pp. 2725-2728
    • Kemp, T.1    Waibel, A.2
  • 63
    • 33646908801
    • The 1995 Abbot hybrid connectionist-HMM large-vocabulary recognition system
    • Harriman, NY
    • D. Kershaw, A.J. Robinson and S.J. Renals, The 1995 Abbot hybrid connectionist-HMM large-vocabulary recognition system, Proc. ARPA Speech Recognition Workshop, 93-98, Harriman, NY, February 1996.
    • (1996) Proc. ARPA Speech Recognition Workshop , pp. 93-98
    • Kershaw, D.1    Robinson, A.J.2    Renals, S.J.3
  • 64
    • 85123963268
    • Improved Clustering Techniques for Class-Based Statistical Language Modelling
    • Berlin, September
    • R. Kneser and H. Ney, Improved Clustering Techniques for Class-Based Statistical Language Modelling, Proc. Eurospeech’93, 973-976, Berlin, September 1993.
    • (1993) Proc. Eurospeech’93 , pp. 973-976
    • Kneser, R.1    Ney, H.2
  • 65
    • 0028996876
    • Improved backing-off for n-gram language modeling
    • Detroit, MI, May
    • R. Kneser and H. Ney, Improved backing-off for n-gram language modeling, Proc. IEEE ICASSP-95, 1:181-184, Detroit, MI, May 1995.
    • (1995) Proc. IEEE ICASSP-95 , vol.1 , pp. 181-184
    • Kneser, R.1    Ney, H.2
  • 68
    • 0032289099
    • Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition
    • December
    • N. Kumar and A.G. Andreou, Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition, Speech Communication, 26(4):283-297, December 1998.
    • (1998) Speech Communication , vol.26 , Issue.4 , pp. 283-297
    • Kumar, N.1    Andreou, A.G.2
  • 70
    • 0030351374
    • On Designing Pronunciation Lexicons for Large Vocabulary, Continuous Speech Recognition
    • Philadelphia, PA, October
    • L.F. Lamel and G. Adda, On Designing Pronunciation Lexicons for Large Vocabulary, Continuous Speech Recognition, Proc. ICSLP’96, 1:6-9, Philadelphia, PA, October 1996.
    • (1996) Proc. ICSLP’96 , vol.1 , pp. 6-9
    • Lamel, L.F.1    Adda, G.2
  • 73
    • 0029219785
    • A Phone-based Approach to Non-Linguistic Speech Feature Identification
    • January
    • L.F. Lamel and J.L. Gauvain, A Phone-based Approach to Non-Linguistic Speech Feature Identification, Computer Speech & Language, 9(1):87-103, January 1995.
    • (1995) Computer Speech & Language , vol.9 , Issue.1 , pp. 87-103
    • Lamel, L.F.1    Gauvain, J.L.2
  • 74
    • 0036460908
    • Lightly Supervised and Unsupervised Acoustic Model Training
    • L. Lamel, J.L. Gauvain, and G. Adda, Lightly Supervised and Unsupervised Acoustic Model Training, Computer Speech & Language, 16(1):115-129, January 2002.
    • (2002) Computer Speech & Language , vol.16 , Issue.1 , pp. 115-129
    • Lamel, L.1    Gauvain, J.L.2    Adda, G.3
  • 77
    • 0029747183
    • Speaker Normalization Using Efficient Frequency Warping Procedures
    • Atlanta, GA, May
    • L. Lee and R.C. Rose, Speaker Normalization Using Efficient Frequency Warping Procedures, Proc. IEEE ICASSP-96, 1:353-356, Atlanta, GA, May 1996.
    • (1996) Proc. IEEE ICASSP-96 , vol.1 , pp. 353-356
    • Lee, L.1    Rose, R.C.2
  • 78
    • 0029288633
    • Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models
    • C. J. Leggetter and P. C. Woodland, Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models, Computer Speech & Language, 9(2):171-185, April 1995.
    • (1995) Computer Speech & Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 79
    • 0020180460
    • Maximum Likelihood Estimation for Multivariate Observations of Markov Sources
    • Liporace, L. R., Maximum Likelihood Estimation for Multivariate Observations of Markov Sources, IEEE Transactions on Information Theory, IT-28(5):729-734, 1982.
    • (1982) IEEE Transactions on Information Theory , vol.IT-28 , Issue.5 , pp. 729-734
    • Liporace, L.R.1
  • 80
    • 0031187171
    • Speech recognition by machines and humans
    • R. P. Lippmann, Speech recognition by machines and humans, Speech Communication, 22(1):1-15, July 1997.
    • (1997) Speech Communication , vol.22 , Issue.1 , pp. 1-15
    • Lippmann, R.P.1
  • 81
    • 85119434191
    • Fast Speaker Change Detection for Broadcast News Transcription and Indexing
    • Budapest, Hungary
    • D. Liu and F. Kubala, Fast Speaker Change Detection for Broadcast News Transcription and Indexing, Proc. ESCA EuroSpeech’99, 3:1031-1034, Budapest, Hungary, September 1999.
    • (1999) Proc. ESCA EuroSpeech’99 , vol.3 , pp. 1031-1034
    • Liu, D.1    Kubala, F.2
  • 82
    • 0345098384
    • Multi-site Data Collection for a Spoken Language Corpus
    • Harriman, NY, February
    • Madcow, M., Multi-site Data Collection for a Spoken Language Corpus, Proc. DARPA Speech & Natural Language Workshop, 7-14, Harriman, NY, February 1992.
    • (1992) Proc. DARPA Speech & Natural Language Workshop , pp. 7-14
    • Madcow, M.1
  • 83
    • 0034296009
    • Finding Consensus in Speech Recognition: Word Error Minimization and Other Applications of Confusion Networks
    • L. Mangu, E. Brill and A. Stolcke, Finding Consensus in Speech Recognition: Word Error Minimization and Other Applications of Confusion Networks, Computer Speech and Language, 14(4):373-400, October 2000.
    • (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 373-400
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 84
    • 85135158363
    • Subspace distribution clustering for continuous observation density hidden Markov models
    • Rhodes, Greece
    • B. Mak and E. Bocchieri, Subspace distribution clustering for continuous observation density hidden Markov models, Proc. Eurospeech’97, 107-110, Rhodes, Greece, September 1997.
    • (1997) Proc. Eurospeech’97 , pp. 107-110
    • Mak, B.1    Bocchieri, E.2
  • 85
    • 33646936293
    • Spoken Language Processing and Human-Machine Communication in the European Union Programs
    • G. Varile, ed, Rhodes, Greece, September
    • J.J. Mariani, Spoken Language Processing and Human-Machine Communication in the European Union Programs, in G. Varile, ed., Eurospeech’97 EU Speech Projects Day report, Rhodes, Greece, September 1997.
    • (1997) Eurospeech’97 EU Speech Projects Day Report
    • Mariani, J.J.1
  • 87
    • 85135152717
    • Algorithms for Bigram and Trigram Clustering
    • Madrid, Spain
    • S. Martin, J. Liermann and H. Ney, Algorithms for Bigram and Trigram Clustering, Proc. Eurospeech’95, 1253-1256, Madrid, Spain, September 1995.
    • (1995) Proc. Eurospeech’95 , pp. 1253-1256
    • Martin, S.1    Liermann, J.2    Ney, H.3
  • 90
    • 84892168937
    • Full Expansion of Context-Dependent Networks in Large Vocabulary Speech Recognition
    • Seattle, WA
    • M. Mohri, M. Riley, D. Hindle, A. Ljolje and F. Pereira, Full Expansion of Context-Dependent Networks in Large Vocabulary Speech Recognition, Proc. IEEE ICASSP-98, 665-668, Seattle, WA, May 1998.
    • (1998) Proc. IEEE ICASSP-98 , pp. 665-668
    • Mohri, M.1    Riley, M.2    Hindle, D.3    Ljolje, A.4    Pereira, F.5
  • 91
    • 0027192626
    • Large-Vocabulary Dictation using SRI’s Decipher Speech Recognition System: Progressive Search Techniques
    • Minneapolis, MN
    • H. Murveit, J. Butzberger, V. Digalakis and M. Weintraub, Large-Vocabulary Dictation using SRI’s Decipher Speech Recognition System: Progressive Search Techniques, Proc. IEEE ICASSP-93, II:319-322, Minneapolis, MN, April 1993.
    • (1993) Proc. IEEE ICASSP-93 , vol.2 , pp. 319-322
    • Murveit, H.1    Butzberger, J.2    Digalakis, V.3    Weintraub, M.4
  • 92
    • 0021406359
    • The Use of a One-Stage Dynamic Programming Algorithm for Connected Word Recognition
    • H. Ney, The Use of a One-Stage Dynamic Programming Algorithm for Connected Word Recognition, IEEE Trans. Acoustics, Speech and Signal Processing, ASSP-32(2):263-271, April 1984.
    • (1984) IEEE Trans. Acoustics, Speech and Signal Processing , vol.ASSP-32 , Issue.2 , pp. 263-271
    • Ney, H.1
  • 93
    • 85017308347
    • Improvements in Beam Search for 10000-Word Continuous Speech Recognition
    • San Francisco, CA
    • H. Ney, R. Haeb-Umbach, B.H. Tran and M. Oerder, Improvements in Beam Search for 10000-Word Continuous Speech Recognition, Proc. IEEE ICASSP-92, I:9-12, San Francisco, CA, March 1992.
    • (1992) Proc. IEEE ICASSP-92 , vol.1 , pp. 9-12
    • Ney, H.1    Haeb-Umbach, R.2    Tran, B.H.3    Oerder, M.4
  • 94
    • 0032689227
    • Single-Tree Method for Grammar-Directed Search
    • Phoenix, AZ
    • L. Nguyen and R. Schwartz, Single-Tree Method for Grammar-Directed Search, Proc. IEEE ICASSP-99, 2:613-616, Phoenix, AZ, March 1999.
    • (1999) Proc. IEEE ICASSP-99 , vol.2 , pp. 613-616
    • Nguyen, L.1    Schwartz, R.2
  • 98
    • 0036295941
    • Modeling Inverse Covariance Matrices by Basis Expansion
    • Orlando, FL
    • P.A. Olsen and R.A. Gopinath, Modeling Inverse Covariance Matrices by Basis Expansion, Proc. IEEE ICASSP-02, 945-948, Orlando, FL, 2002.
    • (2002) Proc. IEEE ICASSP-02 , pp. 945-948
    • Olsen, P.A.1    Gopinath, R.A.2
  • 99
    • 0030366694
    • Language-model look-ahead for large vocabulary speech recognition
    • Philadelphia, PA
    • S. Ortmanns, H. Ney, and A. Eiden, Language-model look-ahead for large vocabulary speech recognition, Proc. ICSLP’96, 2095-2098, Philadelphia, PA, October 1996.
    • (1996) Proc. ICSLP’96 , pp. 2095-2098
    • Ortmanns, S.1    Ney, H.2    Eiden, A.3
  • 100
    • 0030719155
    • A Word Graph Algorithm for Large Vocabulary Continuous Speech Recognition
    • S. Ortmanns, H. Ney, and X. Aubert, A Word Graph Algorithm for Large Vocabulary Continuous Speech Recognition, Computer Speech and Language, 11(1):43-72, January 1997.
    • (1997) Computer Speech and Language , vol.11 , Issue.1 , pp. 43-72
    • Ortmanns, S.1    Ney, H.2    Aubert, X.3
  • 106
    • 0001895107
    • 1998 Broadcast News Benchmark Test Results: English and Non-English Word Error Rate Performance Measures
    • Herndon, VA
    • D. S. Pallett, J. G. Fiscus, J. S. Garofolo, A.F. Martin and M. A. Przybocki, 1998 Broadcast News Benchmark Test Results: English and Non-English Word Error Rate Performance Measures, Proc. DARPA Broadcast News Workshop, 5-12, Herndon, VA, February 1999.
    • (1999) Proc. DARPA Broadcast News Workshop , pp. 5-12
    • Pallett, D.S.1    Fiscus, J.G.2    Garofolo, J.S.3    Martin, A.F.4    Przybocki, M.A.5
  • 107
    • 85017287102
    • An efficient A* stack decoder algorithm for continuous speech recognition with a stochastic language model
    • San Francisco, CA
    • D. B. Paul, An efficient A* stack decoder algorithm for continuous speech recognition with a stochastic language model, Proc. IEEE ICASSP-92, 405-409, San Francisco, CA, March 1992.
    • (1992) Proc. IEEE ICASSP-92 , pp. 405-409
    • Paul, D.B.1
  • 108
    • 0034849080
    • Improved Discriminative Training Techniques For Large Vocabulary Continuous Speech Recognition
    • Salt Lake City, May
    • D. Povey and P. Woodland, Improved Discriminative Training Techniques For Large Vocabulary Continuous Speech Recognition, Proc. IEEE ICASSP-01, Salt Lake City, May 2001.
    • (2001) Proc. IEEE ICASSP-01
    • Povey, D.1    Woodland, P.2
  • 109
    • 0002617904
    • Evaluation of Spoken Language Systems: The ATIS Domain
    • Hidden Valley, PA, June
    • P. Price, Evaluation of Spoken Language Systems: The ATIS Domain, Proc. DARPA Speech and Natural Language Workshop, 91-95, Hidden Valley, PA, June 1990.
    • (1990) Proc. DARPA Speech and Natural Language Workshop , pp. 91-95
    • Price, P.1
  • 114
    • 0003904645
    • Ph. D. Thesis, Carnegie Mellon University, (also Tech. rep. CMU-CS-94-138)
    • R. Rosenfeld, Adaptive Statistical Language Modeling, Ph. D. Thesis, Carnegie Mellon University, 1994. (also Tech. rep. CMU-CS-94-138).
    • (1994) Adaptive Statistical Language Modeling
    • Rosenfeld, R.1
  • 116
    • 0035426931
    • Language-independent and language-adaptive acoustic modeling for speech recognition
    • T. Schultz and A. Waibel, Language-independent and language-adaptive acoustic modeling for speech recognition, Speech Communication, 35(1-2):31-51, August 2001.
    • (2001) Speech Communication , vol.35 , Issue.1-2 , pp. 31-51
    • Schultz, T.1    Waibel, A.2
  • 117
    • 0033896970
    • Memory-efficient LVCSR search using a one-pass stack decoder
    • M. Schuster, Memory-efficient LVCSR search using a one-pass stack decoder, Computer Speech & Language, 14(1):47-77, January 2000.
    • (2000) Computer Speech & Language , vol.14 , Issue.1 , pp. 47-77
    • Schuster, M.1
  • 118
    • 85017310294
    • New uses for N-Best Sentence Hypothesis, within the BYBLOS Speech Recognition System
    • San Francisco, CA
    • R. Schwartz, S. Austin, F. Kubala and J. Makhoul, New uses for N-Best Sentence Hypothesis, within the BYBLOS Speech Recognition System, Proc. IEEE ICASSP-92, 1:1-4, San Francisco, CA, March 1992.
    • (1992) Proc. IEEE ICASSP-92 , vol.1 , pp. 1-4
    • Schwartz, R.1    Austin, S.2    Kubala, F.3    Makhoul, J.4
  • 119
    • 0021142214
    • Improved Hidden Markov Modeling of Phonemes for Continuous Speech Recognition
    • San Diego, CA
    • R. Schwartz, Y. Chow, S. Roucos, M. Krasner and J. Makhoul, Improved Hidden Markov Modeling of Phonemes for Continuous Speech Recognition, Proc. IEEE ICASSP-84, 3:35.6.1-35.6.4, San Diego, CA, March 1984.
    • (1984) Proc. IEEE ICASSP-84
    • Schwartz, R.1    Chow, Y.2    Roucos, S.3    Krasner, M.4    Makhoul, J.5
  • 120
    • 33646939277
    • NYU Language Modeling Experiments for the 1995 CSR Evaluation
    • Harriman, NY
    • S. Sekine and R. Grishman, NYU Language Modeling Experiments for the 1995 CSR Evaluation, Proc. ARPA Speech Recognition Workshop, 123-128, Harriman, NY, February 1996.
    • (1996) Proc. ARPA Speech Recognition Workshop , pp. 123-128
    • Sekine, S.1    Grishman, R.2
  • 121
    • 0029726011
    • A Markov Random Field Approach to Bayesian Speaker Adaptation
    • Detroit, MI
    • B. Shahshahani, A Markov Random Field Approach to Bayesian Speaker Adaptation, Proc. IEEE ICASSP-95, 697-700, Detroit, MI, May 1995.
    • (1995) Proc. IEEE ICASSP-95 , pp. 697-700
    • Shahshahani, B.1
  • 123
    • 0030361237
    • Scalable backoff language models
    • Philadelphia, PA
    • K. Seymore and R. Rosenfeld, Scalable backoff language models, Proc. ICSLP’96, 1:232-235, Philadelphia, PA, October 1996.
    • (1996) Proc. ICSLP’96 , vol.1 , pp. 232-235
    • Seymore, K.1    Rosenfeld, R.2
  • 124
    • 0002782496
    • Automatic Segmentation, Classification and Clustering of Broadcast News Audio
    • Chantilly, VA
    • M. Siegler, U. Jain, B. Raj and R. Stern, Automatic Segmentation, Classification and Clustering of Broadcast News Audio, Proc. DARPA Speech Recognition Workshop, 97-99, Chantilly, VA, February 1997.
    • (1997) Proc. DARPA Speech Recognition Workshop , pp. 97-99
    • Siegler, M.1    Jain, U.2    Raj, B.3    Stern, R.4
  • 125
    • 0033344871
    • Evaluation of word confidence for speech recognition systems
    • M. Siu and H. Gish, Evaluation of word confidence for speech recognition systems, Computer Speech & Language, 13(4):299-318, October 1999.
    • (1999) Computer Speech & Language , vol.13 , Issue.4 , pp. 299-318
    • Siu, M.1    Gish, H.2
  • 127
    • 0028996958
    • Four-level Tied Structure for Efficient Representation of Acoustic Modeling
    • Detroit, MI
    • S. Takahashi and S. Sagayama, Four-level Tied Structure for Efficient Representation of Acoustic Modeling, Proc. IEEE ICASSP-95, 520-523, Detroit, MI, May 1995.
    • (1995) Proc. IEEE ICASSP-95 , pp. 520-523
    • Takahashi, S.1    Sagayama, S.2
  • 128
    • 85135261079
    • An Investigation into Vocal Tract Length Normalization
    • Budapest, Hungary
    • L. F. Uebel and P. C. Woodland, An Investigation into Vocal Tract Length Normalization, Proc. ESCA Eurospeech’99, 2527-2530, Budapest, Hungary, September 1999.
    • (1999) Proc. ESCA Eurospeech’99 , pp. 2527-2530
    • Uebel, L.F.1    Woodland, P.C.2
  • 129
    • 0040262071
    • Human Benchmarks for Speaker Independent Large Vocabulary Recognition Performance
    • Madrid, Spain
    • D.A. van Leeuwen, L. G. van den Berg and H.J. M. Steeneken, Human Benchmarks for Speaker Independent Large Vocabulary Recognition Performance, Proc. ESCA Eurospeech’95, 1461-1464, Madrid, Spain, September 1995.
    • (1995) Proc. ESCA Eurospeech’95 , pp. 1461-1464
    • Van Leeuwen, D.A.1    Van Den Berg, L.G.2    Steeneken, H.J.M.3
  • 130
    • 0010727514
    • Speech discrimination by dynamic programming
    • T. K. Vintsyuk, Speech discrimination by dynamic programming, Kibernetika, 4:81, 1968.
    • (1968) Kibernetika , vol.4 , pp. 81
    • Vintsyuk, T.K.1
  • 131
    • 34250411858
    • Elements-wise recognition of continuous speech composed of words from a specified dictionary
    • March-April
    • T. K. Vintsyuk, Elements-wise recognition of continuous speech composed of words from a specified dictionary, Cybernetics, 7:133-143, March-April 1971.
    • (1971) Cybernetics , vol.7 , pp. 133-143
    • Vintsyuk, T.K.1
  • 132
    • 0001891171
    • Verbmobil: Translation of Face-to-Face Dialogs
    • Berlin, Germany, Plenary
    • W. Wahlster, Verbmobil: Translation of Face-to-Face Dialogs, Proc. ESCA Eurospeech’93, Berlin, Germany, Plenary, 29-38, September 1993.
    • (1993) Proc. ESCA Eurospeech’93 , pp. 29-38
    • Wahlster, W.1
  • 134
    • 0032678104
    • Probabilistic Models for Topic Detection and Tracking
    • Phoenix, AZ
    • F. Walls, H. Jin, S. Sista and R. Schwartz, Probabilistic Models for Topic Detection and Tracking, Proc. IEEE ICASSP-99, 1:521-524, Phoenix, AZ, March 1999.
    • (1999) Proc. IEEE ICASSP-99 , vol.1 , pp. 521-524
    • Walls, F.1    Jin, H.2    Sista, S.3    Schwartz, R.4
  • 136
    • 0032657771
    • Progress in Broadcast News Transcription at Dragon Systems
    • Phoenix, AZ
    • S. Wegmann, P. Zhan, and L. Gillick, Progress in Broadcast News Transcription at Dragon Systems, Proc IEEE ICASSP-99, 33-36, Phoenix, AZ, March 1999.
    • (1999) Proc IEEE ICASSP-99 , pp. 33-36
    • Wegmann, S.1    Zhan, P.2    Gillick, L.3
  • 137
    • 0030706666
    • Neural-Network based Measures of Confidence for Word Recognition
    • Munich, Germany
    • M. Weintraub, F. Beaufays, Z. Rivlin, Y. Konig and A. Stolcke, Neural-Network based Measures of Confidence for Word Recognition, Proc. IEEE ICASSP-97, 887-890, Munich, Germany, April 1997.
    • (1997) Proc. IEEE ICASSP-97 , pp. 887-890
    • Weintraub, M.1    Beaufays, F.2    Rivlin, Z.3    Konig, Y.4    Stolcke, A.5
  • 138
    • 0031630644
    • Using word probabilities as confidence measures
    • Seattle, WA
    • F. Wessel, K. Macherey and R. Schlüter, Using word probabilities as confidence measures, Proc. IEEE ICASSP-98, 225-228, Seattle, WA, May 1998.
    • (1998) Proc. IEEE ICASSP-98 , pp. 225-228
    • Wessel, F.1    Macherey, K.2    Schlüter, R.3
  • 139
    • 84962920544
    • Unsupervised training of acoustic models for large vocabulary continuous speech recognition
    • Madonna di Campiglio, Italy
    • F. Wessel and H. Ney, Unsupervised training of acoustic models for large vocabulary continuous speech recognition, Proc. IEEE ASRU’01, Madonna di Campiglio, Italy, December 2001.
    • (2001) Proc. IEEE ASRU’01
    • Wessel, F.1    Ney, H.2
  • 140
    • 0026187945
    • The Zero Frequency Problem: Estimating the Probabilities of Novel Events in Adaptive Text Compression
    • I.H. Witten and T. C. Bell, The Zero Frequency Problem: Estimating the Probabilities of Novel Events in Adaptive Text Compression, IEEE Trans. on Information Theory, 37(1):1085-1094, July 1991.
    • (1991) IEEE Trans. on Information Theory , vol.37 , Issue.1 , pp. 1085-1094
    • Witten, I.H.1    Bell, T.C.2
  • 141
    • 0036461035
    • Large scale discriminative training of hidden Markov models for speech recognition
    • P. C. Woodland and D. Povey, Large scale discriminative training of hidden Markov models for speech recognition, Computer Speech and Language, 16(1):25-47, January 2002.
    • (2002) Computer Speech and Language , vol.16 , Issue.1 , pp. 25-47
    • Woodland, P.C.1    Povey, D.2
  • 144
    • 0031624946
    • A Hidden Markov Approach to Text Segmentation and Event Tracking
    • Seattle, WA
    • J.P. Yamron, I. Carp, L. Gillick, S. Lowe and P. van Mulbregt, A Hidden Markov Approach to Text Segmentation and Event Tracking, Proc IEEE ICASSP-98, 1:333-336, Seattle, WA, May 1998.
    • (1998) Proc IEEE ICASSP-98 , vol.1 , pp. 333-336
    • Yamron, J.P.1    Carp, I.2    Gillick, L.3    Lowe, S.4    Van Mulbregt, P.5
  • 145
    • 0030244826
    • A Review of Large-Vocabulary Continuous Speech Recognition
    • S. J. Young, A Review of Large-Vocabulary Continuous Speech Recognition, IEEE Signal Processing Magazine, 13(5):45-57, September 1996.
    • (1996) IEEE Signal Processing Magazine , vol.13 , Issue.5 , pp. 45-57
    • Young, S.J.1
  • 147
    • 0032181247
    • Speech recognition evaluation: A review of the U.S. CSR and LVCSR programmes
    • S.J. Young and L. Chase, Speech recognition evaluation: a review of the U.S. CSR and LVCSR programmes, Computer Speech & Language, 12(4):263-279, October 1998.
    • (1998) Computer Speech & Language , vol.12 , Issue.4 , pp. 263-279
    • Young, S.J.1    Chase, L.2
  • 149
    • 85135369802
    • The Use of State Tying in Continuous Speech Recognition
    • Berlin, Germany
    • S.J. Young and P. C. Woodland, The Use of State Tying in Continuous Speech Recognition, Proc. ESCA Eurospeech’93, 3:2203-2206, Berlin, Germany, September 1993.
    • (1993) Proc. ESCA Eurospeech’93 , vol.3 , pp. 2203-2206
    • Young, S.J.1    Woodland, P.C.2
  • 151
    • 0029745232
    • Maximum a Posteriori Adaptation for Large Scale HMM Recognizers
    • Detroit, MI
    • G. Zavaliagkos, R. Schwartz and J. McDonough, Maximum a Posteriori Adaptation for Large Scale HMM Recognizers, Proc IEEE ICASSP-95, 725-728, Detroit, MI, May 1995.
    • (1995) Proc IEEE ICASSP-95 , pp. 725-728
    • Zavaliagkos, G.1    Schwartz, R.2    McDonough, J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.