Volume E86-D, Issue 3, 2003, Pages 377-396

On automatic speech recognition at the dawn of the 21st century

Author keywords

Acoustic modeling; Automatic speech recognition; Dynamic programming; Feature extraction and detection; Heuristic search; Hidden Markov model; Language modeling; Lexical modeling; Maximum likelihood; Pattern recognition; String decoding; Utterance verification

Indexed keywords

ACOUSTICS; ALGORITHMS; COMPUTATIONAL METHODS; DYNAMIC PROGRAMMING; FEATURE EXTRACTION; MARKOV PROCESSES; MATHEMATICAL MODELS; SIGNAL PROCESSING; SPEECH ANALYSIS;

EID: 0038720005     PISSN: 09168532     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Review
Times cited : (22)

References (99)
  • 2
    • 60349114392
    • From Lord Rayleigh to Shannon: How do we decode speech?
    • Orlando
    • J. Allen, "From Lord Rayleigh to Shannon: How do we decode speech?," Proc. ICASSP-2002, Orlando, 2002.
    • (2002) Proc. ICASSP-2002
    • Allen, J.1
  • 3
    • 0020602364
    • Efficient coding of LPC parameters by temporal decomposition
    • B.S. Atal, "Efficient coding of LPC parameters by temporal decomposition," Proc. ICASSP-83, pp.81-84, 1983.
    • (1983) Proc. ICASSP-83 , pp. 81-84
    • Atal, B.S.1
  • 4
    • 0020719320
    • A maximum likelihood approach to continuous speech recognition
    • L.R. Bahl, F. Jelinek, and R.L. Mercer, "A maximum likelihood approach to continuous speech recognition," IEEE Trans. Pattern Anal. Mach. Intell., vol.5, no.2, pp.179-190, 1983.
    • (1983) IEEE Trans. Pattern Anal. Mach. Intell. , vol.5 , Issue.2 , pp. 179-190
    • Bahl, L.R.1    Jelinek, F.2    Mercer, R.L.3
  • 5
    • 0022890536
    • Maximum mutual information estimation of hidden Markov model parameters for speech recognition
    • Tokyo
    • L.R. Bahl, P.F. Brown, P.V. de Souza, and R.L. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition," Proc. ICASSP-86, pp.49-52, Tokyo, 1986.
    • (1986) Proc. ICASSP-86 , pp. 49-52
    • Bahl, L.R.1    Brown, P.F.2    De Souza, P.V.3    Mercer, R.L.4
  • 6
    • 0000353178
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • L.E. Baum, T. Petrie, G. Soules, and N. Weiss, "A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains," Ann. Math. Stat., vol.41, pp.164-171, 1970.
    • (1970) Ann. Math. Stat. , vol.41 , pp. 164-171
    • Baum, L.E.1    Petrie, T.2    Soules, G.3    Weiss, N.4
  • 7
    • 0025629882
    • Tied mixture continuous parameter modeling for speech recognition
    • J.R. Bellegarda and D. Nahamoo, "Tied mixture continuous parameter modeling for speech recognition," IEEE Trans. Acoust., Speech Signal Process, vol.38, no.12, pp.2033-2045, 1990.
    • (1990) IEEE Trans. Acoust., Speech Signal Process , vol.38 , Issue.12 , pp. 2033-2045
    • Bellegarda, J.R.1    Nahamoo, D.2
  • 8
    • 0000274403
    • Exploiting latent semantic information for statistical language modeling
    • J.R. Bellegarda, "Exploiting latent semantic information for statistical language modeling," Proc. IEEE, vol.88, no.8, pp.1279-1296, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1279-1296
    • Bellegarda, J.R.1
  • 9
    • 84949488544
    • Discriminative feature extraction for speech recognition
    • Workshop
    • A. Biem, S. Katagiri, and B.-H. Juang, "Discriminative feature extraction for speech recognition," Proc. IEEE NN-SP Workshop, 1993.
    • (1993) Proc. IEEE NN-SP
    • Biem, A.1    Katagiri, S.2    Juang, B.-H.3
  • 12
    • 0001853667
    • An investigation of segmental hidden dynamic models of speech coarticulation for automatic speech recognition
    • CLSP at Johns Hopkins University
    • J. Bridle, et al., "An investigation of segmental hidden dynamic models of speech coarticulation for automatic speech recognition," Final Report: 1998 Workshop on Language Engineering, pp.1-61, CLSP at Johns Hopkins University, 1998.
    • (1998) Final Report: 1998 Workshop on Language Engineering , pp. 1-61
    • Bridle, J.1
  • 14
    • 0000767590
    • Discriminant-function-based minimum recognition error rate pattern recognition approach to automatic speech recognition
    • W. Chou, "Discriminant-function-based minimum recognition error rate pattern recognition approach to automatic speech recognition," Proc. IEEE, vol.88, no.8, pp. 1201-1223, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1201-1223
    • Chou, W.1
  • 16
    • 0042660763
    • Speech and language processing for next-millennium communication services
    • R.V. Cox, C.A. Kamm, L.R. Rabiner, J. Schroeter, and J.G. Wilpon, "Speech and language processing for next-millennium communication services," Proc. IEEE, vol.88, no.8, pp.1237-1314, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1237-1314
    • Cox, R.V.1    Kamm, C.A.2    Rabiner, L.R.3    Schroeter, J.4    Wilpon, J.G.5
  • 17
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S.B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech Signal Process., vol.28, no.4, pp.357-366, 1980.
    • (1980) IEEE Trans. Acoust., Speech Signal Process. , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 20
    • 84948598244
    • Statistical model based speech enhancement systems
    • Y. Ephraim, "Statistical model based speech enhancement systems," Proc. IEEE, vol.80, no.10, pp.1526-1555, 1992.
    • (1992) Proc. IEEE , vol.80 , Issue.10 , pp. 1526-1555
    • Ephraim, Y.1
  • 22
    • 0030638031
    • A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
    • Santa Barbara
    • J.G. Fiscus, "A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)," Proc. 1997 ASRU Workshop, pp.347-352, Santa Barbara, 1997.
    • (1997) Proc. 1997 ASRU Workshop , pp. 347-352
    • Fiscus, J.G.1
  • 24
    • 0022667694
    • Speaker-independent isolated word recognition using dynamic features of speech spectrum
    • S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust., Speech Signal Process., vol.34, no.1, pp.52-59, 1986.
    • (1986) IEEE Trans. Acoust., Speech Signal Process , vol.34 , Issue.1 , pp. 52-59
    • Furui, S.1
  • 26
    • 0038000235
    • Parallel model combination for speech recognition in noise
    • CUED/F-INFENG/TR135
    • M.J.F. Gales and S.J. Young, "Parallel model combination for speech recognition in noise," Technical Report, CUED/F-INFENG/TR135, 1993.
    • (1993) Technical Report
    • Gales, M.J.F.1    Young, S.J.2
  • 27
    • 85009110670
    • Multistage coarticulation model combining articulatory, formant and cepstral features
    • Y. Gao, R. Bakis, J. Huang, and B. Zhang, "Multistage coarticulation model combining articulatory, formant and cepstral features," Proc. ICSLP, pp.91-94, 2000.
    • (2000) Proc. ICSLP , pp. 91-94
    • Gao, Y.1    Bakis, R.2    Huang, J.3    Zhang, B.4
  • 28
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol.2, no.2, pp.291-298, 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 29
    • 0001596920
    • Large vocabulary continuous speech recognition: Advances and applications
    • J.-L. Gauvain and L. Lamel, "Large vocabulary continuous speech recognition: Advances and applications," Proc. IEEE, vol.88, no.8, pp.1181-1200, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1181-1200
    • Gauvain, J.-L.1    Lamel, L.2
  • 30
    • 0023859986
    • Auditory nerve feedback as a basis for speech processing
    • O. Ghitza, "Auditory nerve feedback as a basis for speech processing," Proc. ICASSP-88, pp.91-94, 1988.
    • (1988) Proc. ICASSP-88 , pp. 91-94
    • Ghitza, O.1
  • 32
    • 0030784572
    • Stochastic trajectory modeling and sentence searching for continuous speech recognition
    • Y. Gong, "Stochastic trajectory modeling and sentence searching for continuous speech recognition," IEEE Trans. Speech Audio Process., vol.5, no.1, pp.33-44, 1997.
    • (1997) IEEE Trans. Speech Audio Process , vol.5 , Issue.1 , pp. 33-44
    • Gong, Y.1
  • 33
    • 0002076795
    • Insight into spoken language gleaned from phonetic transcription of the switchboard corpus
    • Philadelphia
    • S. Greenberg, J. Hollenback, and D. Ellis, "Insight into spoken language gleaned from phonetic transcription of the switchboard corpus," Proc. ICSLP-96, Philadelphia, 1996.
    • (1996) Proc. ICSLP-96
    • Greenberg, S.1    Hollenback, J.2    Ellis, D.3
  • 35
    • 0000250399
    • Semi-continuous hidden Markov models for speech signal
    • X. Huang and M.A. Jack, "Semi-continuous hidden Markov models for speech signal," Computer, Speech and Language, vol.3, pp.239-251, 1989.
    • (1989) Computer, Speech and Language , vol.3 , pp. 239-251
    • Huang, X.1    Jack, M.A.2
  • 36
    • 0031103160
    • On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate
    • Q. Huo and C.-H. Lee, "On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate," IEEE Trans. Speech Audio Process., vol.5, no.2, pp.161-172, 1997.
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.2 , pp. 161-172
    • Huo, Q.1    Lee, C.-H.2
  • 37
    • 0034831586
    • Robust speech recognition based on adaptive classification and decision strategies
    • Q. Huo and C.-H. Lee, "Robust speech recognition based on adaptive classification and decision strategies," Speech Communication, vol.34, nos.1-2, pp.175-194, 2001.
    • (2001) Speech Communication , vol.34 , Issue.1-2 , pp. 175-194
    • Huo, Q.1    Lee, C.-H.2
  • 38
    • 0027683813
    • Shared-distribution hidden Markov models for speech recognition
    • M. Hwang and X. Huang, "Shared-distribution hidden Markov models for speech recognition," IEEE Trans. Speech Audio Process., vol.1, no.3, pp.414-420, 1993.
    • (1993) IEEE Trans. Speech Audio Process. , vol.1 , Issue.3 , pp. 414-420
    • Hwang, M.1    Huang, X.2
  • 39
    • 0022150487
    • The development of an experimental discrete dictation recognizer
    • F. Jelinek, "The development of an experimental discrete dictation recognizer," Proc. IEEE, vol.73, no.10, pp.1616-1624, 1985.
    • (1985) Proc. IEEE , vol.73 , Issue.10 , pp. 1616-1624
    • Jelinek, F.1
  • 41
    • 0032685060
    • Robust speech recognition based on a Bayesian prediction approach
    • H. Jiang, K. Hirose, and Q. Huo, "Robust speech recognition based on a Bayesian prediction approach," IEEE Trans. Speech Audio Process., vol.7, no.4, pp.426-440, 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.4 , pp. 426-440
    • Jiang, H.1    Hirose, K.2    Huo, Q.3
  • 42
    • 0022097649
    • Maximum-likelihood estimation for mixture multivariate stochastic observations of Markov chains
    • B.-H. Juang, "Maximum-likelihood estimation for mixture multivariate stochastic observations of Markov chains," AT&T Tech. J., vol.64, 1985.
    • (1985) AT&T Tech. J. , vol.64
    • Juang, B.-H.1
  • 43
    • 0000763574
    • Automatic speech recognition and understanding: A first step toward natural human-machine communication
    • B.-H. Juang and S. Furui, "Automatic speech recognition and understanding: A first step toward natural human-machine communication," Proc. IEEE, vol.88, no.8, pp.1142-1165, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1142-1165
    • Juang, B.-H.1    Furui, S.2
  • 44
    • 0003071809
    • Evaluation and optimization of perceptually-based ASR Front-End
    • J.-C. Junqua, H. Wakita, and H. Hermansky, "Evaluation and optimization of perceptually-based ASR Front-End," IEEE Trans. Speech Audio Process., vol.1, no.1, pp.39-48, 1993.
    • (1993) IEEE Trans. Speech Audio Process. , vol.1 , Issue.1 , pp. 39-48
    • Junqua, J.-C.1    Wakita, H.2    Hermansky, H.3
  • 46
    • 0032203256
    • Pattern recognition using a generalized probabilistic descent method
    • S. Katagiri, B.-H. Juang, and C.-H. Lee, "Pattern recognition using a generalized probabilistic descent method," Proc. IEEE, vol.86, no.11, pp.2345-2373, 1998.
    • (1998) Proc. IEEE , vol.86 , Issue.11 , pp. 2345-2373
    • Katagiri, S.1    Juang, B.-H.2    Lee, C.-H.3
  • 47
    • 0032205629
    • Key-phrase detection and verification for flexible speech understanding
    • Nov.
    • T. Kawahara, C.-H. Lee, and B.-H. Juang, "Key-phrase detection and verification for flexible speech understanding," IEEE Trans. Speech Audio Process., vol.6, no.6, pp.558-568, Nov. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.6 , pp. 558-568
    • Kawahara, T.1    Lee, C.-H.2    Juang, B.-H.3
  • 48
    • 0017565919
    • Review of the ARPA speech understanding project
    • D. Klatt, "Review of the ARPA speech understanding project," J. Acoust. Soc. Am., vol.62, no.6, 1977.
    • (1977) J. Acoust. Soc. Am. , vol.62 , Issue.6
    • Klatt, D.1
  • 50
    • 0035509488
    • Speech recognition and utterance verification based on a generalized confidence score
    • Nov.
    • M.-W. Koo, C.-H. Lee, and B.-H. Juang, "Speech recognition and utterance verification based on a generalized confidence score," IEEE Trans. Speech Audio Process., vol.9, no.8, pp.821-832, Nov. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.8 , pp. 821-832
    • Koo, M.-W.1    Lee, C.-H.2    Juang, B.-H.3
  • 51
    • 0038676761
    • Towards knowledge-based features for HMM based large vocabulary automatic speech recognition
    • Orlando
    • B. Launay, O. Siohan, A.C. Surendran, and C.-H. Lee, "Towards knowledge-based features for HMM based large vocabulary automatic speech recognition," Proc. ICASSP-2002, Orlando, 2002.
    • (2002) Proc. ICASSP-2002
    • Launay, B.1    Siohan, O.2    Surendran, A.C.3    Lee, C.-H.4
  • 53
    • 0026142334
    • A study on speaker adaptation of the parameters of continuous density hidden Markov models
    • C.-H. Lee, C.-H. Lin, and B.-H. Juang, "A study on speaker adaptation of the parameters of continuous density hidden Markov models," IEEE Trans. Acoust., Speech Signal Process., vol.39, no.4, pp.806-814, 1991.
    • (1991) IEEE Trans. Acoust., Speech Signal Process. , vol.39 , Issue.4 , pp. 806-814
    • Lee, C.-H.1    Lin, C.-H.2    Juang, B.-H.3
  • 55
    • 0008520151
    • A unified statistical hypothesis testing approach to speaker verification and verbal information verification
    • Greece
    • C.-H. Lee, "A unified statistical hypothesis testing approach to speaker verification and verbal information verification," Proc. COST Workshop on Speech Technology in the Public Telephone Network, pp.62-73, Greece, 1997.
    • (1997) Proc. COST Workshop on Speech Technology in the Public Telephone Network , pp. 62-73
    • Lee, C.-H.1
  • 56
    • 0032140546
    • On stochastic feature and model compensation approaches to robust speech recognition
    • C.-H. Lee, "On stochastic feature and model compensation approaches to robust speech recognition," Speech Communication, vol.25, pp.29-47, 1998.
    • (1998) Speech Communication , vol.25 , pp. 29-47
    • Lee, C.-H.1
  • 57
    • 0037662475
    • A detection approach to flexible speech recognition and understanding
    • C.-H. Lee, "A detection approach to flexible speech recognition and understanding," 1998 Johns Hopkins University Summer Workshop, 1998.
    • (1998) 1998 Johns Hopkins University Summer Workshop
    • Lee, C.-H.1
  • 58
    • 0000159105
    • On adaptive decision rules and decision parameter adaptation for automatic speech recognition
    • C.-H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol.88, no.8, pp.1241-1269, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1241-1269
    • Lee, C.-H.1    Huo, Q.2
  • 59
    • 0038676657
    • Statistical confidence measures and their applications
    • C.-H. Lee, "Statistical confidence measures and their applications," Proc. ICSP-01, pp. 1021-1028, 2001.
    • (2001) Proc. ICSP-01 , pp. 1021-1028
    • Lee, C.-H.1
  • 61
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech and Language, vol.9, pp.171-185, 1995.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 62
    • 0022149626
    • Structural methods in automatic speech recognition
    • S.E. Levinson, "Structural methods in automatic speech recognition," Proc. IEEE, vol.73, no.10, pp.1625-1650, 1985.
    • (1985) Proc. IEEE , vol.73 , Issue.10 , pp. 1625-1650
    • Levinson, S.E.1
  • 63
    • 0031187171
    • Speech recognition by human and machines
    • R. Lippmann, "Speech recognition by human and machines," Speech Communication, vol.22, pp. 1-14, 1997.
    • (1997) Speech Communication , vol.22 , pp. 1-14
    • Lippmann, R.1
  • 64
    • 0038338178
    • Optimal speech recognition using phone recognition and lexical access
    • A. Ljolje and M.D. Riley, "Optimal speech recognition using phone recognition and lexical access," Proc. ICSLP-92, pp.313-316, 1992.
    • (1992) Proc. ICSLP-92 , pp. 313-316
    • Ljolje, A.1    Riley, M.D.2
  • 65
    • 0020180460
    • Maximum likelihood estimation for multivariate observations of Markov sources
    • L.R. Liporace, "Maximum likelihood estimation for multivariate observations of Markov sources," IEEE Trans. Inf. Theory, vol.28, no.5, pp.729-734, 1982.
    • (1982) IEEE Trans. Inf. Theory , vol.28 , Issue.5 , pp. 729-734
    • Liporace, L.R.1
  • 66
  • 67
    • 0002671953
    • A minimax classification approach with application to robust speech recognition
    • N. Merhav and C.-H. Lee, "A minimax classification approach with application to robust speech recognition," IEEE Trans. Speech Audio Process., vol.1, no.1, pp.90-100, 1993.
    • (1993) IEEE Trans. Speech Audio Process. , vol.1 , Issue.1 , pp. 90-100
    • Merhav, N.1    Lee, C.-H.2
  • 68
    • 0000635720
    • Progresses in dynamic programming search for LVCSR
    • H. Ney and S. Ortmanns, "Progresses in dynamic programming search for LVCSR," Proc. IEEE, vol.88, no.8, pp.1224-1240, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1224-1240
    • Ney, H.1    Ortmanns, S.2
  • 69
    • 0003731836
    • A detection framework for locating phonetic events
    • Sydney
    • P. Niyogi and P. Ramesh, "A detection framework for locating phonetic events," Proc. ICSLP-98, Sydney, 1998.
    • (1998) Proc. ICSLP-98
    • Niyogi, P.1    Ramesh, P.2
  • 70
    • 0026372945
    • An improved MMIE training algorithm for speaker-independent small vocabulary, continuous speech recognition
    • Y. Normandin and D. Morgera, "An improved MMIE training algorithm for speaker-independent small vocabulary, continuous speech recognition," Proc. ICASSP-91, pp.537-540, 1991.
    • (1991) Proc. ICASSP-91 , pp. 537-540
    • Normandin, Y.1    Morgera, D.2
  • 72
    • 0024900279
    • A stochastic segment model for phoneme-based continuous speech recognition
    • M. Ostendorf and S. Roukos, "A stochastic segment model for phoneme-based continuous speech recognition," IEEE Trans. Acoust., Speech Signal Process., vol.37, no.9, pp.1857-1869, 1989.
    • (1989) IEEE Trans. Acoust., Speech Signal Process. , vol.37 , Issue.9 , pp. 1857-1869
    • Ostendorf, M.1    Roukos, S.2
  • 73
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • L.R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol.77, no.2, pp.257-286, 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 75
    • 85128410011
    • The voice feature for stop consonants: Acoustic phonetic analysis and automatic speech recognition experiments
    • Sydney
    • P. Ramesh and P. Niyogi, "The voice feature for stop consonants: Acoustic phonetic analysis and automatic speech recognition experiments," Proc. ICSLP-98, Sydney, 1998.
    • (1998) Proc. ICSLP-98
    • Ramesh, P.1    Niyogi, P.2
  • 76
    • 0344611009
    • Speech recognition
    • Academic Press
    • R. Reddy, ed., Speech Recognition, Invited Papers Presented at the 1974 IEEE Symposium, Academic Press, 1974
    • (1974) 1974 IEEE Symposium
    • Reddy, R.1
  • 77
    • 0026405248
    • A statistical model for generating pronunciation networks
    • M.D. Riley, "A statistical model for generating pronunciation networks," Proc. ICASSP-91, pp.737-740, 1991.
    • (1991) Proc. ICASSP-91 , pp. 737-740
    • Riley, M.D.1
  • 78
    • 0028420014
    • Integrated models of speech and background with application to speaker identification in noise
    • R.C. Rose, E.M. Hofstetter, and D.A. Reynolds, "Integrated models of speech and background with application to speaker identification in noise," IEEE Trans. Speech Audio Process., vol.2, no.2, pp.245-257, 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 245-257
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 79
    • 33646907991
    • Two decades of statistical language modeling: Where do we go from here?
    • R. Rosenfeld, "Two decades of statistical language modeling: Where do we go from here?," Proc. IEEE, vol.88, no.8, pp.1279-1296, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1279-1296
    • Rosenfeld, R.1
  • 80
    • 0030149866
    • A maximum likelihood approach to stochastic matching for robust speech recognition
    • A. Sankar and C.-H. Lee, "A maximum likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Process., vol.4, no.3, pp.190-202, 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.3 , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 81
    • 0025627406
    • The N-best algorithm: An efficient and exact procedure for finding the N most likely sentence hypotheses
    • R. Schwartz and Y.-L. Chow, "The N-best algorithm: An efficient and exact procedure for finding the N most likely sentence hypotheses," Proc. ICASSP-90, pp.81-84, 1990.
    • (1990) Proc. ICASSP-90 , pp. 81-84
    • Schwartz, R.1    Chow, Y.-L.2
  • 82
    • 84928837806
    • A joint synchrony/mean-rate model of auditory speech processing
    • S. Seneff, "A joint synchrony/mean-rate model of auditory speech processing," J. Phonetics, vol.16, pp.55-76, 1988.
    • (1988) J. Phonetics , vol.16 , pp. 55-76
    • Seneff, S.1
  • 83
    • 0035279111
    • A structural Bayes approach to speaker adaptation
    • K. Shinoda and C.-H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol.9, no.3, pp.276-287, 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 276-287
    • Shinoda, K.1    Lee, C.-H.2
  • 84
    • 0035341086
    • Joint maximum a posteriori adaptation of transformation and HMM parameters
    • O. Siohan, C. Chesta, and C.-H. Lee, "Joint maximum a posteriori adaptation of transformation and HMM parameters," IEEE Trans. Speech Audio Process., vol.9, no.4, pp.417-428, 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.4 , pp. 417-428
    • Siohan, O.1    Chesta, C.2    Lee, C.-H.3
  • 85
    • 0026370988
    • A tree-trellis based fast search for finding the N-best sentence hypotheses in continuous speech recognition
    • F.K. Soong and E.F. Huang, "A tree-trellis based fast search for finding the N-best sentence hypotheses in continuous speech recognition," Proc. ICASSP-91, pp.703-706, 1991.
    • (1991) Proc. ICASSP-91 , pp. 703-706
    • Soong, F.K.1    Huang, E.F.2
  • 86
    • 0038338181
    • Special issue on robust speech recognition
    • "Special Issue on Robust Speech Recognition," Speech Communication, vol.25, nos.1-3, 1998.
    • (1998) Speech Communication , vol.25 , Issue.1-3
  • 88
    • 0030287341
    • Vocabulary independent discriminative utterance verification for non-keyword rejection in subword based speech recognition
    • R.A. Sukkar and C.-H. Lee, "Vocabulary independent discriminative utterance verification for non-keyword rejection in subword based speech recognition," IEEE Trans. Speech Audio Process., vol.4, no.6, pp.420-429, 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.6 , pp. 420-429
    • Sukkar, R.A.1    Lee, C.-H.2
  • 91
    • 85013744934
    • A successive state splitting algorithm for efficient allophone modeling
    • J. Takami and S. Sagayama, "A successive state splitting algorithm for efficient allophone modeling," Proc. ICASSP-92, pp.I-573-576, 1992.
    • (1992) Proc. ICASSP-92
    • Takami, J.1    Sagayama, S.2
  • 92
    • 0025681008
    • Hidden Markov model decomposition of speech and noise
    • A.P. Varga and R.K. Moore, "Hidden Markov model decomposition of speech and noise," Proc. ICASSP-90, pp.845-848, 1990.
    • (1990) Proc. ICASSP-90 , pp. 845-848
    • Varga, A.P.1    Moore, R.K.2
  • 93
    • 0012327341
    • Multilinguality in speech and spoken language systems
    • A. Waibel, P. Geutner, L.M. Tomokiyo, T. Schultz, and M. Woszczyna, "Multilinguality in speech and spoken language systems," Proc. IEEE, vol.88, no.8, pp.1166-1180, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1166-1180
    • Waibel, A.1    Geutner, P.2    Tomokiyo, L.M.3    Schultz, T.4    Woszczyna, M.5
  • 94
  • 97
    • 0020251572
    • Acoustic-phonetic knowledge representation: Implications from spectrograms reading experiments
    • Bonas, France
    • V.W. Zue, "Acoustic-phonetic knowledge representation: Implications from spectrograms reading experiments," Tutorial paper presented at the 1981 NATO ASI on Speech Recognition, Bonas, France, 1981.
    • (1981) 1981 NATO ASI on Speech Recognition
    • Zue, V.W.1
  • 99
    • 0038000234
    • Conversational interfaces: Advances and challenges
    • V.W. Zue and J.R. Glass, "Conversational interfaces: Advances and challenges," Proc. IEEE, vol.88, no.8, pp.1166-1180, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1166-1180
    • Zue, V.W.1    Glass, J.R.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.