Volume 17, Issue 2-3, 2003, Pages 137-152

A probabilistic framework for segment-based speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SIMULATION; DECODING; FEATURE EXTRACTION; GRAPH THEORY; MATRIX ALGEBRA; MAXIMUM LIKELIHOOD ESTIMATION; RANDOM PROCESSES; SPEECH ANALYSIS; VECTORS; WORD PROCESSING

EID: 0038359548     PISSN: 08852308     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0885-2308(03)00006-8     Document Type: Article
Times cited : (217)

References (43)
  • 1
    • 0037841331
    • Near-miss modeling: A segment-based approach to speech recognition
    • Ph.D. thesis, EECS, MIT
    • Chang, J., 1998. Near-miss modeling: a segment-based approach to speech recognition. Ph.D. thesis, EECS, MIT.
    • (1998)
    • Chang, J.
  • 3
    • 0019572151
    • Segmenting speech using dynamic programming
    • Cohen, J., 1981. Segmenting speech using dynamic programming. J. Acoust. Soc. Am. 69 (5), 1430-1438.
    • (1981) J. Acoust. Soc. Am., vol. 69, issue 5, pp. 1430-1438
    • Cohen, J.
  • 5
    • 0003938589
    • Segment-based stochastic models of spectral dynamics for continuous speech recognition
    • Ph.D. thesis, Boston University
    • Digalakis, V., 1992. Segment-based stochastic models of spectral dynamics for continuous speech recognition. Ph.D. thesis, Boston University.
    • (1992)
    • Digalakis, V.
  • 6
    • 0027681974
    • ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
    • Digalakis, V., Rohlicek, J., Ostendorf, M., 1993. ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition. IEEE Trans. Speech Audio Proc. 1 (4), 431-442.
    • (1993) IEEE Trans. Speech Audio Proc., vol. 1, issue 4, pp. 431-442
    • Digalakis, V.; Rohlicek, J.; Ostendorf, M.
  • 7
    • 0003548585
    • The DARPA TIMIT acoustic-phonetic continuous speech corpus CDROM
    • NTIS order number PB91-505065, October
    • Garofolo, J., Lamel, L., Fisher, W., Fiscus, J., Pallett, D., Dahlgren, N., 1990. The DARPA TIMIT acoustic-phonetic continuous speech corpus CDROM. NTIS order number PB91-505065, October.
    • (1990)
    • Garofolo, J.; Lamel, L.; Fisher, W.; Fiscus, J.; Pallett, D.; Dahlgren, N.
  • 8
    • 0008771262
    • Finding acoustic regularities in speech: Applications to phonetic recognition
    • Ph.D. thesis, EECS, MIT, May
    • Glass, J., 1988. Finding acoustic regularities in speech: applications to phonetic recognition. Ph.D. thesis, EECS, MIT, May.
    • (1988)
    • Glass, J.
  • 9
    • 0030372637
    • A probabilistic framework for feature-based speech recognition
    • October
    • Glass, J., Chang, J., McCandless, M., 1996. A probabilistic framework for feature-based speech recognition. In: Proc. ICSLP Philadelphia, PA, pp. 2277-2280, October.
    • (1996) Proc. ICSLP Philadelphia, PA, pp. 2277-2280
    • Glass, J.; Chang, J.; McCandless, M.
  • 10
    • 0032665631
    • Real-time telephone-based speech recognition in the Jupiter domain
    • March
    • Glass, J., Hazen, T., Hetherington, L., 1999. Real-time telephone-based speech recognition in the Jupiter domain. In: Proc. ICASSP Phoenix, AZ, pp. 61-64, March.
    • (1999) Proc. ICASSP Phoenix, AZ, pp. 61-64
    • Glass, J.; Hazen, T.; Hetherington, L.
  • 11
    • 0023776395
    • Multi-level acoustic segmentation of continuous speech
    • April
    • Glass, J., Zue, V., 1988. Multi-level acoustic segmentation of continuous speech. In: Proc. ICASSP, New York, NY, pp. 429-432, April.
    • (1988) Proc. ICASSP, New York, NY, pp. 429-432
    • Glass, J.; Zue, V.
  • 12
    • 0003877861
    • Heterogeneous acoustic measurements and multiple classifiers for speech recognition
    • Ph.D. thesis, EECS, MIT, November
    • Halberstadt, A., 1998. Heterogeneous acoustic measurements and multiple classifiers for speech recognition. Ph.D. thesis, EECS, MIT, November.
    • (1998)
    • Halberstadt, A.
  • 13
    • 85128407852
    • Heterogeneous measurements and multiple classifiers for speech recognition
    • Halberstadt, A., Glass, J., 1998. Heterogeneous measurements and multiple classifiers for speech recognition. In: Proc. ICSLP, Sydney, Australia, December, pp. 995-998.
    • (1998) Proc. ICSLP, Sydney, Australia, December, pp. 995-998
    • Halberstadt, A.; Glass, J.
  • 14
    • 84892140515
    • Using aggregation to improve the performance of mixture Gaussian acoustic models
    • Hazen, T., Halberstadt, A., 1998. Using aggregation to improve the performance of mixture Gaussian acoustic models. In: Proc. ICASSP, Seattle, WA, May, pp. 653-656.
    • (1998) Proc. ICASSP, Seattle, WA, May, pp. 653-656
    • Hazen, T.; Halberstadt, A.
  • 15
    • 0036460906
    • Recognition confidence scoring and its use in speech understanding systems
    • Hazen, T., Seneff, S., Polifroni, J., 2002. Recognition confidence scoring and its use in speech understanding systems. Comp. Speech Lang. 16, 49-67.
    • (2002) Comp. Speech Lang., vol. 16, pp. 49-67
    • Hazen, T.; Seneff, S.; Polifroni, J.
  • 16
    • 0037503680
    • An efficient implementation of phonological rules using finite-state transducers
    • Hetherington, L., 2001. An efficient implementation of phonological rules using finite-state transducers. In: Proc. Eurospeech, Aalborg, Denmark, September, pp. 1522-1609.
    • (2001) Proc. Eurospeech, Aalborg, Denmark, September, pp. 1522-1609
    • Hetherington, L.
  • 17
    • 0029750240
    • Modeling speech variability with segmental HMMs
    • Holmes, W., Russell, M., 1996. Modeling speech variability with segmental HMMs. In: Proc. ICASSP, Atlanta, GA, May, pp. 447-450.
    • (1996) Proc. ICASSP, Atlanta, GA, May, pp. 447-450
    • Holmes, W.; Russell, M.
  • 19
    • 0024768209
    • Speaker-independent phone recognition using hidden Markov models
    • Lee, K.F., Hon, H.W., 1989. Speaker-independent phone recognition using hidden Markov models. IEEE Trans. ASSP 37 (11), 1641-1648.
    • (1989) IEEE Trans. ASSP, vol. 37, issue 11, pp. 1641-1648
    • Lee, K.F.; Hon, H.W.
  • 20
    • 0346262152
    • Real-time probabilistic segmentation for segment-based speech recognition
    • Lee, S., Glass, J., 1998. Real-time probabilistic segmentation for segment-based speech recognition. In: Proc. ICSLP, Sydney, Australia, December, pp. 1803-1806.
    • (1998) Proc. ICSLP, Sydney, Australia, December, pp. 1803-1806
    • Lee, S.; Glass, J.
  • 21
    • 84871621979
    • Segment-based recognition on the PhoneBook task: Initial results and observations on duration modeling
    • Livescu, K., Glass, J., 2001. Segment-based recognition on the PhoneBook task: initial results and observations on duration modeling. In: Proc. Eurospeech Aalborg, Denmark, September, pp. 1437-1440.
    • (2001) Proc. Eurospeech Aalborg, Denmark, September, pp. 1437-1440
    • Livescu, K.; Glass, J.
  • 22
    • 0028404665
    • High accuracy phone recognition using context clustering and quasi-triphone models
    • Ljolje, A., 1994. High accuracy phone recognition using context clustering and quasi-triphone models. Comput. Speech Lang. 8 (2), 129-151.
    • (1994) Comput. Speech Lang., vol. 8, issue 2, pp. 129-151
    • Ljolje, A.
  • 23
    • 0027191575
    • Phonetic recognition in a segment-based HMM
    • Marcus, J., 1993. Phonetic recognition in a segment-based HMM. In: Proc. ICASSP, Minneapolis, MN, April, pp. 479-482.
    • (1993) Proc. ICASSP, Minneapolis, MN, April, pp. 479-482
    • Marcus, J.
  • 24
    • 0029770147
    • A second-order HMM for high performance word and phoneme-based continuous speech recognition
    • Mari, J.F., Fohr, D., Junqua, J.C., 1996. A second-order HMM for high performance word and phoneme-based continuous speech recognition. In: Proc. ICASSP, Atlanta, GA, May, pp. 435-438.
    • (1996) Proc. ICASSP, Atlanta, GA, May, pp. 435-438
    • Mari, J.F.; Fohr, D.; Junqua, J.C.
  • 25
    • 0031624622
    • Improved phone recognition using Bayesian triphone models
    • Ming, J., Smith, F., 1998. Improved phone recognition using Bayesian triphone models. In: Proc. ICASSP, Seattle, WA, May, pp. 409-412.
    • (1998) Proc. ICASSP, Seattle, WA, May, pp. 409-412
    • Ming, J.; Smith, F.
  • 26
    • 0030245363
    • From HMM's to segment models: A unified view of stochastic modelling for speech recognition
    • Ostendorf, M., Digalakis, V., Kimball, O., 1996. From HMM's to segment models: a unified view of stochastic modelling for speech recognition. IEEE Trans. Speech Audio Proc. 4 (5), 360-378.
    • (1996) IEEE Trans. Speech Audio Proc., vol. 4, issue 5, pp. 360-378
    • Ostendorf, M.; Digalakis, V.; Kimball, O.
  • 27
    • 0024900279
    • A stochastic segment model for phoneme-based continuous speech recognition
    • Ostendorf, M., Roucos, S., 1989. A stochastic segment model for phoneme-based continuous speech recognition. IEEE Trans. ASSP 37 (12), 1857-1869.
    • (1989) IEEE Trans. ASSP, vol. 37, issue 12, pp. 1857-1869
    • Ostendorf, M.; Roucos, S.
  • 28
    • 0038178893
    • Phonetic transition modelling for continuous speech recognition
    • Phillips, M., Glass, J., 1994. Phonetic transition modelling for continuous speech recognition. J. Acoust. Soc. Am. 95 (5), 2877.
    • (1994) J. Acoust. Soc. Am., vol. 95, issue 5, p. 2877
    • Phillips, M.; Glass, J.
  • 29
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner, L., 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77 (2), 257-286.
    • (1989) Proc. IEEE, vol. 77, issue 2, pp. 257-286
    • Rabiner, L.
  • 30
  • 31
    • 0028392167
    • An application of recurrent nets to phone probability estimation
    • Robinson, A., 1994. An application of recurrent nets to phone probability estimation. IEEE Trans. Neural Networks 5 (2), 298-305.
    • (1994) IEEE Trans. Neural Networks, vol. 5, issue 2, pp. 298-305
    • Robinson, A.
  • 34
    • 0023846644
    • Stochastic segment modelling using the estimate-maximize algorithm
    • Roucos, S., Ostendorf, M., Gish, H., Derr, A., 1988. Stochastic segment modelling using the Estimate-Maximize algorithm. In: Proc. ICASSP, New York, NY, pp. 127-130.
    • (1988) Proc. ICASSP, New York, NY, pp. 127-130
    • Roucos, S.; Ostendorf, M.; Gish, H.; Derr, A.
  • 35
    • 0027228741
    • A segmental HMM for speech pattern modelling
    • Russell, M., 1993. A segmental HMM for speech pattern modelling. In: Proc. ICASSP, Minneapolis, MN, pp. 499-502.
    • (1993) Proc. ICASSP, Minneapolis, MN, pp. 499-502
    • Russell, M.
  • 36
    • 0002220140
    • Applying phonetic knowledge to lexical access
    • Stevens, K., 1995. Applying phonetic knowledge to lexical access. In: Proc. Eurospeech, Madrid, Spain, pp. 3-11.
    • (1995) Proc. Eurospeech, Madrid, Spain, pp. 3-11
    • Stevens, K.
  • 38
  • 39
    • 0025517070
    • Automatic recognition of keywords in unconstrained speech using hidden Markov models
    • Wilpon, J., Rabiner, L., Lee, C.H., Goldman, E., 1990. Automatic recognition of keywords in unconstrained speech using hidden Markov models. IEEE Trans. ASSP 38 (11), 1870-1878.
    • (1990) IEEE Trans. ASSP, vol. 38, issue 11, pp. 1870-1878
    • Wilpon, J.; Rabiner, L.; Lee, C.H.; Goldman, E.
  • 40
    • 0028530231
    • State clustering in hidden Markov model-based continuous speech recognition
    • Young, S., Woodland, P., 1994. State clustering in hidden Markov model-based continuous speech recognition. Comput. Speech Lang. 8 (4), 369-383.
    • (1994) Comput. Speech Lang., vol. 8, issue 4, pp. 369-383
    • Young, S.; Woodland, P.
  • 41
    • 0033690878
    • On the use of variable frame rate analysis in speech recognition
    • Zhu, Q., Alwan, A., 2000. On the use of variable frame rate analysis in speech recognition. In: Proc. ICASSP, Istanbul, Turkey, June, pp. 1783-1786.
    • (2000) Proc. ICASSP, Istanbul, Turkey, June, pp. 1783-1786
    • Zhu, Q.; Alwan, A.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.