SCOPUS Information Search Platform

Computer Speech and Language

Volume 17, Issue 2-3, 2003, Pages 137-152

A probabilistic framework for segment-based speech recognition

(1) Glass, James R a

a MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SIMULATION; DECODING; FEATURE EXTRACTION; GRAPH THEORY; MATRIX ALGEBRA; MAXIMUM LIKELIHOOD ESTIMATION; RANDOM PROCESSES; SPEECH ANALYSIS; VECTORS; WORD PROCESSING;

ACOUSTIC LIKELIHOOD ESTIMATION; SEGMENT BASED SPEECH RECOGNITION; WORD RECOGNITION;

SPEECH RECOGNITION;

EID: 0038359548 PISSN: 08852308 EISSN: None Source Type: Journal
DOI: 10.1016/S0885-2308(03)00006-8 Document Type: Article

Times cited : (217)

References (43)

1
- 0037841331
- Near-miss modeling: A segment-based approach to speech recognition
- Ph.D. thesis, EECS, MIT
- Chang, J., 1998. Near-miss modeling: a segment-based approach to speech recognition. Ph.D. thesis, EECS, MIT.
- (1998)
- Chang, J.¹

2
- 84969173798
- Segmentation and modeling in segment-based recognition
- Chang, J., Glass, J., 1997. Segmentation and modeling in segment-based recognition. In: Proc. Eurospeech, Rhodes, Greece, October, pp. 1199-1202.
- (1997) Proc. Eurospeech, Rhodes, Greece, October , pp. 1199-1202
- Chang, J.¹ Glass, J.²

3
- 0019572151
- Segmenting speech using dynamic programming
- Cohen, J., 1981. Segmenting speech using dynamic programming. J. Acoust. Soc. Am. 69 (5), 1430-1438.
- (1981) J. Acoust. Soc. Am. , vol.69 , Issue.5 , pp. 1430-1438
- Cohen, J.¹

4
- 0020499888
- Feature-based speaker-independent recognition of isolated letters
- Cole, R., Stern, R., Phillips, M., Brill, S., Pilant, A., Specker, P., 1983. Feature-based speaker-independent recognition of isolated letters. In: Proc. ICASSP, Boston, MA, pp. 731-733.
- (1983) Proc. ICASSP, Boston, MA , pp. 731-733
- Cole, R.¹ Stern, R.² Phillips, M.³ Brill, S.⁴ Pilant, A.⁵ Specker, P.⁶

5
- 0003938589
- Segment-based stochastic models of spectral dynamics for continuous speech recognition
- Ph.D. thesis, Boston University
- Digalakis, V., 1992. Segment-based stochastic models of spectral dynamics for continuous speech recognition. Ph.D. thesis, Boston University.
- (1992)
- Digalakis, V.¹

6
- 0027681974
- ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
- Digalakis, V., Rohlicek, J., Ostendorf, M., 1993. ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition. IEEE Trans. Speech Audio Proc. 1 (4), 431-442.
- (1993) IEEE Trans. Speech Audio Proc. , vol.1 , Issue.4 , pp. 431-442
- Digalakis, V.¹ Rohlicek, J.² Ostendorf, M.³

7
- 0003548585
- The DARPA TIMIT acoustic-phonetic continuous speech corpus CDROM
- NTIS order number PB91-505065, October
- Garofolo, J., Lamel, L., Fisher, W., Fiscus, J., Pallett, D., Dahlgren, N., 1990. The DARPA TIMIT acoustic-phonetic continuous speech corpus CDROM. NTIS order number PB91-505065, October.
- (1990)
- Garofolo, J.¹ Lamel, L.² Fisher, W.³ Fiscus, J.⁴ Pallett, D.⁵ Dahlgren, N.⁶

8
- 0008771262
- Finding acoustic regularities in speech: Applications to phonetic recognition
- Ph.D. thesis, EECS, MIT, May
- Glass, J., 1988. Finding acoustic regularities in speech: applications to phonetic recognition. Ph.D. thesis, EECS, MIT, May.
- (1988)
- Glass, J.¹

9
- 0030372637
- A probabilistic framework for feature-based speech recognition
- October
- Glass, J., Chang, J., McCandless, M., 1996. A probabilistic framework for feature-based speech recognition. In: Proc. ICSLP Philadelphia, PA, pp. 2277-2280, October.
- (1996) Proc. ICSLP Philadelphia, PA , pp. 2277-2280
- Glass, J.¹ Chang, J.² McCandless, M.³

10
- 0032665631
- Real-time telephone-based speech recognition in the Jupiter domain
- March
- Glass, J., Hazen, T., Hetherington, L., 1999. Real-time telephone-based speech recognition in the Jupiter domain. In: Proc. ICASSP Phoenix, AZ, pp. 61-64, March.
- (1999) Proc. ICASSP Phoenix, AZ , pp. 61-64
- Glass, J.¹ Hazen, T.² Hetherington, L.³

11
- 0023776395
- Multi-level acoustic segmentation of continuous speech
- April
- Glass, J., Zue, V., 1988. Multi-level acoustic segmentation of continuous speech. In: Proc. ICASSP, New York, NY, pp. 429-432, April.
- (1988) Proc. ICASSP, New York, NY , pp. 429-432
- Glass, J.¹ Zue, V.²

12
- 0003877861
- Heterogeneous acoustic measurements and multiple classifiers for speech recognition
- Ph.D. thesis, EECS, MIT, November
- Halberstadt, A., 1998. Heterogeneous acoustic measurements and multiple classifiers for speech recognition. Ph.D. thesis, EECS, MIT, November.
- (1998)
- Halberstadt, A.¹

13
- 85128407852
- Heterogeneous measurements and multiple classifiers for speech recognition
- Halberstadt, A., Glass, J., 1998. Heterogeneous measurements and multiple classifiers for speech recognition. In: Proc. ICSLP, Sydney, Australia, December, pp. 995-998.
- (1998) Proc. ICSLP, Sydney, Australia, December , pp. 995-998
- Halberstadt, A.¹ Glass, J.²

14
- 84892140515
- Using aggregation to improve the performance of mixture Gaussian acoustic models
- Hazen, T., Halberstadt, A., 1998. Using aggregation to improve the performance of mixture Gaussian acoustic models. In: Proc. ICASSP, Seattle, WA, May, pp. 653-656.
- (1998) Proc. ICASSP, Seattle, WA, May , pp. 653-656
- Hazen, T.¹ Halberstadt, A.²

15
- 0036460906
- Recognition confidence scoring and its use in speech understanding systems
- Hazen, T., Seneff, S., Polifroni, J., 2002. Recognition confidence scoring and its use in speech understanding systems. Comp. Speech Lang. 16, 49-67.
- (2002) Comp. Speech Lang. , vol.16 , pp. 49-67
- Hazen, T.¹ Seneff, S.² Polifroni, J.³

16
- 0037503680
- An efficient implementation of phonological rules using finite-state transducers
- Hetherington, L., 2001. An efficient implementation of phonological rules using finite-state transducers. In: Proc. Eurospeech, Aalborg, Denmark, September, pp. 1522-1609.
- (2001) Proc. Eurospeech, Aalborg, Denmark, September , pp. 1522-1609
- Hetherington, L.¹

17
- 0029750240
- Modeling speech variability with segmental HMMs
- Holmes, W., Russell, M., 1996. Modeling speech variability with segmental HMMs. In: Proc. ICASSP, Atlanta, GA, May, pp. 447-450.
- (1996) Proc. ICASSP, Atlanta, GA, May , pp. 447-450
- Holmes, W.¹ Russell, M.²

18
- 85135371588
- High performance speaker-independent phone recognition using CDHMM
- Lamel, L., Gauvain, J.L., 1993. High performance speaker-independent phone recognition using CDHMM. In: Proc. Eurospeech, Berlin, Germany, September, pp. 121-124.
- (1993) Proc. Eurospeech, Berlin, Germany, September , pp. 121-124
- Lamel, L.¹ Gauvain, J.L.²

19
- 0024768209
- Speaker-independent phone recognition using hidden Markov models
- Lee, K.F., Hon, H.W., 1989. Speaker-independent phone recognition using hidden Markov models. IEEE Trans. ASSP 37 (11), 1641-1648.
- (1989) IEEE Trans. ASSP , vol.37 , Issue.11 , pp. 1641-1648
- Lee, K.F.¹ Hon, H.W.²

20
- 0346262152
- Real-time probabilistic segmentation for segment-based speech recognition
- Lee, S., Glass, J., 1998. Real-time probabilistic segmentation for segment-based speech recognition. In: Proc. ICSLP, Sydney, Australia, December, pp. 1803-1806.
- (1998) Proc. ICSLP, Sydney, Australia, December , pp. 1803-1806
- Lee, S.¹ Glass, J.²

21
- 84871621979
- Segment-based recognition on the PhoneBook task: Initial results and observations on duration modeling
- Livescu, K., Glass, J., 2001. Segment-based recognition on the PhoneBook task: initial results and observations on duration modeling. In: Proc. Eurospeech Aalborg, Denmark, September, pp. 1437-1440.
- (2001) Proc. Eurospeech Aalborg, Denmark, September , pp. 1437-1440
- Livescu, K.¹ Glass, J.²

22
- 0028404665
- High accuracy phone recognition using context clustering and quasi-triphone models
- Ljolje, A., 1994. High accuracy phone recognition using context clustering and quasi-triphone models. Comput. Speech Lang. 8 (2), 129-151.
- (1994) Comput. Speech Lang. , vol.8 , Issue.2 , pp. 129-151
- Ljolje, A.¹

23
- 0027191575
- Phonetic recognition in a segment-based HMM
- Marcus, J., 1993. Phonetic recognition in a segment-based HMM. In: Proc. ICASSP, Minneapolis, MN, April, pp. 479-482.
- (1993) Proc. ICASSP, Minneapolis, MN, April , pp. 479-482
- Marcus, J.¹

24
- 0029770147
- A second-order HMM for high performance word and phoneme-based continuous speech recognition
- Mari, J.F., Fohr, D., Junqua, J.C., 1996. A second-order HMM for high performance word and phoneme-based continuous speech recognition. In: Proc. ICASSP, Atlanta, GA, May, pp. 435-438.
- (1996) Proc. ICASSP, Atlanta, GA, May , pp. 435-438
- Mari, J.F.¹ Fohr, D.² Junqua, J.C.³

25
- 0031624622
- Improved phone recognition using Bayesian triphone models
- Ming, J., Smith, F., 1998. Improved phone recognition using Bayesian triphone models. In: Proc. ICASSP, Seattle, WA, May, pp. 409-412.
- (1998) Proc. ICASSP, Seattle, WA, May , pp. 409-412
- Ming, J.¹ Smith, F.²

26
- 0030245363
- From HMM's to segment models: A unified view of stochastic modelling for speech recognition
- Ostendorf, M., Digalakis, V., Kimball, O., 1996. From HMM's to segment models: a unified view of stochastic modelling for speech recognition. IEEE Trans. Speech Audio Proc. 4 (5), 360-378.
- (1996) IEEE Trans. Speech Audio Proc. , vol.4 , Issue.5 , pp. 360-378
- Ostendorf, M.¹ Digalakis, V.² Kimball, O.³

27
- 0024900279
- A stochastic segment model for phoneme-based continuous speech recognition
- Ostendorf, M., Roucos, S., 1989. A stochastic segment model for phoneme-based continuous speech recognition. IEEE Trans. ASSP 37 (12), 1857-1869.
- (1989) IEEE Trans. ASSP , vol.37 , Issue.12 , pp. 1857-1869
- Ostendorf, M.¹ Roucos, S.²

28
- 0038178893
- Phonetic transition modelling for continuous speech recognition
- Phillips, M., Glass, J., 1994. Phonetic transition modelling for continuous speech recognition. J. Acoust. Soc. Am. 95 (5), 2877.
- (1994) J. Acoust. Soc. Am. , vol.95 , Issue.5 , pp. 2877
- Phillips, M.¹ Glass, J.²

29
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Rabiner, L., 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77 (2), 257-286.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.¹

30
- 0037503679
- Lexical access with a statistically-derived phonetic network
- Riley, M., Ljolje, A., 1991. Lexical access with a statistically-derived phonetic network. In: Proc. Eurospeech Genoa, Italy, September, pp. 585-585.
- (1991) Proc. Eurospeech Genoa, Italy, September , pp. 585
- Riley, M.¹ Ljolje, A.²

31
- 0028392167
- An application of recurrent nets to phone probability estimation
- Robinson, A., 1994. An application of recurrent nets to phone probability estimation. IEEE Trans. Neural Networks 5 (2), 298-305.
- (1994) IEEE Trans. Neural Networks , vol.5 , Issue.2 , pp. 298-305
- Robinson, A.¹

32
- 85079097438
- IPA: Improved phone modelling with recurrent neural networks
- Robinson, T., Hochberg, M., Renals, S., 1994. IPA: improved phone modelling with recurrent neural networks. In: Proc. ICASSP, Adelaide, Australia, April, pp. 37-40.
- (1994) Proc. ICASSP, Adelaide, Australia, April , pp. 37-40
- Robinson, T.¹ Hochberg, M.² Renals, S.³

33
- 0024905253
- Continuous hidden Markov modelling for speaker-independent word spotting
- Rohlicek, J., Russell, W., Roucos, S., Gish, H., 1989. Continuous hidden Markov modelling for speaker-independent word spotting. In: Proc. ICASSP, Glasgow, Scotland, May, pp. 627-630.
- (1989) Proc. ICASSP, Glasgow, Scotland, May , pp. 627-630
- Rohlicek, J.¹ Russell, W.² Roucos, S.³ Gish, H.⁴

34
- 0023846644
- Stochastic segment modelling using the estimate-maximize algorithm
- Roucos, S., Ostendorf, M., Gish, H., Derr, A., 1988. Stochastic segment modelling using the Estimate-Maximize algorithm. In: Proc. ICASSP, New York, NY, pp. 127-130.
- (1988) Proc. ICASSP, New York, NY , pp. 127-130
- Roucos, S.¹ Ostendorf, M.² Gish, H.³ Derr, A.⁴

35
- 0027228741
- A segmental HMM for speech pattern modelling
- Russell, M., 1993. A segmental HMM for speech pattern modelling. In: Proc. ICASSP, Minneapolis, MN, pp. 499-502.
- (1993) Proc. ICASSP, Minneapolis, MN , pp. 499-502
- Russell, M.¹

36
- 0002220140
- Applying phonetic knowledge to lexical access
- Stevens, K., 1995. Applying phonetic knowledge to lexical access. In: Proc. Eurospeech, Madrid, Spain, pp. 3-11.
- (1995) Proc. Eurospeech, Madrid, Spain , pp. 3-11
- Stevens, K.¹

37
- 14944356145
- Acoustic modelling improvements in a segment-based speech recognizer
- Ström, N., Hetherington, L., Hazen, T., Sandness, E., Glass, J., 1999. Acoustic modelling improvements in a segment-based speech recognizer. In: Proc. IEEE Automatic Speech Recognition and Understanding Workshop, Keystone, CO, December, pp. 139-142.
- (1999) Proc. IEEE Automatic Speech Recognition and Understanding Workshop, Keystone, CO, December , pp. 139-142
- Ström, N.¹ Hetherington, L.² Hazen, T.³ Sandness, E.⁴ Glass, J.⁵

38
- 0016469280
- A system for acoustic-phonetic analysis of continuous speech
- Weinstein, C., McCandless, S., Mondshein, L., Zue, V., 1975. A system for acoustic-phonetic analysis of continuous speech. IEEE Trans. ASSP 23, 54-67.
- (1975) IEEE Trans. ASSP , vol.23 , pp. 54-67
- Weinstein, C.¹ McCandless, S.² Mondshein, L.³ Zue, V.⁴

39
- 0025517070
- Automatic recognition of keywords in unconstrained speech using hidden Markov models
- Wilpon, J., Rabiner, L., Lee, C.H., Goldman, E., 1990. Automatic recognition of keywords in unconstrained speech using hidden Markov models. IEEE Trans. ASSP 38 (11), 1870-1878.
- (1990) IEEE Trans. ASSP , vol.38 , Issue.11 , pp. 1870-1878
- Wilpon, J.¹ Rabiner, L.² Lee, C.H.³ Goldman, E.⁴

40
- 0028530231
- State clustering in hidden Markov model-based continuous speech recognition
- Young, S., Woodland, P., 1994. State clustering in hidden Markov model-based continuous speech recognition. Comput. Speech Lang. 8 (4), 369-383.
- (1994) Comput. Speech Lang. , vol.8 , Issue.4 , pp. 369-383
- Young, S.¹ Woodland, P.²

41
- 0033690878
- On the use of variable frame rate analysis in speech recognition
- Zhu, Q., Alwan, A., 2000. On the use of variable frame rate analysis in speech recognition. In: Proc. ICASSP, Istanbul, Turkey, June, pp. 1783-1786.
- (2000) Proc. ICASSP, Istanbul, Turkey, June , pp. 1783-1786
- Zhu, Q.¹ Alwan, A.²

42
- 85121123643
- The MIT summit speech recognition system: A progress report
- Zue, V., Glass, J., Phillips, M., Seneff, S., 1989. The MIT Summit speech recognition system: a progress report. In: Proc. Speech and Natural Language Workshop, Philadelphia, PA, February, pp. 179-189.
- (1989) Proc. Speech and Natural Language Workshop, Philadelphia, PA, February , pp. 179-189
- Zue, V.¹ Glass, J.² Phillips, M.³ Seneff, S.⁴

43
- 0033878021
- Jupiter: A telephone-based conversational interface for weather information
- Zue, V., Seneff, S., Glass, J., Polifroni, J., Pao, C., Hazen, T., Hetherington, L., 2000. Jupiter: a telephone-based conversational interface for weather information. IEEE Trans. Speech Audio Proc. 8 (1), 85-96.
- (2000) IEEE Trans. Speech Audio Proc. , vol.8 , Issue.1 , pp. 85-96
- Zue, V.¹ Seneff, S.² Glass, J.³ Polifroni, J.⁴ Pao, C.⁵ Hazen, T.⁶ Hetherington, L.⁷

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.