-
1
-
-
0037841331
-
Near-miss modeling: A segment-based approach to speech recognition
-
Ph.D. thesis, EECS, MIT
-
Chang, J., 1998. Near-miss modeling: a segment-based approach to speech recognition. Ph.D. thesis, EECS, MIT.
-
(1998)
-
-
Chang, J.1
-
2
-
-
84969173798
-
Segmentation and modeling in segment-based recognition
-
Chang, J., Glass, J., 1997. Segmentation and modeling in segment-based recognition. In: Proc. Eurospeech, Rhodes, Greece, October, pp. 1199-1202.
-
(1997)
Proc. Eurospeech, Rhodes, Greece, October
, pp. 1199-1202
-
-
Chang, J.1
Glass, J.2
-
3
-
-
0019572151
-
Segmenting speech using dynamic programming
-
Cohen, J., 1981. Segmenting speech using dynamic programming. J. Acoust. Soc. Am. 69 (5), 1430-1438.
-
(1981)
J. Acoust. Soc. Am.
, vol.69
, Issue.5
, pp. 1430-1438
-
-
Cohen, J.1
-
4
-
-
0020499888
-
Feature-based speaker-independent recognition of isolated letters
-
Cole, R., Stern, R., Phillips, M., Brill, S., Pilant, A., Specker, P., 1983. Feature-based speaker-independent recognition of isolated letters. In: Proc. ICASSP, Boston, MA, pp. 731-733.
-
(1983)
Proc. ICASSP, Boston, MA
, pp. 731-733
-
-
Cole, R.1
Stern, R.2
Phillips, M.3
Brill, S.4
Pilant, A.5
Specker, P.6
-
5
-
-
0003938589
-
Segment-based stochastic models of spectral dynamics for continuous speech recognition
-
Ph.D. thesis, Boston University
-
Digilakis, V., 1992. Segment-based stochastic models of spectral dynamics for continuous speech recognition. Ph.D. thesis, Boston University.
-
(1992)
-
-
Digilakis, V.1
-
6
-
-
0027681974
-
ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
-
Digilakis, V., Rohlicek, J., Ostendorf, M., 1993. ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition. IEEE Trans. Speech Audio Proc. 1 (4), 431-442.
-
(1993)
IEEE Trans. Speech Audio Proc.
, vol.1
, Issue.4
, pp. 431-442
-
-
Digilakis, V.1
Rohlicek, J.2
Ostendorf, M.3
-
7
-
-
0003548585
-
The DARPA TIMIT acoustic-phonetic continuous speech corpus CDROM
-
NTIS order number PB91-505065, October
-
Garofolo, J., Lamel, L., Fisher, W., Fiscus, J., Pallet, D., Dahlgren, N., 1990. The DARPA TIMIT acoustic-phonetic continuous speech corpus CDROM. NTIS order number PB91-505065, October.
-
(1990)
-
-
Garofolo, J.1
Lamel, L.2
Fisher, W.3
Fiscus, J.4
Pallet, D.5
Dahlgren, N.6
-
8
-
-
0008771262
-
Finding acoustic regularities in speech: Applications to phonetic recognition
-
Ph.D. thesis, EECS, MIT, May
-
Glass, J., 1988. Finding acoustic regularities in speech: applications to phonetic recognition. Ph.D. thesis, EECS, MIT, May.
-
(1988)
-
-
Glass, J.1
-
9
-
-
0030372637
-
A probabilistic framework for feature-based speech recognition
-
October
-
Glass, J., Chang, J., McCandless, M., 1996. A probabilistic framework for feature-based speech recognition. In: Proc. ICSLP Philadelphia, PA, pp. 2277-2280, October.
-
(1996)
Proc. ICSLP Philadelphia, PA
, pp. 2277-2280
-
-
Glass, J.1
Chang, J.2
McCandless, M.3
-
10
-
-
0032665631
-
Real-time telephone-based speech recognition in the Jupiter domain
-
March
-
Glass, J., Hazen, T., Hetherington, L., 1999. Real-time telephone-based speech recognition in the Jupiter domain. In: Proc. ICASSP Phoenix, AZ, pp. 61-64, March.
-
(1999)
Proc. ICASSP Phoenix, AZ
, pp. 61-64
-
-
Glass, J.1
Hazen, T.2
Hetherington, L.3
-
11
-
-
0023776395
-
Multi-level acoustic segmentation of continuous speech
-
April
-
Glass, J., Zue, V., 1988. Multi-level acoustic segmentation of continuous speech. In: Proc. ICASSP, New York, NY, pp. 429-432, April.
-
(1988)
Proc. ICASSP, New York, NY
, pp. 429-432
-
-
Glass, J.1
Zue, V.2
-
12
-
-
0003877861
-
Heterogeneous acoustic measurements and multiple classifiers for speech recognition
-
Ph.D. thesis, EECS, MIT, November
-
Halberstadt, A., 1998. Heterogeneous acoustic measurements and multiple classifiers for speech recognition. Ph.D. thesis, EECS, MIT, November.
-
(1998)
-
-
Halberstadt, A.1
-
13
-
-
85128407852
-
Heterogeneous measurements and multiple classifiers for speech recognition
-
Halberstadt, A., Glass, J., 1998. Heterogeneous measurements and multiple classifiers for speech recognition. In: Proc. ICSLP, Sydney, Australia, December, pp. 995-998.
-
(1998)
Proc. ICSLP, Sydney, Australia, December
, pp. 995-998
-
-
Halberstadt, A.1
Glass, J.2
-
14
-
-
84892140515
-
Using aggregation to improve the performance of mixture Gaussian acoustic models
-
Hazen, T., Halberstadt, A., 1998. Using aggregation to improve the performance of mixture Gaussian acoustic models. In: Proc. ICASSP, Seattle, WA, May, pp. 653-656.
-
(1998)
Proc. ICASSP, Seattle, WA, May
, pp. 653-656
-
-
Hazen, T.1
Halberstadt, A.2
-
15
-
-
0036460906
-
Recognition confidence scoring and its use in speech understanding systems
-
Hazen, T., Seneff, S., Polifroni, J., 2002. Recognition confidence scoring and its use in speech understanding systems. Comp. Speech Lang. 16, 49-67.
-
(2002)
Comp. Speech Lang.
, vol.16
, pp. 49-67
-
-
Hazen, T.1
Seneff, S.2
Polifroni, J.3
-
16
-
-
0037503680
-
An efficient implementation of phonological rules using finite-state transducers
-
Hetherington, L., 2001. An efficient implementation of phonological rules using finite-state transducers. In: Proc. Eurospeech, Aalborg, Denmark, September, pp. 1522-1609.
-
(2001)
Proc. Eurospeech, Aalborg, Denmark, September
, pp. 1522-1609
-
-
Hetherington, L.1
-
17
-
-
0029750240
-
Modeling speech variability with segmental HMMs
-
Holmes, W., Russell, M., 1996. Modeling speech variability with segmental HMMs. In: Proc. ICASSP, Atlanta, GA, May, pp. 447-450.
-
(1996)
Proc. ICASSP, Atlanta, GA, May
, pp. 447-450
-
-
Holmes, W.1
Russell, M.2
-
18
-
-
85135371588
-
High performance speaker-independent phone recognition using CDHMM
-
Lamel, L., Gauvain, J.L., 1993. High performance speaker-independent phone recognition using CDHMM. In: Proc. Eurospeech, Berlin, Germany, September, pp. 121-124.
-
(1993)
Proc. Eurospeech, Berlin, Germany, September
, pp. 121-124
-
-
Lamel, L.1
Gauvain, J.L.2
-
19
-
-
0024768209
-
Speaker-independent phone recognition using hidden Markov models
-
Lee, K.F., Hon, H.W., 1989. Speaker-independent phone recognition using hidden Markov models. IEEE Trans. ASSP 37 (11), 1641-1648.
-
(1989)
IEEE Trans. ASSP
, vol.37
, Issue.11
, pp. 1641-1648
-
-
Lee, K.F.1
Hon, H.W.2
-
20
-
-
0346262152
-
Real-time probabilistic segmentation for segment-based speech recognition
-
Lee, S., Glass, J., 1998. Real-time probabilistic segmentation for segment-based speech recognition. In: Proc. ICSLP, Sydney, Australia, December, pp. 1803-1806.
-
(1998)
Proc. ICSLP, Sydney, Australia, December
, pp. 1803-1806
-
-
Lee, S.1
Glass, J.2
-
21
-
-
84871621979
-
Segment-based recognition on the PhoneBook task: Initial results and observations on duration modeling
-
Livescu, K., Glass, J., 2001. Segment-based recognition on the PhoneBook task: initial results and observations on duration modeling. In: Proc. Eurospeech Aalborg, Denmark, September, pp. 1437-1440.
-
(2001)
Proc. Eurospeech Aalborg, Denmark, September
, pp. 1437-1440
-
-
Livescu, K.1
Glass, J.2
-
22
-
-
0028404665
-
High accuracy phone recognition using context clustering and quasi-triphone models
-
Ljolje, A., 1994. High accuracy phone recognition using context clustering and quasi-triphone models. Comput. Speech Lang. 8 (2), 129-151.
-
(1994)
Comput. Speech Lang.
, vol.8
, Issue.2
, pp. 129-151
-
-
Ljolje, A.1
-
23
-
-
0027191575
-
Phonetic recognition in a segment-based HMM
-
Marcus, J., 1993. Phonetic recognition in a segment-based HMM. In: Proc. ICASSP, Minneapolis, MN, April, pp. 479-482.
-
(1993)
Proc. ICASSP, Minneapolis, MN, April
, pp. 479-482
-
-
Marcus, J.1
-
24
-
-
0029770147
-
A second-order HMM for high performance word and phoneme-based continuous speech recognition
-
Mari, J.F., Fohr, D., Junqua, J.C., 1996. A second-order HMM for high performance word and phoneme-based continuous speech recognition. In: Proc. ICASSP, Atlanta, GA, May, pp. 435-438.
-
(1996)
Proc. ICASSP, Atlanta, GA, May
, pp. 435-438
-
-
Mari, J.F.1
Fohr, D.2
Junqua, J.C.3
-
25
-
-
0031624622
-
Improved phone recognition using Bayesian triphone models
-
Ming, J., Smith, F., 1998. Improved phone recognition using Bayesian triphone models. In: Proc. ICASSP, Seattle, WA, May, pp. 409-412.
-
(1998)
Proc. ICASSP, Seattle, WA, May
, pp. 409-412
-
-
Ming, J.1
Smith, F.2
-
26
-
-
0030245363
-
From HMM's to segment models: A unified view of stochastic modelling for speech recognition
-
Ostendorf, M., Digilakis, V., Kimball, O., 1996. From HMM's to segment models: a unified view of stochastic modelling for speech recognition. IEEE Trans. Speech Audio Proc. 4 (5), 360-378.
-
(1996)
IEEE Trans. Speech Audio Proc.
, vol.4
, Issue.5
, pp. 360-378
-
-
Ostendorf, M.1
Digilakis, V.2
Kimball, O.3
-
27
-
-
0024900279
-
A stochastic segment model for phoneme-based continuous speech recognition
-
Ostendorf, M., Roucos, S., 1989. A stochastic segment model for phoneme-based continuous speech recognition. IEEE Trans. ASSP 37 (12), 1857-1869.
-
(1989)
IEEE Trans. ASSP
, vol.37
, Issue.12
, pp. 1857-1869
-
-
Ostendorf, M.1
Roucos, S.2
-
28
-
-
0038178893
-
Phonetic transition modelling for continuous speech recognition
-
Phillips, M., Glass, J., 1994. Phonetic transition modelling for continuous speech recognition. J. Acoust. Soc. Am. 95 (5), 2877.
-
(1994)
J. Acoust. Soc. Am.
, vol.95
, Issue.5
, pp. 2877
-
-
Phillips, M.1
Glass, J.2
-
29
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
Rabiner, L., 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77 (2), 257-286.
-
(1989)
Proc. IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.1
-
30
-
-
0037503679
-
Lexical access with a statistically-derived phonetic network
-
Riley, M., Ljolje, A., 1991. Lexical access with a statistically-derived phonetic network. In: Proc. Eurospeech Genoa, Italy, September, pp. 585-585.
-
(1991)
Proc. Eurospeech Genoa, Italy, September
, pp. 585
-
-
Riley, M.1
Ljolje, A.2
-
31
-
-
0028392167
-
An application of recurrent nets to phone probability estimation
-
Robinson, A., 1994. An application of recurrent nets to phone probability estimation. IEEE Trans. Neural Networks 5 (2), 298-305.
-
(1994)
IEEE Trans. Neural Networks
, vol.5
, Issue.2
, pp. 298-305
-
-
Robinson, A.1
-
32
-
-
85079097438
-
IPA: Improved phone modelling with recurrent neural networks
-
Robinson, T., Hochberg, M., Renals, S., 1994. IPA: improved phone modelling with recurrent neural networks. In: Proc. ICASSP, Adelaide, Australia, April, pp. 37-40.
-
(1994)
Proc. ICASSP, Adelaide, Australia, April
, pp. 37-40
-
-
Robinson, T.1
Hochberg, M.2
Renals, S.3
-
33
-
-
0024905253
-
Continuous hidden Markov modelling for speaker-independent word spotting
-
Rohlicek, J., Russell, W., Roucos, S., Gish, H., 1989. Continuous hidden Markov modelling for speaker-independent word spotting. In: Proc. ICASSP, Glasgow, Scotland, May, pp. 627-630.
-
(1989)
Proc. ICASSP, Glasgow, Scotland, May
, pp. 627-630
-
-
Rohlicek, J.1
Russell, W.2
Roucos, S.3
Gish, H.4
-
34
-
-
0023846644
-
Stochastic segment modelling using the estimate-maximize algorithm
-
Roucos, S., Ostendorf, M., Gish, H., Derr, A., 1988. Stochastic segment modelling using the Estimate-Maximize algorithm. In: Proc. ICASSP, New York, NY, pp. 127-130.
-
(1988)
Proc. ICASSP, New York, NY
, pp. 127-130
-
-
Roucos, S.1
Ostendorf, M.2
Gish, H.3
Derr, A.4
-
35
-
-
0027228741
-
A segmental HMM for speech pattern modelling
-
Russell, M., 1993. A segmental HMM for speech pattern modelling. In: Proc. ICASSP, Minneapolis, MN, pp. 499-502.
-
(1993)
Proc. ICASSP, Minneapolis, MN
, pp. 499-502
-
-
Russell, M.1
-
36
-
-
0002220140
-
Applying phonetic knowledge to lexical access
-
Stevens, K., 1995. Applying phonetic knowledge to lexical access. In: Proc. Eurospeech, Madrid, Spain, pp. 3-11.
-
(1995)
Proc. Eurospeech, Madrid, Spain
, pp. 3-11
-
-
Stevens, K.1
-
37
-
-
14944356145
-
Acoustic modelling improvements in a segment-based speech recognizer
-
Ström, N., Hetherington, L., Hazen, T., Sandness, E., Glass, J., 1999. Acoustic modelling improvements in a segment-based speech recognizer. In: Proc. IEEE Automatic Speech Recognition and Understanding Workshop, Keystone, CO, December, pp. 139-142.
-
(1999)
Proc. IEEE Automatic Speech Recognition and Understanding Workshop, Keystone, CO, December
, pp. 139-142
-
-
Ström, N.1
Hetherington, L.2
Hazen, T.3
Sandness, E.4
Glass, J.5
-
38
-
-
0016469280
-
A system for acoustic-phonetic analysis of continuous speech
-
Weinstein, C., McCandless, S., Mondshein, L., Zue, V., 1975. A system for acoustic-phonetic analysis of continuous speech. IEEE Trans. ASSP 23, 54-67.
-
(1975)
IEEE Trans. ASSP
, vol.23
, pp. 54-67
-
-
Weinstein, C.1
McCandless, S.2
Mondshein, L.3
Zue, V.4
-
39
-
-
0025517070
-
Automatic recognition of keywords in unconstrained speech using hidden Markov models
-
Wilpon, J., Rabiner, L., Lee, C.H., Goldman, E., 1990. Automatic recognition of keywords in unconstrained speech using hidden Markov models. IEEE Trans. ASSP 38 (11), 1870-1878.
-
(1990)
IEEE Trans. ASSP
, vol.38
, Issue.11
, pp. 1870-1878
-
-
Wilpon, J.1
Rabiner, L.2
Lee, C.H.3
Goldman, E.4
-
40
-
-
0028530231
-
State clustering in hidden Markov model-based continuous speech recognition
-
Young, S., Woodland, P., 1994. State clustering in hidden Markov model-based continuous speech recognition. Comput. Speech Lang. 8 (4), 369-383.
-
(1994)
Comput. Speech Lang.
, vol.8
, Issue.4
, pp. 369-383
-
-
Young, S.1
Woodland, P.2
-
41
-
-
0033690878
-
On the use of variable frame rate analysis in speech recognition
-
Zhu, Q., Alwan, A., 2000. On the use of variable frame rate analysis in speech recognition. In: Proc. ICASSP, Istanbul, Turkey, June, pp. 1783-1786.
-
(2000)
Proc. ICASSP, Istanbul, Turkey, June
, pp. 1783-1786
-
-
Zhu, Q.1
Alwan, A.2
-
42
-
-
85121123643
-
The MIT summit speech recognition system: A progress report
-
Zue, V., Glass, J., Phillips, M., Seneff, S., 1989. The MIT Summit speech recognition system: a progress report. In: Proc. Speech and Natural Language Workshop, Philadelphia, PA, February, pp. 179-189.
-
(1989)
Proc. Speech and Natural Language Workshop, Philadelphia, PA, February
, pp. 179-189
-
-
Zue, V.1
Glass, J.2
Phillips, M.3
Seneff, S.4
-
43
-
-
0033878021
-
Jupiter: A telephone-based conversational interface for weather information
-
Zue, V., Seneff, S., Glass, J., Polifroni, J., Pao, C., Hazen, T., Hetherington, L., 2000. Jupiter: a telephone-based conversational interface for weather information. IEEE Trans. Speech Audio Proc. 8 (1), 85-96.
-
(2000)
IEEE Trans. Speech Audio Proc.
, vol.8
, Issue.1
, pp. 85-96
-
-
Zue, V.1
Seneff, S.2
Glass, J.3
Polifroni, J.4
Pao, C.5
Hazen, T.6
Hetherington, L.7
|