-
1
-
-
0025629882
-
Tied mixture continuous parameter modeling for speech recognition
-
Bellegarda, J. R. & Nahamoo, D. (1990). Tied mixture continuous parameter modeling for speech recognition. IEEE Transactions on Acoustics, Speech and Signal Processing, 38, 2033-2045.
-
(1990)
IEEE Transactions on Acoustics, Speech and Signal Processing
, vol.38
, pp. 2033-2045
-
-
Bellegarda, J.R.1
Nahamoo, D.2
-
2
-
-
0343367210
-
Phonological studies for speech recognition
-
Bernstein, J., Baldwin, G., Cohen, M., Murveit, H. & Weintraub, M. (1986). Phonological studies for speech recognition. In DARPA Speech Recognition Workshop, pp. 41-48.
-
(1986)
DARPA Speech Recognition Workshop
, pp. 41-48
-
-
Bernstein, J.1
Baldwin, G.2
Cohen, M.3
Murveit, H.4
Weintraub, M.5
-
3
-
-
0030637976
-
Pronunciation modelling for conversational speech recognition: A status report from WS97
-
Santa Barbara, CA, USA
-
Byrne, W., Finke, M., Khudanpur, S., McDonough, J., Nock, H., Riley, M., Saraclar, M., Wooters, C. & Zavaliagkos, G. (1997). Pronunciation modelling for conversational speech recognition: A status report from WS97. In IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings (ASRU), Santa Barbara, CA, USA, pp. 26-33.
-
(1997)
IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings (ASRU)
, pp. 26-33
-
-
Byrne, W.1
Finke, M.2
Khudanpur, S.3
McDonough, J.4
Nock, H.5
Riley, M.6
Saraclar, M.7
Wooters, C.8
Zavaliagkos, G.9
-
4
-
-
0031624621
-
Pronunciation modelling using a hand-labelled corpus for conversational speech recognition
-
Seattle, USA
-
Byrne, W., Finke, M., Khudanpur, S., McDonough, J., Nock, H., Riley, M., Saraclar, M., Wooters, C. & Zavaliagkos, G. (1998). Pronunciation modelling using a hand-labelled corpus for conversational speech recognition. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seattle, USA, pp. 313-316.
-
(1998)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 313-316
-
-
Byrne, W.1
Finke, M.2
Khudanpur, S.3
McDonough, J.4
Nock, H.5
Riley, M.6
Saraclar, M.7
Wooters, C.8
Zavaliagkos, G.9
-
7
-
-
0029375590
-
Speaker adaptation using constrained estimation of Gaussian mixtures
-
Digalakis, V. V., Rtischev, D. & Neumeyer, L. G. (1995). Speaker adaptation using constrained estimation of Gaussian mixtures. IEEE Transactions on Speech and Audio Processing, 3, 357-366.
-
(1995)
IEEE Transactions on Speech and Audio Processing
, vol.3
, pp. 357-366
-
-
Digalakis, V.V.1
Rtischev, D.2
Neumeyer, L.G.3
-
9
-
-
0028996886
-
Understanding and improving speech recognition performance through the use of diagnostic tools
-
Detroit, MI
-
Eide, E., Gish, H., Jeanrenaud, P. & Mielke, A. (1995). Understanding and improving speech recognition performance through the use of diagnostic tools. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Detroit, MI, pp. 221-224.
-
(1995)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 221-224
-
-
Eide, E.1
Gish, H.2
Jeanrenaud, P.3
Mielke, A.4
-
10
-
-
80053229524
-
Modeling and efficient decoding of large vocabulary conversational speech
-
Budapest, Hungary
-
Finke, M., Fritsch, J., Koll, D. & Waibel, A. (1999). Modeling and efficient decoding of large vocabulary conversational speech. Proceedings of the European Conference on Speech Communication and Technology (Eurospeech), Budapest, Hungary, pp. 467-470.
-
(1999)
Proceedings of the European Conference on Speech Communication and Technology (Eurospeech)
, pp. 467-470
-
-
Finke, M.1
Fritsch, J.2
Koll, D.3
Waibel, A.4
-
12
-
-
0043272135
-
Automatic learning of word pronunciation from data
-
addendum
-
Fosler, E., Weintraub, M., Wegmann, S., Kao, Y.-H., Khudanpur, S., Galles, C. & Saraclar, M. (1996). Automatic learning of word pronunciation from data. Proceedings of the International Conference on Spoken Language Processing (ICSLP), pp. S28-S29 (addendum).
-
(1996)
Proceedings of the International Conference on Spoken Language Processing (ICSLP)
-
-
Fosler, E.1
Weintraub, M.2
Wegmann, S.3
Kao, Y.-H.4
Khudanpur, S.5
Galles, C.6
Saraclar, M.7
-
14
-
-
0025642104
-
Word juncture modeling using phonological rules for HMM-based continuous speech recognition
-
Giachin, E., Rosenberg, A. & Lee, C. (1990). Word juncture modeling using phonological rules for HMM-based continuous speech recognition. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 737-740.
-
(1990)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 737-740
-
-
Giachin, E.1
Rosenberg, A.2
Lee, C.3
-
15
-
-
85016587886
-
SWITCHBOARD: Telephone speech corpus for research and development
-
Godfrey, J., Holliman, E. & McDaniel, J. (1992). SWITCHBOARD: Telephone speech corpus for research and development. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 517-520, Available at http://www.ldc.upenn.edu/.
-
(1992)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 517-520
-
-
Godfrey, J.1
Holliman, E.2
McDaniel, J.3
-
19
-
-
0000250399
-
Semicontinuous hidden Markov models for speech signals
-
Huang, X. D. & Jack, M. A. (1989). Semicontinuous hidden Markov models for speech signals. Computer Speech and Language, 3, 239-251.
-
(1989)
Computer Speech and Language
, vol.3
, pp. 239-251
-
-
Huang, X.D.1
Jack, M.A.2
-
20
-
-
0029747183
-
Speaker normalization using efficient frequency warping procedures
-
Atlanta, GA
-
Lee, L. & Rose, R. (1996). Speaker normalization using efficient frequency warping procedures. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Atlanta, GA, pp. 353-356.
-
(1996)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 353-356
-
-
Lee, L.1
Rose, R.2
-
21
-
-
0029288633
-
Speaker adaptation of continuous density HMMs using multivariate linear regression
-
Leggetter, C. J. & Woodland, P. C. (1995). Speaker adaptation of continuous density HMMs using multivariate linear regression. Computer Speech and Language, 9, 171-185.
-
(1995)
Computer Speech and Language
, vol.9
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
22
-
-
0021191078
-
An information theoretic approach to the automatic determination of phonemic baseforms
-
San Diego, CA
-
Lucassen, J. M. & Mercer, R. L. (1984). An information theoretic approach to the automatic determination of phonemic baseforms. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), San Diego, CA, pp. 42.5.1-42.5.4.
-
(1984)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 42.5.1-42.5.4
-
-
Lucassen, J.M.1
Mercer, R.L.2
-
24
-
-
84943154470
-
Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
-
Sydney, Australia
-
McAllaster, D., Gillick, L., Scattone, F. & Newman, M. (1998). Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch. Proceedings of the International Conference on Spoken Language Processing (ICSLP), Sydney, Australia, pp. 1847-1850.
-
(1998)
Proceedings of the International Conference on Spoken Language Processing (ICSLP)
, pp. 1847-1850
-
-
McAllaster, D.1
Gillick, L.2
Scattone, F.3
Newman, M.4
-
25
-
-
0012306376
-
The design principles of a weighted finite state transducer library
-
Mohri, M., Pereira, F. C. N. & Riley, M. (2000). The design principles of a weighted finite state transducer library. Theoretical Computer Science, 231, 17-32, Available from http://www.research.att.com/sw/tools/fsm/.
-
(2000)
Theoretical Computer Science
, vol.231
, pp. 17-32
-
-
Mohri, M.1
Pereira, F.C.N.2
Riley, M.3
-
27
-
-
0032639915
-
Improvements in recognition of conversational telephone speech
-
Peskin, B., Newman, M., McAllaster, D., Nagesha, V., Richards, H., Wegmann, S., Hunt, M. & Gillick, L. (1999). Improvements in recognition of conversational telephone speech. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 53-56.
-
(1999)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 53-56
-
-
Peskin, B.1
Newman, M.2
McAllaster, D.3
Nagesha, V.4
Richards, H.5
Wegmann, S.6
Hunt, M.7
Gillick, L.8
-
30
-
-
0033353288
-
Stochastic pronunciation modelling from hand-labelled phonetic corpora
-
Riley, M., Byrne, W., Finke, M., Khudanpur, S., Ljolje, A., McDonough, J., Nock, H., Saraclar, M., Wooters, C. & Zavaliagkos, G. (1999). Stochastic pronunciation modelling from hand-labelled phonetic corpora. Speech Communication, 29, 209-224.
-
(1999)
Speech Communication
, vol.29
, pp. 209-224
-
-
Riley, M.1
Byrne, W.2
Finke, M.3
Khudanpur, S.4
Ljolje, A.5
McDonough, J.6
Nock, H.7
Saraclar, M.8
Wooters, C.9
Zavaliagkos, G.10
-
31
-
-
0003921935
-
Automatic generation of detailed pronunciation lexicons
-
chapter 12, Kluwer Academic Press
-
Riley, M. & Ljolje, A. (1995). Automatic generation of detailed pronunciation lexicons. Automatic Speech and Speaker Recognition: Advanced Topics, chapter 12, pp. 285-302. Kluwer Academic Press.
-
(1995)
Automatic Speech and Speaker Recognition: Advanced Topics
, pp. 285-302
-
-
Riley, M.1
Ljolje, A.2
-
33
-
-
0033335618
-
Modeling pronunciation variation for ASR: A survey of the literature
-
Strik, H. & Cucchiarini, C. (1999). Modeling pronunciation variation for ASR: A survey of the literature. Speech Communication, 29, 225-246.
-
(1999)
Speech Communication
, vol.29
, pp. 225-246
-
-
Strik, H.1
Cucchiarini, C.2
-
34
-
-
85135194422
-
Building multiple pronunciation models for novel words using exploratory computational phonology
-
Madrid, Spain
-
Tajchman, G., Fosler, E. & Jurafsky, D. (1995). Building multiple pronunciation models for novel words using exploratory computational phonology. Proceedings of the European Conference on Speech Communication and Technology (Eurospeech), Madrid, Spain, pp. 2247-2250.
-
(1995)
Proceedings of the European Conference on Speech Communication and Technology (Eurospeech)
, pp. 2247-2250
-
-
Tajchman, G.1
Fosler, E.2
Jurafsky, D.3
-
35
-
-
0033106613
-
Multiple pronunciation dictionary using HMM-state confusion characteristics
-
Wakita, Y., Singer, H. & Sagisaka, Y. (1999). Multiple pronunciation dictionary using HMM-state confusion characteristics. Computer Speech and Language, 13, 143-153.
-
(1999)
Computer Speech and Language
, vol.13
, pp. 143-153
-
-
Wakita, Y.1
Singer, H.2
Sagisaka, Y.3
-
36
-
-
0043086491
-
Effect of speaking style on LVCSR performance
-
addendum
-
Weintraub, M., Taussig, K., Hunicke-Smith, K. & Snodgrass, A. (1996). Effect of speaking style on LVCSR performance. Proceedings of the International Conference on Spoken Language Processing (ICSLP), pp. S16-S19 (addendum).
-
(1996)
Proceedings of the International Conference on Spoken Language Processing (ICSLP)
-
-
Weintraub, M.1
Taussig, K.2
Hunicke-Smith, K.3
Snodgrass, A.4
-
38
-
-
0003571977
-
-
Entropic Cambridge Research Laboratory
-
Young, S., Jansen, J., Odell, J., Ollason, D. & Woodland, P. The HTK Book (Version 2.0). Entropic Cambridge Research Laboratory.
-
The HTK Book (Version 2.0)
-
-
Young, S.1
Jansen, J.2
Odell, J.3
Ollason, D.4
Woodland, P.5