-
1
-
-
79959854710
-
Learning new word pronunciations from spoken examples
-
I. Badr, I. McGraw, and J. R. Glass, "Learning new word pronunciations from spoken examples, " in Proc. INTERSPEECH, 2010, pp. 2294-2297.
-
(2010)
Proc. INTERSPEECH
, pp. 2294-2297
-
-
Badr, I.1
McGraw, I.2
Glass, J.R.3
-
2
-
-
84865763465
-
Pronunciation learning from continuous speech
-
I. Badr, I. McGraw, and J. R. Glass, "Pronunciation learning from continuous speech, " in Proc. INTERSPEECH, 2011, pp. 549-552.
-
(2011)
Proc. INTERSPEECH
, pp. 549-552
-
-
Badr, I.1
McGraw, I.2
Glass, J.R.3
-
3
-
-
41049105254
-
Joint-sequence models for grapheme-tophoneme conversion
-
May
-
M. Bisani and H. Ney, "Joint-sequence models for grapheme-tophoneme conversion, " Speech Commun., vol. 50, no. 5, pp. 434-451, May 2008.
-
(2008)
Speech Commun
, vol.50
, Issue.5
, pp. 434-451
-
-
Bisani, M.1
Ney, H.2
-
4
-
-
0002629270
-
Maximum likelihood from incomplete data via the em algorithm
-
A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm, " J. R. Statist. Soc., Ser. B, vol. 39, no. 1, pp. 1-38, 1977.
-
(1977)
J. R. Statist. Soc., Ser. B
, vol.39
, Issue.1
, pp. 1-38
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
5
-
-
0020719320
-
Maximum likelihood approach to continuous speech recognition
-
L. R. Bahl, F. Jelinek, and R. L. Mercer, "A maximum likelihood approach to continuous speech recognition, " IEEE Trans. Pattern Anal. Mach. Intell., vol. PAMI-5, no. 2, pp. 179-190, Mar. 1983. (Pubitemid 13555897)
-
(1983)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.PAMI-5
, Issue.2
, pp. 179-190
-
-
Bahl Lalit, R.1
Jelinek Frederick2
Mercer Robert, L.3
-
6
-
-
0038359548
-
A probabilistic framework for segment-based speech recognition
-
J. R. Glass, "A probabilistic framework for segment-based speech recognition, " Comput. Speech Lang., vol. 17, no. 2-3, pp. 137-152, 2003.
-
(2003)
Comput. Speech Lang
, vol.17
, Issue.2-3
, pp. 137-152
-
-
Glass, J.R.1
-
7
-
-
85009152019
-
The MIT finite-state transducer toolkit for speech and language processing
-
I. L. Hetherington, "The MIT finite-state transducer toolkit for speech and language processing, " in Proc. INTERSPEECH, 2004, pp. 2609-2612.
-
(2004)
Proc. INTERSPEECH
, pp. 2609-2612
-
-
Hetherington, I.L.1
-
8
-
-
85009074656
-
An efficient implementation of phonological rules using finite-state transducers
-
I. L. Hetherington, "An efficient implementation of phonological rules using finite-state transducers, " in Proc. EuroSpeech, 2001, pp. 1599-1602.
-
(2001)
Proc. EuroSpeech
, pp. 1599-1602
-
-
Hetherington, I.L.1
-
9
-
-
19944423811
-
Pronunciation modeling using a finite-state transducer representation
-
DOI 10.1016/j.specom.2005.03.004, PII S0167639305000361, Pronunciation Modeling and Lexicon Adaptation
-
T. J. Hazen, I. L. Hetherington, H. Shu, and K. Livescu, "Pronunciation modeling using a finite-state transducer representation, " Speech Commun., vol. 46, no. 2, pp. 189-203, 2005. (Pubitemid 40753202)
-
(2005)
Speech Communication
, vol.46
, Issue.2
, pp. 189-203
-
-
Hazen, T.J.1
Hetherington, I.L.2
Shu, H.3
Livescu, K.4
-
10
-
-
0033335618
-
Modeling pronunciation variation for ASR: A survey of the literature
-
DOI 10.1016/S0167-6393(99)00038-2
-
H. Strik and C. Cucchiarini, "Modeling pronunciation variation for ASR: A survey of the literature, " Speech Commun., vol. 29, no. 2-4, pp. 225-246, 1999. (Pubitemid 30514833)
-
(1999)
Speech Communication
, vol.29
, Issue.2
, pp. 225-246
-
-
Strik, H.1
Cucchiarini, C.2
-
11
-
-
0012262424
-
-
Ph. D. dissertation, Massachusetts Inst. of Technol., Cambridge, MA
-
I. L. Hetherington, "A characterization of the problem of new, out-of-vocabulary words in continuous-speech recognition and understanding, " Ph. D. dissertation, Massachusetts Inst. of Technol., Cambridge, MA, 1995.
-
(1995)
A Characterization of the Problem of New, Out-of-vocabulary Words in Continuous-speech Recognition and Understanding
-
-
Hetherington, I.L.1
-
12
-
-
0035278951
-
Confidence measures for large vocabulary continuous speech recognition
-
DOI 10.1109/89.906002, PII S1063667601013281
-
F. Wessel, R. Schlüter, K. Macherey, and H. Ney, "Confidence measures for large vocabulary continuous speech recognition, " IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 288-298, Mar. 2001. (Pubitemid 32286598)
-
(2001)
IEEE Transactions on Speech and Audio Processing
, vol.9
, Issue.3
, pp. 288-298
-
-
Wessel, F.1
Schluter, R.2
Macherey, K.3
Ney, H.4
-
13
-
-
33745202406
-
Open vocabulary speech recognition with flat hybrid models
-
9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
-
M. Bisani and H. Ney, "Open vocabulary speech recognition with flat hybrid models, " in Proc. INTERSPEECH, 2005, pp. 725-728. (Pubitemid 43908165)
-
(2005)
9th European Conference on Speech Communication and Technology
, pp. 725-728
-
-
Bisani, M.1
Ney, H.2
-
14
-
-
85009227369
-
Conditional and joint models for grapheme-to-phoneme conversion
-
S. F. Chen, "Conditional and joint models for grapheme-to-phoneme conversion, " in Proc. INTERSPEECH, 2003, pp. 2033-2036.
-
(2003)
Proc. INTERSPEECH
, pp. 2033-2036
-
-
Chen, S.F.1
-
15
-
-
84878203695
-
Regular models of phonological rule systems
-
R. M. Kaplan and M. Kay, "Regular models of phonological rule systems, " Comput. Linguist., vol. 20, pp. 331-378, 1994.
-
(1994)
Comput. Linguist
, vol.20
, pp. 331-378
-
-
Kaplan, R.M.1
Kay, M.2
-
17
-
-
0039255896
-
A multi-strategy approach to improving pronunciation by analogy
-
Y. Marchand and R. I. Damper, "A multi-strategy approach to improving pronunciation by analogy, " Comput. Linguist., vol. 26, pp. 195-219, 2000.
-
(2000)
Comput. Linguist
, vol.26
, pp. 195-219
-
-
Marchand, Y.1
Damper, R.I.2
-
18
-
-
19944409831
-
Unsupervised, language-independent grapheme-to-phoneme conversion by latent analogy
-
DOI 10.1016/j.specom.2005.03.002, PII S0167639305000336, Pronunciation Modeling and Lexicon Adaptation
-
J. Bellegarda, "Unsupervised, language-independent grapheme-tophoneme conversion by latent analogy, " Speech Commun., vol. 46, no. 2, pp. 140-152, 2005. (Pubitemid 40753199)
-
(2005)
Speech Communication
, vol.46
, Issue.2
, pp. 140-152
-
-
Bellegarda, J.R.1
-
19
-
-
70450194704
-
Grapheme to phoneme conversion using an SMT system
-
A. Laurent, P. Delglise, and S. Meignier, "Grapheme to phoneme conversion using an SMT system, " in Proc. INTERSPEECH, 2009, pp. 708-711.
-
(2009)
Proc. INTERSPEECH
, pp. 708-711
-
-
Laurent, A.1
Delglise, P.2
Meignier, S.3
-
20
-
-
70450186703
-
Online discriminative training for grapheme-to-phoneme conversion
-
S. Jiampojamarn and G. Kondrak, "Online discriminative training for grapheme-to-phoneme conversion, " in Proc. INTERSPEECH, 2009, pp. 1303-1306.
-
(2009)
Proc. INTERSPEECH
, pp. 1303-1306
-
-
Jiampojamarn, S.1
Kondrak, G.2
-
22
-
-
18244423993
-
Assessing text-to-phoneme mapping strategies in speaker independent isolated word recognition
-
J. Häkkinen, J. Suontausta, S. Riis, and K. J. Jensen, "Assessing text-to-phoneme mapping strategies in speaker independent isolated word recognition, " Speech Commun., vol. 41, no. 2-3, pp. 455-467, 2003.
-
(2003)
Speech Commun
, vol.41
, Issue.2-3
, pp. 455-467
-
-
Häkkinen, J.1
Suontausta, J.2
Riis, S.3
Jensen, K.J.4
-
23
-
-
79959846061
-
-
M. S. thesis, Massachusetts Inst. of Technol., Cambridge, MA
-
S. Wang, "Using graphone models in automatic speech recognition, " M. S. thesis, Massachusetts Inst. of Technol., Cambridge, MA, 2009.
-
(2009)
Using Graphone Models in Automatic Speech Recognition
-
-
Wang, S.1
-
24
-
-
84943154470
-
Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
-
D. McAllaster, L. Gillick, F. Scattone, and M. Newman, "Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch, " in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), 1998.
-
(1998)
Proc. Int. Conf. Spoken Lang. Process. (ICSLP
-
-
McAllaster, D.1
Gillick, L.2
Scattone, F.3
Newman, M.4
-
25
-
-
84956975318
-
Automatic baseform generation from acoustic data
-
B. Maison, "Automatic baseform generation from acoustic data, " in Proc. INTERSPEECH, 2003, pp. 2545-2548.
-
(2003)
Proc. INTERSPEECH
, pp. 2545-2548
-
-
Maison, B.1
-
26
-
-
51449103917
-
A turbo-style algorithm for lexical baseforms estimation
-
G. F. Choueiter, M. I. Ohannessian, S. Seneff, and J. R. Glass, "A turbo-style algorithm for lexical baseforms estimation, " in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2008, pp. 4313-4316.
-
(2008)
Proc IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP
, pp. 4313-4316
-
-
Choueiter, G.F.1
Ohannessian, M.I.2
Seneff, S.3
Glass, J.R.4
-
27
-
-
0026372223
-
Automatic phonetic baseform determination
-
L. R. Bahl, S. Das, P. V. Desouza, M. Epstein, R. L. Mercer, B. Merialdo, D. Nahamoo, M. A. Picheny, and J. Powell, "Automatic phonetic baseform determination, " in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1991, pp. 173-176.
-
(1991)
Proc IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP
, pp. 173-176
-
-
Bahl, L.R.1
Das, S.2
Desouza, P.V.3
Epstein, M.4
Mercer, R.L.5
Merialdo, B.6
Nahamoo, D.7
Picheny, M.A.8
Powell, J.9
-
28
-
-
44849099982
-
Adapting grapheme-tophoneme conversion for name recognition
-
X. Li, A. Gunawardana, and A. Acero, "Adapting grapheme-tophoneme conversion for name recognition, " in Proc. Autom. Speech Recognit. Understand. Workshop (ASRU), 2007, pp. 130-135.
-
(2007)
Proc. Autom. Speech Recognit. Understand. Workshop (ASRU
, pp. 130-135
-
-
Li, X.1
Gunawardana, A.2
Acero, A.3
-
29
-
-
70349209414
-
Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion
-
O. Vinyals, L. Deng, D. Yu, and A. Acero, "Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion, " in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2009, pp. 4445-4448.
-
(2009)
Proc IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP
, pp. 4445-4448
-
-
Vinyals, O.1
Deng, L.2
Yu, D.3
Acero, A.4
-
30
-
-
0033878021
-
JUPITER: A telephone-based conversational interface for weather information
-
DOI 10.1109/89.817460
-
V. Zue, S. Seneff, J. Glass, J. Polifroni, C. Pao, T. J. Hazen, and L. Hetherington, "Jupiter: A telephone-based conversational interface for weather information, " IEEE Trans. Speech Audio Process., vol. 8, no. 1, pp. 85-96, Jan. 2000. (Pubitemid 30540738)
-
(2000)
IEEE Transactions on Speech and Audio Processing
, vol.8
, Issue.1
, pp. 85-96
-
-
Zue Victor1
Seneff Stephanie2
Glass James, R.3
Polifroni Joseph4
Pao Christine5
Hazen Timothy, J.6
Hetherington Lee7
-
32
-
-
0024909979
-
Some statistical issues in the comparison of speech recognition algorithms
-
L. Gillick and S. Cox, "Some statistical issues in the comparison of speech recognition algorithms, " in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1989, pp. 532-535. (Pubitemid 20604171)
-
(1989)
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
, vol.1
, pp. 532-535
-
-
Gillick, L.1
Cox, S.J.2
-
33
-
-
84878563022
-
Automating crowd-supervised learning for spoken language systems
-
I. McGraw, S. Cyphers, P. Pasupat, J. Liu, and J. Glass, "Automating crowd-supervised learning for spoken language systems, " in Proc. INTERSPEECH, 2012.
-
(2012)
Proc. INTERSPEECH
-
-
McGraw, I.1
Cyphers, S.2
Pasupat, P.3
Liu, J.4
Glass, J.5
|