-
1
-
-
0028996974
-
Language model representations for beam-search decoding
-
DOI: 10.1109/ICASSP.1995.47966633
-
G. Antoniol, F. Brugnara, M. Cettolo, and M. Frederico, "Language model representations for beam-search decoding," in Proc. ICASSP, 1995, pp. 588-591. DOI: 10.1109/ICASSP.1995.47966633
-
(1995)
Proc. ICASSP
, pp. 588-591
-
-
Antoniol, G.1
Brugnara, F.2
Cettolo, M.3
Frederico, M.4
-
3
-
-
33745219793
-
General indexation of weighted automata-application to spoken utterance retrieval
-
C. Allauzen and M. Mohri, "General indexation of weighted automata-application to spoken utterance retrieval," in Proc. HLT-NAACL, 2004. 135
-
(2004)
Proc. HLT-NAACL
, vol.135
-
-
Allauzen, C.1
Mohri, M.2
-
4
-
-
85149121374
-
Generalized algorithms for constructing statistical language models
-
DOI: 10.3115/1075096.107510277,130
-
C. Allauzen, M. Mohri, and B. Roark, "Generalized algorithms for constructing statistical language models," in Proc. ACL, 2003, pp. 40-47. DOI: 10.3115/1075096.1075102 77, 130
-
(2003)
Proc. ACL
, pp. 40-47
-
-
Allauzen, C.1
Mohri, M.2
Roark, B.3
-
5
-
-
4544339437
-
A generalized construction of integrated speech recognition transducers
-
DOI: 10.1109/CASSP.2004.132609791
-
C. Allauzen, M. Mohri, M. Riley, and B. Roark, "A generalized construction of integrated speech recognition transducers," in Proc. ICASSP, vol. I, 2004, pp. 761-764. DOI: 10.1109/CASSP.2004.132609791
-
(2004)
Proc. ICASSP
, vol.1
, pp. 761-764
-
-
Allauzen, C.1
Mohri, M.2
Riley, M.3
Roark, B.4
-
6
-
-
38149133882
-
OpenFst: A general and efficient weighted finite-state transducer library
-
DOI: 10.1007/978-3-540-76336-9-3136
-
C. Allauzen, M. Riley, J. Schalkwyk, W. Skut, and M. Mohri, "OpenFst: A general and efficient weighted finite-state transducer library," in Proc. of CIAA, 2007, pp. 11-23. DOI: 10.1007/978-3-540-76336- 9-3136
-
(2007)
Proc. of CIAA
, pp. 11-23
-
-
Allauzen, C.1
Riley, M.2
Schalkwyk, J.3
Skut, W.4
Mohri, M.5
-
7
-
-
70450183653
-
A generalized composition algorithm for weighted finite-state transducers
-
71, 91 95 99 104 105 107
-
C. Allauzen, M. Riley, and J. Schalkwyk, "A generalized composition algorithm for weighted finite-state transducers," in Proc. Interspeech, 2009, pp. 1203-1206. 71, 91, 95, 99, 104, 105, 107
-
(2009)
Proc. Interspeech
, pp. 1203-1206
-
-
Allauzen, C.1
Riley, M.2
Schalkwyk, J.3
-
8
-
-
84855817752
-
A filter-based algorithm for efficient composition of finite-state transducers
-
DOI: 10.1142/S0129054111009033 105, 106
-
C. Allauzen, M. Riley, and J. Schalkwyk, "A filter-based algorithm for efficient composition of finite-state transducers," International Journal of Foundations of Computer Science, 2011. DOI: 10.1142/S0129054111009033 105, 106
-
(2011)
International Journal of Foundations of Computer Science
-
-
Allauzen, C.1
Riley, M.2
Schalkwyk, J.3
-
9
-
-
0026382117
-
The forward-backward search algorithm
-
DOI: 10.1109/ICASSP.1991.1504354
-
S. Austin, R. Schwartz, and P. Placeway, "The forward-backward search algorithm," in Proc. ICASSP, vol. 1, 1991, pp. 697-700. DOI: 10.1109/ICASSP.1991.1504354
-
(1991)
Proc. ICASSP
, vol.1
, pp. 697-700
-
-
Austin, S.1
Schwartz, R.2
Placeway, P.3
-
10
-
-
0036460898
-
An overview of decoding techniques for large vocabulary continuous speech recognition
-
DOI: 10.1006/csla.2001.01854
-
X.L.Aubert, "An overview of decoding techniques for large vocabulary continuous speech recognition," Computer Speech and Language, vol. 16, pp. 89-114, 2002. DOI: 10.1006/csla.2001.01854
-
(2002)
Computer Speech and Language
, vol.16
, pp. 89-114
-
-
Aubert, X.L.1
-
11
-
-
84987256786
-
An algorithm for connected word recognition
-
3
-
J. S. Bridle, M. D. Brown, and R. M. Chamberlain, "An algorithm for connected word recognition," in Proc. ICASSP, 1982, pp. 899-902. 3
-
(1982)
Proc. ICASSP
, pp. 899-902
-
-
Bridle, J.S.1
Brown, M.D.2
Chamberlain, R.M.3
-
12
-
-
84865035841
-
Minimization of automata
-
abs/1010.5318
-
J. Berstel, L. Boasson, O. Carton, and I. Fagnot, "Minimization of automata," CoRR, vol. abs/1010.5318, 2010. 61
-
(2010)
CoRR
, pp. 61
-
-
Berstel, J.1
Boasson, L.2
Carton, O.3
Fagnot, I.4
-
13
-
-
85135168435
-
Improvements in tree-based language model representation
-
F. Brugnara and M. Cettolo, "Improvements in tree-based language model representation," in Proc. EUROSPEECH, 1995, pp. 1797-1800. 33
-
(1995)
Proc. EUROSPEECH
, vol.33
, pp. 1797-1800
-
-
Brugnara, F.1
Cettolo, M.2
-
15
-
-
85022919385
-
Class-based n-gram models of natural language
-
130
-
P. F. Brown, P. V. deSouza, R. L. Mercer, V. J. Della Pietra, and J. C. Lai, "Class-based n-gram models of natural language," Computational Linguistics, vol. 18(4), pp. 467-479, 1992. 130
-
(1992)
Computational Linguistics
, vol.18
, Issue.4
, pp. 467-479
-
-
Brown, P.F.1
Desouza, P.V.2
Mercer, R.L.3
Della Pietra, V.J.4
Lai, J.C.5
-
16
-
-
0026400222
-
Decision trees for phonological rules in continuous speech
-
DOI: 10.1109/ICASSP.1991.150308 18 131
-
L. R. Bahl, P. V. de Souza, and P. S. Gopalakrishman, "Decision trees for phonological rules in continuous speech," in Proc. ICASSP, 1991, pp. 185-188. DOI: 10.1109/ICASSP.1991.150308 18, 131
-
(1991)
Proc. ICASSP
, pp. 185-188
-
-
Bahl, L.R.1
De Souza, P.V.2
Gopalakrishman, P.S.3
-
17
-
-
70349521673
-
Robust understanding in multimodal interfaces
-
DOI: 10.1162/coli.08-022-R2-06-26135
-
S. Bangalore and M. Johnston, "Robust understanding in multimodal interfaces," Computer Linguistics, vol. 35, no. 3, pp. 345-397, 2009. DOI: 10.1162/coli.08-022-R2-06-26135
-
(2009)
Computer Linguistics
, vol.35
, Issue.3
, pp. 345-397
-
-
Bangalore, S.1
Johnston, M.2
-
18
-
-
0020719320
-
Maximum likelihood approach to continuous speech recognition
-
Mar DOI: 10.1109/TPAMI.1983.47673709
-
L. R. Bahl, F. Jelinek, and R. L. Mercer, "Maximum likelihood approach to continuous speech recognition," IEEE Transactions on Patten Analysis and Machine Intelligence, vol. PAMI-5, no. 2, pp. 179-190, Mar. 1983. DOI: 10.1109/TPAMI.1983.47673709
-
(1983)
IEEE Transactions on Patten Analysis and Machine Intelligence
, vol.PAMI-5
, Issue.2
, pp. 179-190
-
-
Bahl, L.R.1
Jelinek, F.2
Mercer, R.L.3
-
19
-
-
0017216776
-
Testing for the consecutive ones property, interval graphs, and graph planarity using pq-tree algorithms
-
DOI: 10.1016/S0022-0000(76)80045-1109
-
K. Booth and G. Lueker, "Testing for the consecutive ones property, interval graphs, and graph planarity using pq-tree algorithms," Journal of Computer and System Sciences, vol. 13, pp. 335-379, 1976. DOI: 10.1016/S0022-0000(76)80045-1109
-
(1976)
Journal of Computer and System Sciences
, vol.13
, pp. 335-379
-
-
Booth, K.1
Lueker, G.2
-
20
-
-
0034854347
-
Joint prosody prediction and unit selection for concatenative speech synthesis
-
DOI: 10.1109/ICASSP.2001.941031135
-
I. Bulyko and M. Ostendorf, "Joint prosody prediction and unit selection for concatenative speech synthesis," in Proc. ICASSP, vol. 2, 2001, pp. 781-784. DOI: 10.1109/ICASSP.2001.941031135
-
(2001)
Proc. ICASSP
, vol.2
, pp. 781-784
-
-
Bulyko, I.1
Ostendorf, M.2
-
21
-
-
0036663562
-
Efficient integrated response generation from multiple targets using weighted finite state transducers
-
DOI: 10.1016/S0885-2308(02)00023-2135
-
I. Bulyko and M.Ostendorf, "Efficient integrated response generation from multiple targets using weighted finite state transducers," Computer Speech and Language, vol. 16(3-4), pp. 533-550, 2002. DOI: 10.1016/S0885- 2308(02)00023-2135
-
(2002)
Computer Speech and Language
, vol.16
, Issue.3-4
, pp. 533-550
-
-
Bulyko, I.1
Ostendorf, M.2
-
22
-
-
84962861457
-
Finite-state transducers for speech-input translation
-
DOI: 10.1109/ASRU.2001.1034664133
-
F. Casacuberta, "Finite-state transducers for speech-input translation," in Proc. ASRU, 2001, pp. 375-380. DOI: 10.1109/ASRU.2001. 1034664133
-
(2001)
Proc. ASRU
, pp. 375-380
-
-
Casacuberta, F.1
-
23
-
-
34547544207
-
A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition
-
DOI: 10.1109/ICASSP.2007.36692095, 99
-
O. Cheng, J. Dines, and M. M. Doss, "A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition," in Proc. ICASSP, 2007, pp. 348-351. DOI: 10.1109/ICASSP.2007.36692095, 99
-
(2007)
Proc. ICASSP
, pp. 348-351
-
-
Cheng, O.1
Dines, J.2
Doss, M.M.3
-
24
-
-
84925661323
-
Rational kernels: Theory and algorithms
-
C. Cortes, P. Haffner, and M. Mohri, "Rational kernels: Theory and algorithms," The Journal of Machine Learning Research, vol. 5, pp. 1035-1062, 2004.
-
(2004)
The Journal of Machine Learning Research
, vol.5
, pp. 1035-1062
-
-
Cortes, C.1
Haffner, P.2
Mohri, M.3
-
25
-
-
84962787683
-
Transducer composition for "on-The-fly" lexicon and language model integration
-
DOI: 10.1109/ASRU.2001.103466795 99
-
D. Caseiro and I.Trancoso, "Transducer composition for "on-The-fly" lexicon and language model integration," in Proc. ASRU, 2001, pp. 393-396. DOI: 10.1109/ASRU.2001.103466795, 99
-
(2001)
Proc. ASRU
, pp. 393-396
-
-
Caseiro, D.1
Trancoso, I.2
-
26
-
-
0141480004
-
A tail-sharing WFST composition for large vocabulary speech recognition
-
DOI: 10.1109/ICASSP.2003.119879195
-
D. Caseiro and I. Trancoso, "A tail-sharing WFST composition for large vocabulary speech recognition," in Proc. ICASSP, vol. I, 2003, pp. 356-359. DOI: 10.1109/ICASSP.2003.119879195
-
(2003)
Proc. ICASSP
, vol.1
, pp. 356-359
-
-
Caseiro, D.1
Trancoso, I.2
-
27
-
-
34047273021
-
A specialized on-The-fly algorithm for lexicon and language model composition
-
DOI: 10.1109/TSA.2005.86083899
-
D. Caseiro and I. Trancoso, "A specialized on-The-fly algorithm for lexicon and language model composition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 4, pp. 1281-1291, 2006. DOI: 10.1109/TSA.2005.86083899
-
(2006)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.14
, Issue.4
, pp. 1281-1291
-
-
Caseiro, D.1
Trancoso, I.2
-
28
-
-
84966356313
-
Grapheme-to-phone using finite-state transducers
-
DOI: 10.1109/WSS.2002.1224412135
-
D. Caseiro, L. Trancoso, L. Oliveira, and C. Viana, "Grapheme-to- phone using finite-state transducers," in Proc. IEEE Workshop on Speech Synthesis, 2002, pp. 215-218. DOI: 10.1109/WSS.2002.1224412135
-
(2002)
Proc. IEEE Workshop on Speech Synthesis
, pp. 215-218
-
-
Caseiro, D.1
Trancoso, L.2
Oliveira, L.3
Viana, C.4
-
29
-
-
44849131087
-
The TITECH large vocabulary WFST speech recognition system
-
DOI: 10.1109/ASRU.2007.4430153136
-
P.R. Dixon, D. Caseiro, T. Oonishi, and S. Furui, "The TITECH large vocabulary WFST speech recognition system," in Proc. ASRU, 2007, pp. 443-448. DOI: 10.1109/ASRU.2007.4430153136
-
(2007)
Proc. ASRU
, pp. 443-448
-
-
Dixon, P.R.1
Caseiro, D.2
Oonishi, T.3
Furui, S.4
-
30
-
-
84962878172
-
Incremental language models for speech recognition using finite-state transducers
-
DOI: 10.1109/ASRU.2001.1034620 95, 110
-
H. J. G. A. Dolfing and I. L. Hetherington, "Incremental language models for speech recognition using finite-state transducers," in Proc. ASRU, 2001, pp. 194-197. DOI: 10.1109/ASRU.2001.1034620 95, 110
-
(2001)
Proc. ASRU
, pp. 194-197
-
-
Dolfing, H.J.G.A.1
Hetherington, I.L.2
-
31
-
-
84867588266
-
A comparison of dynamic WFST decoding approaches
-
Kyoto, Japan DOI: 10.1109/ICASSP.2012.6288847 93, 126
-
P. R. Dixon, C. Hori, and H. Kashioka, "A comparison of dynamic WFST decoding approaches," in Proc. ICASSP, Kyoto, Japan, 2012, pp. 4209-4212. DOI: 10.1109/ICASSP.2012.6288847 93, 126
-
(2012)
Proc. ICASSP
, pp. 4209-4212
-
-
Dixon, P.R.1
Hori, C.2
Kashioka, H.3
-
32
-
-
85149140805
-
Parameter estimation for probabilistic finite-state transducers
-
DOI: 10.3115/1073083.1073085133
-
J. Eisner, "Parameter estimation for probabilistic finite-state transducers," in Proc. ACL, 2002, pp. 1-8. DOI: 10.3115/1073083.1073085133
-
(2002)
Proc. ACL
, pp. 1-8
-
-
Eisner, J.1
-
33
-
-
0023776398
-
The DARPA 1000-word resource management database for continuous speech recognition
-
DOI: 10.1109/ICASSP.1988.1966691
-
W. M. Fisher, J. Bernstein, and D. S. Pallett, "The DARPA 1000-word resource management database for continuous speech recognition," in Proc. ICASSP, vol. 1, 1988, pp. 651-654. DOI: 10.1109/ICASSP.1988.1966691
-
(1988)
Proc. ICASSP
, vol.1
, pp. 651-654
-
-
Fisher, W.M.1
Bernstein, J.2
Pallett, D.S.3
-
34
-
-
33745207361
-
A Japanese national project on spontaneous speech corpus and processing technology
-
83
-
S.Furui, K.Maekawa, and H. Isahara, "A Japanese national project on spontaneous speech corpus and processing technology," in Proc. of ASR, 2000, pp. 244-248. 83
-
(2000)
Proc. of ASR
, pp. 244-248
-
-
Furui, S.1
Maekawa, K.2
Isahara, H.3
-
35
-
-
84872849317
-
-
web page
-
"AT&T FSM Library," web page http://www.itl.nist.gov/iad/ mig/tests/rt/2009/index.html. 136
-
AT&T FSM Library
-
-
-
36
-
-
0022667694
-
Speaker-independent isolated word recognition using dynamic features of speech spectrum
-
DOI: 10.1109/TASSP.1986.116478812
-
S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 34, no. 1, pp. 52-59, 1986. DOI: 10.1109/TASSP.1986.116478812
-
(1986)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.34
, Issue.1
, pp. 52-59
-
-
Furui, S.1
-
37
-
-
84867198032
-
Silence models in weighted finite-state transducers
-
75
-
P.Garner, "Silence models in weighted finite-state transducers," in Proc.Interspeech, Brisbane, Australia, 2008, pp. 1817-1820. 75
-
(2008)
Proc.Interspeech, Brisbane, Australia
, pp. 1817-1820
-
-
Garner, P.1
-
38
-
-
0028996969
-
A tree search strategy for large-vocabulary continuous speech recognition
-
DOI: 10.1109/ICASSP.1995.4796624
-
P. S. Gopalakrishnan, L. R. Bahl, and R. L. Mercer, "A tree search strategy for large-vocabulary continuous speech recognition," in Proc. ICASSP, vol. 572-575, 1995. DOI: 10.1109/ICASSP.1995.4796624
-
(1995)
Proc. ICASSP
, pp. 572-575
-
-
Gopalakrishnan, P.S.1
Bahl, L.R.2
Mercer, R.L.3
-
39
-
-
0000803388
-
The population frequencies of species and the estimation of population parameters
-
DOI: 10.2307/233334422
-
I. J.Good, "The population frequencies of species and the estimation of population parameters," Biometrika, vol. 40, no. 3-4, pp. 237-264, 1953. DOI: 10.2307/233334422
-
(1953)
Biometrika
, vol.40
, Issue.3-4
, pp. 237-264
-
-
Good, I.J.1
-
40
-
-
0004056285
-
-
Prentice Hall 12
-
X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall, 2001. 12
-
(2001)
Spoken Language Processing: A Guide to Theory Algorithm, and System Development
-
-
Huang, X.1
Acero, A.2
Hon, H.-W.3
-
41
-
-
0033709098
-
Tandem connectionist feature extraction for conventional HMM systems
-
DOI: 10.1109/ICASSP.2000.86202412
-
H. Hermansky, D. P. W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. ICASSP, vol. 3, 2000, pp. 1635-1638. DOI: 10.1109/ICASSP.2000.86202412
-
(2000)
Proc. ICASSP
, vol.3
, pp. 1635-1638
-
-
Hermansky, H.1
Ellis, D.P.W.2
Sharma, S.3
-
42
-
-
85009152019
-
The MIT finite-state transducer toolkit for speech and language processing
-
136
-
I. L. Hetherington, "The MIT finite-state transducer toolkit for speech and language processing," in Proc. Interspeech-ICSLP, 2004. 136
-
(2004)
Proc. Interspeech-ICSLP
-
-
Hetherington, I.L.1
-
43
-
-
33745188707
-
A multi-pass, dynamic-vocabulary approach to real-time, large-vocabulary speech recognition
-
I. L. Hetherington, "A multi-pass, dynamic-vocabulary approach to real-time, large-vocabulary speech recognition," in Proc. Interspeech-Eurospeech, 2005, pp. 545-548. 131
-
(2005)
Proc. Interspeech-Eurospeech
, vol.131
, pp. 545-548
-
-
Hetherington, I.L.1
-
44
-
-
17444406567
-
The ATIS spoken language systems pilot corpus
-
Hidden Valley, Pennsylvania June DOI: 10.3115/116580.1166131
-
C.T.Hemphill, J.J.Godfrey, and G.R.Doddington, "The ATIS spoken language systems pilot corpus," in DARPA Speech and Natural Language Workshop, Hidden Valley, Pennsylvania, June 1990. DOI: 10.3115/116580.1166131
-
(1990)
DARPA Speech and Natural Language Workshop
-
-
Hemphill, C.T.1
Godfrey, J.J.2
Doddington, G.R.3
-
45
-
-
85009204481
-
Speech summarization using weighted finite-state transducers
-
134
-
T. Hori, C. Hori, and Y. Minami, "Speech summarization using weighted finite-state transducers," in Proc. Eurospeech, 2003, pp. 2817-2820. 134
-
(2003)
Proc. Eurospeech
, pp. 2817-2820
-
-
Hori, T.1
Hori, C.2
Minami, Y.3
-
46
-
-
85009063824
-
Fast on-The-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition
-
6 95 110
-
T.Hori, C.Hori, and Y.Minami, "Fast on-The-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition," in Proc. Interspeech-ICSLP, vol. 1, 2004, pp. 289-292. 6, 95, 110
-
(2004)
Proc. Interspeech-ICSLP
, vol.1
, pp. 289-292
-
-
Hori, T.1
Hori, C.2
Minami, Y.3
-
47
-
-
45849093239
-
Efficient WFST-based one-pass decoding with on-The-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition
-
DOI: 10.1109/TASL.2006.889790 6 95 110
-
T. Hori, C. Hori, Y. Minami, and A. Nakamura, "Efficient WFST-based one-pass decoding with on-The-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition," IEEETransactions on Audio, Speech, and Language Processing, vol.15, no.4, pp.1352-1365, 2007.DOI: 10.1109/TASL.2006.889790 6, 95, 110
-
(2007)
IEEETransactions on Audio, Speech, and Language Processing
, vol.15
, Issue.4
, pp. 1352-1365
-
-
Hori, T.1
Hori, C.2
Minami, Y.3
Nakamura, A.4
-
48
-
-
83755196741
-
WFST enabled solutions to ASR problems: Beyond HMM decoding
-
DOI: 10.1109/TASL.2011.2162402132
-
B. Hoffmeister, G. Heigold, D. Rybach, R. Schluter, and H. Ney, "WFST enabled solutions to ASR problems: Beyond HMM decoding," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20(2), pp. 551-564, 2012. DOI: 10.1109/TASL.2011.2162402132
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.2
, pp. 551-564
-
-
Hoffmeister, B.1
Heigold, G.2
Rybach, D.3
Schluter, R.4
Ney, H.5
-
49
-
-
0003620778
-
-
Addison-Wesley Publishing Company 4 45 75
-
J. E. Hopcroft, R. Motwani, and J. D. Ullman, Introduction to Automata Theory, Languages, and Computation, 3rd ed. Addison-Wesley Publishing Company, 2006. 4, 45, 75
-
(2006)
Introduction to Automata Theory, Languages, and Computation, 3rd Ed
-
-
Hopcroft, J.E.1
Motwani, R.2
Ullman, J.D.3
-
50
-
-
33646426591
-
Generalized fast on-The-fly composition algorithm for WFST-based speech recognition
-
94 95 110 130
-
T. Hori and A. Nakamura, "Generalized fast on-The-fly composition algorithm for WFST-based speech recognition," in Proc. Interspeech- Eurospeech, 2005, pp. 557-560. 94, 95, 110, 130
-
(2005)
Proc. Interspeech-Eurospeech
, pp. 557-560
-
-
Hori, T.1
Nakamura, A.2
-
51
-
-
84867199378
-
Dialog management using weighted finite-state transducers
-
DOI: 10.1109/ASRU.2009.5373350136
-
C.Hori, K.Ohtaki, T.Misu, H.Kashioka, andS.Nakamura, "Dialog management using weighted finite-state transducers," in Proc. Interspeech, 2008, pp. 211-214. DOI: 10.1109/ASRU.2009.5373350136
-
(2008)
Proc. Interspeech
, pp. 211-214
-
-
Hori, C.1
Ohtaki, K.2
Misu, T.3
Nakamura, S.4
-
52
-
-
70349207745
-
Statistical dialog management applied to WFST-based dialog systems
-
DOI: 10.1109/ICASSP.2009.4960703136
-
C. Hori, K. Ohtaki, T. Misu, H. Kashioka, and S. Nakamura, "Statistical dialog management applied to WFST-based dialog systems," in Proc. ICASSP, 2009, pp. 4793-4796. DOI: 10.1109/ICASSP.2009.4960703136
-
(2009)
Proc. ICASSP
, pp. 4793-4796
-
-
Hori, C.1
Ohtaki, K.2
Misu, T.3
Kashioka, H.4
Nakamura, S.5
-
53
-
-
33947677731
-
Flexible multi-stream framework for speech recognition using multi-tape finite-state transducers
-
DOI: 10.1109/ICASSP.2006.1660046132
-
I. L. Hetherington, H. Shu, and J. R. Glass, "Flexible multi-stream framework for speech recognition using multi-tape finite-state transducers," in Proc. ICASSP, 2006, pp. 417-420. DOI: 10.1109/ICASSP.2006. 1660046132
-
(2006)
Proc. ICASSP
, pp. 417-420
-
-
Hetherington, I.L.1
Shu, H.2
Glass, J.R.3
-
54
-
-
70349226871
-
A multimedia retrieval system using speech input
-
DOI: 10.1145/1647314.1647356132
-
G. Heigold, R. Schluter, and H. Ney, "A multimedia retrieval system using speech input," in Proc. ICASSP, 2009, pp. 3749-3752. DOI: 10.1145/1647314.1647356132
-
(2009)
Proc. ICASSP
, pp. 3749-3752
-
-
Heigold, G.1
Schluter, R.2
Ney, H.3
-
55
-
-
0141480041
-
Language model adaptation using WFST-based speaking-style translation
-
DOI: 10.1109/ICASSP.2003.1198759131
-
T. Hori, D. Willett, and Y. Minami, "Language model adaptation using WFST-based speaking-style translation," in Proc. ICASSP, vol. I, 2003, pp. 228-231. DOI: 10.1109/ICASSP.2003.1198759131
-
(2003)
Proc. ICASSP
, vol.1
, pp. 228-231
-
-
Hori, T.1
Willett, D.2
Minami, Y.3
-
56
-
-
77954609645
-
Paraphrasing spontaneous speech using weighted finite-state transducers
-
134
-
T. Hori, D. Willett, and Y. Minami, "Paraphrasing spontaneous speech using weighted finite-state transducers," in Proc. SSPR, 2003. 134
-
(2003)
Proc. SSPR
-
-
Hori, T.1
Willett, D.2
Minami, Y.3
-
57
-
-
70349208656
-
Aflat direct model for speech recognition
-
DOI: 10.1109/ICASSP.2009.4960470133
-
G.Heigold, G.Zweig, andP.Nguyen, "Aflat direct model for speech recognition," in Proc. ICASSP, 2009, pp. 3861-3864. DOI: 10.1109/ICASSP.2009.4960470133
-
(2009)
Proc. ICASSP
, pp. 3861-3864
-
-
Heigold, G.1
Nguyen, P.2
-
58
-
-
0040261516
-
Language modeling with sentence-level mixtures
-
DOI: 10.3115/1075812.1075828131
-
R. Iyer, M. Ostendorf, and J. R. Rohlicek, "Language modeling with sentence-level mixtures," in Proc.Workshop on Human Language Technology, 1994, pp. 82-87. DOI: 10.3115/1075812.1075828131
-
(1994)
Proc.Workshop on Human Language Technology
, pp. 82-87
-
-
Iyer, R.1
Ostendorf, M.2
Rohlicek, J.R.3
-
59
-
-
0016507833
-
Design of a linguistic statistical decoder for the recognition of continuous speech
-
DOI: 10.1109/TIT.1975.10553849
-
F. Jelinek, L. R. Bahl, and R. L. Mercer, "Design of a linguistic statistical decoder for the recognition of continuous speech," IEEE Transactions on Information Theory, vol. IT-21, no. 3, pp. 250-256, 1975. DOI: 10.1109/TIT.1975.10553849
-
(1975)
IEEE Transactions on Information Theory
, vol.21
, Issue.3
, pp. 250-256
-
-
Jelinek, F.1
Bahl, L.R.2
Mercer, R.L.3
-
61
-
-
0012357341
-
A dynamic language model for speech recognition
-
DOI: 10.3115/112405.112464131
-
F. Jelinek, B. Merialdo, R. S., and M. Strauss, "A dynamic language model for speech recognition," in Proc. DARPA Workshop on Speech and Natural Language, 1991, pp. 293-295. DOI: 10.3115/112405.112464131
-
(1991)
Proc. DARPA Workshop on Speech and Natural Language
, pp. 293-295
-
-
Jelinek, F.1
Merialdo R S, B.2
Strauss, M.3
-
62
-
-
77950550412
-
Development of a WFST based speech recognition system for a resource deficient language using machine translation
-
131
-
A. T. Jensson, T. Oonishi, K. Iwano, and S. Furui, "Development of a WFST based speech recognition system for a resource deficient language using machine translation," in Proc. APSIPA ASC, 2009, pp. 50-56. 131
-
(2009)
Proc. APSIPA ASC
, pp. 50-56
-
-
Jensson, A.T.1
Oonishi, T.2
Iwano, K.3
Furui, S.4
-
63
-
-
85009198110
-
Speech recognition with dynamic grammars using finite-state transducers
-
131
-
J. J. Schalkwyk, I. L. Hetherington, and E. Story, "Speech recognition with dynamic grammars using finite-state transducers," in Proc. Eurospeech, 2003, pp. 1969-1972. 131
-
(2003)
Proc. Eurospeech
, pp. 1969-1972
-
-
Schalkwyk, J.J.1
Hetherington, I.L.2
Story, E.3
-
64
-
-
0032289099
-
Heteroscendastic discriminant analysis and reduced rank HMMs for improved speech recognition
-
DOI: 10.1016/S0167-6393(98)00061-212
-
N. Kumar and H. G. Andreou, "Heteroscendastic discriminant analysis and reduced rank HMMs for improved speech recognition," Speech Communication, vol. 26, pp. 283-297, 1998. DOI: 10.1016/S0167-6393(98)00061-212
-
(1998)
Speech Communication
, vol.26
, pp. 283-297
-
-
Kumar, N.1
Andreou, H.G.2
-
65
-
-
0023312404
-
Estimation of probabilities from sparse data for the language model component of a speech recognizer
-
DOI: 10.1109/TASSP.1987.116512522
-
S. M. Katz, "Estimation of probabilities from sparse data for the language model component of a speech recognizer," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 35, no. 3, pp. 400-401, 1987. DOI: 10.1109/TASSP.1987.116512522
-
(1987)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.35
, Issue.3
, pp. 400-401
-
-
Katz, S.M.1
-
66
-
-
0026372947
-
Admissible heuristics for rapid lexical access
-
DOI: 10.1109/ICASSP.1991.1504334
-
P. Kenny, R. Hollan, V. Gupta, M. Lennig, P.Mermelstein, and D. O'Shaughnessy, "A&z.ast;-admissible heuristics for rapid lexical access," in Proc. ICASSP, 1991, pp. 689-692. DOI: 10.1109/ICASSP.1991. 1504334
-
(1991)
Proc. ICASSP
, pp. 689-692
-
-
Kenny, P.1
Hollan, R.2
Gupta, V.3
Lennig, M.4
Mermelstein, P.5
O'Shaughnessy, D.6
-
67
-
-
0028996960
-
Improved decision trees for phonetic modeling
-
DOI: 10.1007/11965152-26131
-
R. Kuhn, A. Lazarides, Y. Normandin, and J. Brousseau, "Improved decision trees for phonetic modeling," in Proc. ICASSP, vol. 1, 1995, pp. 552-555. DOI: 10.1007/11965152-26131
-
(1995)
Proc. ICASSP
, vol.1
, pp. 552-555
-
-
Kuhn, R.1
Lazarides, A.2
Normandin, Y.3
Brousseau, J.4
-
68
-
-
84941164160
-
FSA: An efficient and flexible C++ toolkit for finite state automata using on-demand computation
-
136
-
S. Kanthak and H. Ney, "FSA: An efficient and flexible C++ toolkit for finite state automata using on-demand computation," in Proc. ACL, 2004, pp. 510-517. 136
-
(2004)
Proc. ACL
, pp. 510-517
-
-
Kanthak, S.1
Ney, H.2
-
69
-
-
84865227975
-
Structural classification methods based on weighted finite-state transducers for automatic speech recognition
-
in press. 133
-
Y. Kubo, S. Watanabe, T. Hori, and A. Nakamura, "Structural classification methods based on weighted finite-state transducers for automatic speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, 2012, in press. 133
-
IEEE Transactions on Audio, Speech, and Language Processing
, vol.2012
-
-
Kubo, Y.1
Watanabe, S.2
Hori, T.3
Nakamura, A.4
-
71
-
-
78049379945
-
Language model combination and adaptation using weighted finite state transducers
-
131
-
X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland, "Language model combination and adaptation using weighted finite state transducers," in Proc. ICASSP, 2010, pp. 5390-5393. 131
-
(2010)
Proc. ICASSP
, pp. 5390-5393
-
-
Liu, X.1
Gales, M.J.F.2
Hieronymus, J.L.3
Woodland, P.C.4
-
73
-
-
0003509020
-
-
PhD theses, Dept. of Computer Science, Carnegie-Mellon University, Pittsburgh, PA, USA 3, 27
-
B. Lowerre, "The HARPY speech recognition system," PhD theses, Dept. of Computer Science, Carnegie-Mellon University, Pittsburgh, PA, USA, 1976. 3, 27
-
(1976)
The HARPY Speech Recognition System
-
-
Lowerre, B.1
-
74
-
-
85135253868
-
Efficient general lattice generation and rescor-ing
-
88 94 126
-
A. Ljolje, F. Pereira, and M. Riley, "Efficient general lattice generation and rescor-ing," in Proc. Eurospeech, 1999, pp. 1251-1254. 88, 94, 126
-
(1999)
Proc. Eurospeech
, pp. 1251-1254
-
-
Ljolje, A.1
Pereira, F.2
Riley, M.3
-
75
-
-
78049355806
-
Discriminatively estimated joint acoustic, duration, and language model for speech recognition
-
DOI: 10.1109/ICASSP.2010.5495227133
-
M. Lehr and I. Shafran, "Discriminatively estimated joint acoustic, duration, and language model for speech recognition," in Proc. ICASSP, 2010, pp. 5542-5545. DOI: 10.1109/ICASSP.2010.5495227133
-
(2010)
Proc. ICASSP
, pp. 5542-5545
-
-
Lehr, M.1
Shafran, I.2
-
77
-
-
70450152774
-
Juicer: A weighted finite-state transducer decoder
-
136
-
D. Moore, J. Dines, M. Magimai Doss, J. Vepa, O. Cheng, and T. Hain, "Juicer: a weighted finite-state transducer decoder," Machine Learning for Multimodal In teraction, Lecture Notes in Computer Science, vol. 4299, pp. 285-296, 2006. 136
-
(2006)
Machine Learning for Multimodal in Teraction, Lecture Notes in Computer Science
, vol.4299
, pp. 285-296
-
-
Moore, D.1
Dines, J.2
Magimai Doss, M.3
Vepa, J.4
Cheng, O.5
Hain, T.6
-
78
-
-
34547522070
-
Discriminative training for large vocabulary speech recognition using minimum classification error
-
DOI: 10.1109/TASL.2006.876778132
-
E. McDermott, T. J. Hazen, J. Le Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition using minimum classification error," IEEE Transactions on Audio, Speech and Language Processing, vol. 15, pp. 203-223, 2007. DOI: 10.1109/TASL.2006. 876778132
-
(2007)
IEEE Transactions on Audio, Speech and Language Processing
, vol.15
, pp. 203-223
-
-
McDermott, E.1
Hazen, T.J.2
Le Roux, J.3
Nakamura, A.4
Katagiri, S.5
-
79
-
-
84872856906
-
-
web page 136
-
"The MIT FST Toolkit," web page http://people.csail.mit.edu/ ilh/fst. 136
-
The MIT FST Toolkit
-
-
-
80
-
-
84892889057
-
Generic epsilon-removal and input epsilon-normalization algorithms for weighted transducers
-
DOI: 10.1142/S012905410200099665
-
M. Mohri, "Generic epsilon-removal and input epsilon-normalization algorithms for weighted transducers," International Journal of Foundations of Computer Science, vol. 13(1), pp. 129-143, 2002. DOI: 10.1142/ S012905410200099665
-
(2002)
International Journal of Foundations of Computer Science
, vol.13
, Issue.1
, pp. 129-143
-
-
Mohri, M.1
-
81
-
-
70350376504
-
Weighted automata algorithms
-
M. Droste, W. Kuich, and H. Vogler, Eds. Springer-Verlag New York Inc. DOI: 10.1007/978-3-642-01492-5 56 57 61 65
-
M. Mohri, "Weighted automata algorithms," in Handbook of Weighted Automata, M. Droste, W. Kuich, and H. Vogler, Eds. Springer-Verlag New York Inc., 2009. DOI: 10.1007/978-3-642-01492-5 56, 57, 61, 65
-
(2009)
Handbook of Weighted Automata
-
-
Mohri, M.1
-
82
-
-
21444449828
-
Weighted automata in text and speech processing
-
Budapest, Hungary 4
-
M. Mohri, F. Pereira, and M. Riley, "Weighted automata in text and speech processing," in Proc. ECAI-96, Workshop on Extended Finite State Models of Language, Budapest, Hungary, 1996. 4
-
(1996)
Proc. ECAI-96, Workshop on Extended Finite State Models of Language
-
-
Mohri, M.1
Pereira, F.2
Riley, M.3
-
83
-
-
0012306376
-
The design principles of a weighted finite-state transducer library
-
DOI: 10.1016/S0304-3975(99)00014-6136
-
M. Mohri, F. Pereira, and M. Riley, "The design principles of a weighted finite-state transducer library," Theoretical Computer Science, vol. 231(1), pp. 17-32, 2000. DOI: 10.1016/S0304-3975(99)00014-6136
-
(2000)
Theoretical Computer Science
, vol.231
, Issue.1
, pp. 17-32
-
-
Mohri, M.1
Pereira, F.2
Riley, M.3
-
84
-
-
0036460907
-
Weighted finite-state transducers in speech recognition
-
DOI: 10.1006/csla.2001.0184 4 41 71 80 83 94 95
-
M. Mohri, F. Pereira, and M. Riley, "Weighted finite-state transducers in speech recognition," Computer Speech and Language, vol. 16, pp. 69-88, 2002. DOI: 10.1006/csla.2001.0184 4, 41, 71, 80, 83, 94, 95
-
(2002)
Computer Speech and Language
, vol.16
, pp. 69-88
-
-
Mohri, M.1
Pereira, F.2
Riley, M.3
-
85
-
-
33646939678
-
Weighted determinization and minimization for large vocabulary speech recognition
-
95
-
M. Mohri and M. Riley, "Weighted determinization and minimization for large vocabulary speech recognition," in Proc. Eurospeech, vol. 1, 1997, pp. 131-134. 95
-
(1997)
Proc. Eurospeech
, vol.1
, pp. 131-134
-
-
Mohri, M.1
Riley, M.2
-
86
-
-
85009070232
-
A weight pushing algorithm for large vocabulary speech recognition
-
91
-
M. Mohri and M.Riley, "A weight pushing algorithm for large vocabulary speech recognition," in Proc. Eurospeech, 2001, pp. 1603-1606. 91
-
(2001)
Proc. Eurospeech
, pp. 1603-1606
-
-
Mohri, M.1
Riley, M.2
-
87
-
-
44849112578
-
An algorithm for fast composition of weighted finite-state transducers
-
DOI: 10.1109/ASRU.2007.4430156 95 99
-
J. McDonough, E. Stoimenov, and D. Klakow, "An algorithm for fast composition of weighted finite-state transducers," in Proc. ASRU, 2007, pp. 461-466. DOI: 10.1109/ASRU.2007.4430156 95, 99
-
(2007)
Proc. ASRU
, pp. 461-466
-
-
McDonough, J.1
Stoimenov, E.2
Klakow, D.3
-
88
-
-
0021406359
-
The use of a one-stage dynamic programming algorithm for connected word recognition
-
Apr DOI: 10.1109/TASSP.1984.11643203
-
H. Ney, "The use of a one-stage dynamic programming algorithm for connected word recognition," IEEETransactions on Acoustics, Speech, and Signal Processing, vol. ASSP-32, no. 2, pp. 263-271, Apr. 1984. DOI: 10.1109/TASSP.1984.11643203
-
(1984)
IEEETransactions on Acoustics, Speech, and Signal Processing
, vol.ASSP-32
, Issue.2
, pp. 263-271
-
-
Ney, H.1
-
89
-
-
85017308347
-
Improvementsin beam search for 10000-word continuous speech recognition
-
DOI: 10.1109/89.27928727
-
H.Ney, R.Haeb-Umbach, B.Tran, and M.Oerder, "Improvementsin beam search for 10000-word continuous speech recognition," in Proc. ICASSP, vol. I, 1992, pp. 9-12. DOI: 10.1109/89.27928727
-
(1992)
Proc. ICASSP
, vol.1
, pp. 9-12
-
-
Ney, H.1
Haeb-Umbach, R.2
Tran, B.3
Oerder, M.4
-
90
-
-
70349227632
-
Generalization of specialized on-The-fly composition
-
DOI: 10.1109/ICASSP.2009.4960584 95 99 102
-
T. Oonishi, P. R. Dixon, K. Iwano, and S. Furui, "Generalization of specialized on-The-fly composition," in Proc. ICASSP, 2009, pp. 4317-4320. DOI: 10.1109/ICASSP.2009.4960584 95, 99, 102
-
(2009)
Proc. ICASSP
, pp. 4317-4320
-
-
Oonishi, T.1
Dixon, P.R.2
Iwano, K.3
Furui, S.4
-
91
-
-
84866849798
-
Optimization of on-The-fly composition for WFST-based speech recognition decoders
-
(in Japanese) 102 103
-
T. Oonishi, P. R. Dixon, K. Iwano, and S. Furui, "Optimization of on-The-fly composition for WFST-based speech recognition decoders," IEICE Transactions on Information and Systems, vol. J92-D, no. 7, pp. 1026-1035, 2009, (in Japanese). 102, 103
-
(2009)
IEICE Transactions on Information and Systems
, vol.J92-D
, Issue.7
, pp. 1026-1035
-
-
Oonishi, T.1
Dixon, P.R.2
Iwano, K.3
Furui, S.4
-
92
-
-
80051616419
-
Round-robin duel discriminative language models in one-pass decoding with on-The-fly error correction
-
DOI: 10.1109/ICASSP.2011.5947626131
-
T. Oba, T. Hori, A. Ito, and A. Nakamura, "Round-robin duel discriminative language models in one-pass decoding with on-The-fly error correction," in Proc. ICASSP, 2011, pp. 5588-5591. DOI: 10.1109/ICASSP.2011.5947626131
-
(2011)
Proc. ICASSP
, pp. 5588-5591
-
-
Oba, T.1
Hori, T.2
Ito, A.3
Nakamura, A.4
-
93
-
-
0030719155
-
A word graph algorithm for large vocabulary continuous speech recognition
-
DOI: 10.1006/csla.1996.0022 4 33 38 94
-
S.Ortmanns, H.Ney, and X.Aubert, "A word graph algorithm for large vocabulary continuous speech recognition," Computer Speech and Language, vol. 1, pp. 43-72, 1997. DOI: 10.1006/csla.1996.0022 4, 33, 38, 94
-
(1997)
Computer Speech and Language
, vol.1
, pp. 43-72
-
-
Ortmanns, S.1
Ney, H.2
Aubert, X.3
-
94
-
-
0030366694
-
Language-model look-ahead for large vocabulary speech recognition
-
DOI: 10.1109/ICSLP.1996.607215103
-
S. Ortmanns, H. Ney, and A. Eiden, "Language-model look-ahead for large vocabulary speech recognition," in Proc. ICSLP, 1996, pp. 2095-2098. DOI: 10.1109/ICSLP.1996.607215103
-
(1996)
Proc. ICSLP
, pp. 2095-2098
-
-
Ortmanns, S.1
Ney, H.2
Eiden, A.3
-
95
-
-
84872842078
-
-
web page 136
-
"OpenFst Library," web page http://www.openfst.org/twiki/bin/ view/FST/WebHome. 136
-
OpenFst Library
-
-
-
96
-
-
0024934084
-
Benchmark tests for DARPA resource management database performance evaluations
-
DOI: 10.1109/ICASSP.1989.2664821
-
D. S. Pallett, "Benchmark tests for DARPA resource management database performance evaluations," in Proc. ICASSP, 1989, pp. 536-539. DOI: 10.1109/ICASSP.1989.2664821
-
(1989)
Proc. ICASSP
, pp. 536-539
-
-
Pallett, D.S.1
-
97
-
-
0026368475
-
Algorithm for an optimal A&z.ast; Search and linearizing the search in the stack decoder
-
DOI: 10.1109/ICASSP.1991.1504344
-
D. B. Paul, "Algorithm for an optimal A&z.ast; search and linearizing the search in the stack decoder," in Proc. ICASSP, 1991, pp. 693-696. DOI: 10.1109/ICASSP.1991.1504344
-
(1991)
Proc. ICASSP
, pp. 693-696
-
-
Paul, D.B.1
-
98
-
-
78049502526
-
The subspace Gaussian mixture model-A structured model for speech recognition
-
April DOI: 10.1016/j.csl.2010.06.00316
-
D. Povey, L. Burget, M. Agarwal, P. Akyazi, F. Kai, A. Ghoshal, O. Glembek, N. Goel, M. Karafiat, A. Rastrow, R. C. Rosei, P. Schwarz, and S. Thomas, "The subspace Gaussian mixture model-A structured model for speech recognition," Computer Speech & Language, vol. 25, no. 2, pp. 404-439, April 2011. DOI: 10.1016/j.csl.2010.06.00316
-
(2011)
Computer Speech & Language
, vol.25
, Issue.2
, pp. 404-439
-
-
Povey, D.1
Burget, L.2
Agarwal, M.3
Akyazi, P.4
Kai, F.5
Ghoshal, A.6
Glembek, O.7
Goel, N.8
Karafiat, M.9
Rastrow, A.10
Rosei, R.C.11
Schwarz, P.12
Thomas, S.13
-
99
-
-
6744225722
-
DARPA ATIS test results June 1990
-
R. Stern, Ed. Morgan Kaufmann Publishers, Inc., June 1
-
D. S. Pallett, W. M. Fisher, J. G. Fiscus, and J. S. Garofolo, "DARPA ATIS test results June 1990," in Proc. Speech and Natural Language Workshop, R. Stern, Ed. Morgan Kaufmann Publishers, Inc., June 1990, pp. 114-121. 1
-
(1990)
Proc. Speech and Natural Language Workshop
, pp. 114-121
-
-
Pallett, D.S.1
Fisher, W.M.2
Fiscus, J.G.3
Garofolo, J.S.4
-
100
-
-
84858953642
-
The Kaldi speech recognition toolkit
-
136
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlicek, Y. Qian, P. Schwarz, J. Silovsky, G. Stemmer, and K. Vesely, "The Kaldi speech recognition toolkit," in Proc. ASRU, 2011. 136
-
(2011)
Proc. ASRU
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlicek, P.8
Qian, Y.9
Schwarz, P.10
Silovsky, J.11
Stemmer, G.12
Vesely, K.13
-
101
-
-
0002837345
-
Speech recognition by composition of weighted finite automata
-
MIT Press 4
-
F. Pereira and M. Riley, "Speech recognition by composition of weighted finite automata," in Finite-State Language Processing. MIT Press, 1996, pp. 431-453. 4
-
(1996)
Finite-State Language Processing
, pp. 431-453
-
-
Pereira, F.1
Riley, M.2
-
102
-
-
0242312781
-
Weighted rational transductions and their application to human language processing
-
DOI: 10.3115/1075812.10758704
-
F. Pereira, M. Riley, and R. Sproat, "Weighted rational transductions and their application to human language processing," in Proc. ARPA Workshop on Human Language technology, 1994, pp. 249-254. DOI: 10.3115/1075812.10758704
-
(1994)
Proc. ARPA Workshop on Human Language Technology
, pp. 249-254
-
-
Pereira, F.1
Riley, M.2
Sproat, R.3
-
103
-
-
0036296863
-
Minimum phone error and I-smoothing for improved discriminative training
-
DOI: 10.1109/ICASSP.2002.5743665 13 132
-
D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP, vol. I, 2002, pp. 105-108. DOI: 10.1109/ICASSP.2002.5743665 13, 132
-
(2002)
Proc. ICASSP
, vol.1
, pp. 105-108
-
-
Povey, D.1
Woodland, P.C.2
-
104
-
-
0027113267
-
Minimisation of acyclic deterministic automata in linear time
-
DOI: 10.1016/0304-3975(92)90142-364
-
D. Revuz, "Minimisation of acyclic deterministic automata in linear time," Theoretical Computer Science, vol. 92(1), pp. 181-189, 1992. DOI: 10.1016/0304-3975(92)90142-364
-
(1992)
Theoretical Computer Science
, vol.92
, Issue.1
, pp. 181-189
-
-
Revuz, D.1
-
105
-
-
84878410921
-
RASR-The RWTH Aachen university open source speech recognition toolkit
-
136
-
D.Rybach, S.Hahn, P.Lehnen, D.Nolden, M.Sundermeyer, Z.Tuske, S.Wiesler, R. Schluter, and N. Ney, "RASR-The RWTH Aachen university open source speech recognition toolkit," in Proc. ASRU, 2011. 136
-
(2011)
Proc. ASRU
-
-
Rybach, D.1
Hahn, S.2
Lehnen, P.3
Nolden, D.4
Sundermeyer, M.5
Tuske, Z.6
Wiesler, S.7
Schluter, R.8
Ney, N.9
-
106
-
-
0028194709
-
Connectionist probability estimators in HMM speech recognition
-
DOI: 10.1109/89.26035916
-
S. Renals, N. Morgan, H. Boulard, M. Cohen, and H. Franco, "Connectionist probability estimators in HMM speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 1, pp. 161-174, 1994. DOI: 10.1109/89.26035916
-
(1994)
IEEE Transactions on Speech and Audio Processing
, vol.2
, Issue.1
, pp. 161-174
-
-
Renals, S.1
Morgan, N.2
Boulard, H.3
Cohen, M.4
Franco, H.5
-
107
-
-
0002247642
-
Transducer composition for context-dependent network expansion
-
71
-
M. Riley, F. Pereira, and M. Mohri, "Transducer composition for context-dependent network expansion," in Proc. Eurospeech, 1997, pp. 1427-1430. 71
-
(1997)
Proc. Eurospeech
, pp. 1427-1430
-
-
Riley, M.1
Pereira, F.2
Mohri, M.3
-
109
-
-
85149106909
-
Discriminative language modeling with conditional random fields and the perceptron algorithm
-
DOI: 10.3115/1218955.1218962130
-
B. Roark, M. Saraclar, M. Collins, and M. Johnson, "Discriminative language modeling with conditional random fields and the perceptron algorithm," in Proc. ACL, 2004. DOI: 10.3115/1218955.1218962130
-
(2004)
Proc. ACL
-
-
Roark, B.1
Saraclar, M.2
Collins, M.3
Johnson, M.4
-
110
-
-
84867593936
-
Silence is golden: Modeling non-speech events in WFST-based dynamic network decoders
-
Kyoto, Japan DOI: 10.1109/ICASSP.2012.628884675
-
D. Rybach, R. Schluter, and H. Ney, "Silence is golden: modeling non-speech events in WFST-based dynamic network decoders," in Proc. ICASSP, Kyoto, Japan, 2012, pp. 4205-4208. DOI: 10.1109/ICASSP.2012.628884675
-
(2012)
Proc. ICASSP
, pp. 4205-4208
-
-
Rybach, D.1
Schluter, R.2
Ney, H.3
-
111
-
-
84872853828
-
-
web page 136
-
"The RWTH FSA Toolkit," web page http://www-i6.informatik.rwth- aachen.de/kanthak/fsa.html. 136
-
The RWTH FSA Toolkit
-
-
-
112
-
-
0026390882
-
A comparison of several approximate algorithms for finding multiple (N-best) sentence hypotheses
-
DOI: 10.1109/ICASSP.1991.15043639
-
R. Schwartz and Y. Austin, "A comparison of several approximate algorithms for finding multiple (N-best) sentence hypotheses," in Proc. ICASSP, 1990, pp. 701-704. DOI: 10.1109/ICASSP.1991.15043639
-
(1990)
Proc. ICASSP
, pp. 701-704
-
-
Schwartz, R.1
Austin, Y.2
-
113
-
-
80053442098
-
A similarity evaluation of speech patterns by dynamic programming (in japanese)
-
2
-
H. Sakoe and S. Chiba, "A similarity evaluation of speech patterns by dynamic programming (in japanese)," in the Dig. 1970 Nat. Meeting, Inst. Electrn. Comm. Eng. Japan, July 1970, p. 136. 2
-
(1970)
The Dig. 1970 Nat. Meeting, Inst. Electrn. Comm. Eng. Japan, July
, pp. 136
-
-
Sakoe, H.1
Chiba, S.2
-
114
-
-
0005670423
-
A dynamic programming approach to continuous speech recognition
-
Budapest, Hungary, Paper 20 C 13, August 2
-
H. Sakoe and S. Chiba, "A dynamic programming approach to continuous speech recognition," in Proc. ICA, Budapest, Hungary, Paper 20 C 13, August 1971, pp. 65-68. 2
-
(1971)
Proc. ICA
, pp. 65-68
-
-
Sakoe, H.1
Chiba, S.2
-
115
-
-
0025627406
-
The N-best algorithm:an efficient and exact procedure for finding the N most likely sentence hypotheses
-
DOI: 10.1109/ICASSP.1990.1155424
-
R.Schwartz andY.Chow, "The N-best algorithm:an efficient and exact procedure for finding the N most likely sentence hypotheses," in Proc. ICASSP, 1990, pp. 81-84. DOI: 10.1109/ICASSP.1990.1155424
-
(1990)
Proc. ICASSP
, pp. 81-84
-
-
Chow, Y.1
-
116
-
-
0033896970
-
Memory-efficient LVCSR search using a one-pass stack decoder
-
January DOI: 10.1006/csla.1999.01354
-
M. Schuster, "Memory-efficient LVCSR search using a one-pass stack decoder," Computer Speech & Language, vol. 14(1), pp. 47-77, January 2000. DOI: 10.1006/csla.1999.01354
-
(2000)
Computer Speech & Language
, vol.14
, Issue.1
, pp. 47-77
-
-
Schuster, M.1
-
117
-
-
0026370988
-
A tree-trellis based fast search for finding the N-best sentence hypotheses in continuous speech recognition
-
DOI: 10.1109/ICASSP.1991.1504374
-
F. K. Soong and E.-F. Huang, "A tree-trellis based fast search for finding the N-best sentence hypotheses in continuous speech recognition," in Proc. ICASSP, vol. 1, 1991, pp. 705-708. DOI: 10.1109/ICASSP.1991.1504374
-
(1991)
Proc. ICASSP
, vol.1
, pp. 705-708
-
-
Soong, F.K.1
Huang, E.-F.2
-
118
-
-
85009292190
-
EM training of finite-state transducers and its application to pronunciation modeling
-
132
-
H. Shu and I. L. Hetherington, "EM training of finite-state transducers and its application to pronunciation modeling," in Proc. ICSLP, 2002, pp. 1293-1296. 132
-
(2002)
Proc. ICSLP
, pp. 1293-1296
-
-
Shu, H.1
Hetherington, I.L.2
-
119
-
-
33645768509
-
Efficient generation of high-order context-dependent weighted finite state transducers for speech recognition
-
DOI: 10.1109/ICASSP.2005.1415085132
-
M. Schuster and T. Hori, "Efficient generation of high-order context-dependent weighted finite state transducers for speech recognition," in Proc. ICASSP, 2005, pp. 201-204. DOI: 10.1109/ICASSP.2005. 1415085132
-
(2005)
Proc. ICASSP
, pp. 201-204
-
-
Schuster, M.1
Hori, T.2
-
120
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
16
-
F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech, 2011, pp. 437-440. 16
-
(2011)
Proc. Interspeech
, pp. 437-440
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
121
-
-
33947638544
-
Modeling polyphone context with weighted finite-state transducers
-
DOI: 10.1109/ICASSP.2006.1659972132
-
E. Stoimenov and J. McDonough, "Modeling polyphone context with weighted finite-state transducers," in Proc. ICASSP, vol. I, 2006, pp. 121-124. DOI: 10.1109/ICASSP.2006.1659972132
-
(2006)
Proc. ICASSP
, vol.1
, pp. 121-124
-
-
Stoimenov, E.1
McDonough, J.2
-
122
-
-
0030362785
-
Multilingual text analysis for text-to-speech synthesis
-
DOI: 10.1017/S1351324997001654135
-
R. Sproat, "Multilingual text analysis for text-to-speech synthesis," in Proc. ICSLP, vol. 3, 1996, pp. 1365-1368. DOI: 10.1017/S1351324997001654135
-
(1996)
Proc. ICSLP
, vol.3
, pp. 1365-1368
-
-
Sproat, R.1
-
124
-
-
0033906251
-
MDL-based context-dependent subword modeling for speech recognition
-
DOI: 10.1250/ast.21.7919
-
K.Shinoda andT.Watanabe, "MDL-based context-dependent subword modeling for speech recognition," Acoustic Science and Technology, vol. 21, no. 2, pp. 79-86, 2000. DOI: 10.1250/ast.21.7919
-
(2000)
Acoustic Science and Technology
, vol.21
, Issue.2
, pp. 79-86
-
-
Shinoda, K.1
Watanabe, T.2
-
125
-
-
0029765807
-
Spontaneous dialogue speech recognition using cross-word context constrained word graphs
-
DOI: 10.1109/ICASSP.1996.540311126
-
T. Shimizu, H. Yamamoto, H. Masataki, S. Matsunaga, and Y. Sagisaka, "Spontaneous dialogue speech recognition using cross-word context constrained word graphs," in Proc. ICASSP, 1996, pp. 145-148. DOI: 10.1109/ICASSP.1996.540311126
-
(1996)
Proc. ICASSP
, pp. 145-148
-
-
Shimizu, T.1
Yamamoto, H.2
Masataki, H.3
Matsunaga, S.4
Sagisaka, Y.5
-
126
-
-
78049374440
-
A discriminative model for continuous speech recognition based on weighted finite state transducers
-
DOI: 10.1109/ICASSP.2010.5495096133
-
S.Watanabe, T. Hori, E. McDermott, and A. Nakamura, "A discriminative model for continuous speech recognition based on weighted finite state transducers," in Proc. ICASSP, 2010, pp. 4922-4925. DOI: 10.1109/ICASSP.2010.5495096133
-
(2010)
Proc. ICASSP
, pp. 4922-4925
-
-
Watanabe, S.1
Hori, T.2
McDermott, E.3
Nakamura, A.4
-
127
-
-
85009110509
-
Time and memory efficient Viterbi decoding for LVCSR using a precompiled search network
-
95, 110 111
-
D. Willett, E. McDermott, Y. Minami, and S. Katagiri, "Time and memory efficient Viterbi decoding for LVCSR using a precompiled search network," in Proc. Eurospeech, 2001, pp. 847-850. 95, 110, 111
-
(2001)
Proc. Eurospeech
, pp. 847-850
-
-
Willett, D.1
McDermott, E.2
Minami, Y.3
Katagiri, S.4
-
128
-
-
3042741069
-
Variational Bayesian estimation and clustering for speech recognition
-
DOI: 10.1109/TSA.2004.82864019
-
S. Watanabe, Y. Minami, A. Nakamura, and N. Ueda, "Variational Bayesian estimation and clustering for speech recognition," IEEE Transactions on Speech and Audio Processing, vol.12, pp.365-381, 2004.DOI: 10.1109/TSA.2004.82864019
-
(2004)
IEEE Transactions on Speech and Audio Processing
, vol.12
, pp. 365-381
-
-
Watanabe, S.1
Minami, Y.2
Nakamura, A.3
Ueda, N.4
-
129
-
-
0002144369
-
Tree-based state tying for high accuracy acoustics modeling
-
DOI: 10.3115/1075812.107588518
-
S. J. Young, J. J. Odell, and P. C. Woodland, "Tree-based state tying for high accuracy acoustics modeling," in Proc. ARPA Human Language Technology Workshop, 1994, pp. 307-312. DOI: 10.3115/1075812.107588518
-
(1994)
Proc. ARPA Human Language Technology Workshop
, pp. 307-312
-
-
Young, S.J.1
Odell, J.J.2
Woodland, P.C.3
-
130
-
-
84867598134
-
A general discriminative training algorithm for speech recognition using weighted finite-state transducers
-
Kyoto, Japan DOI: 10.1109/ICASSP.2012.6288849132
-
Y. Zhao, A. Ljolje, D. Caseiro, and B.-H. Juang, "A general discriminative training algorithm for speech recognition using weighted finite-state transducers," in Proc. ICASSP, Kyoto, Japan, 2012, pp. 4217-4220. DOI: 10.1109/ICASSP.2012.6288849132
-
(2012)
Proc. ICASSP
, pp. 4217-4220
-
-
Zhao, Y.1
Ljolje, A.2
Caseiro, D.3
Juang, B.-H.4
-
131
-
-
77949370075
-
A segmental CRF approach to large vocabulary continuous speech recognition
-
DOI: 10.1109/ASRU.2009.5372916133
-
G. Zweig and P. Nguyen, "A segmental CRF approach to large vocabulary continuous speech recognition," in Proc. ASRU, 2009, pp. 152-157. DOI: 10.1109/ASRU.2009.5372916133
-
(2009)
Proc. ASRU
, pp. 152-157
-
-
Zweig, G.1
Nguyen, P.2
|