SCOPUS information retrieval platform

Synthesis Lectures on Speech and Audio Processing

Volume 10, 2013, Pages 1-162

Speech recognition algorithms using weighted finite-state transducers

(2) Hori, Takaaki a Nakamura, Atsushi a

a Japan (Japan)

Author keywords

automaton; decoder; optimization; speech recognition; Viterbi algorithm; weighted finite state transducer

Indexed keywords

AUTOMATON; BEST MATCH; BLACK BOXES; COMPUTATIONAL COSTS; CONVENTIONAL METHODS; DECODER; DECODING ALGORITHM; DECODING PROCESS; DECODING SPEED; IMPLEMENTATION TECHNIQUES; RECOGNITION ALGORITHM; RECOGNITION ERROR; SEARCH PROBLEM; SPEECH RECOGNITION TECHNOLOGY; SPEECH RECOGNIZER; SPEECH SIGNALS; SPOKEN LANGUAGE PROCESSING; VOCABULARY SIZE; WEIGHTED FINITE-STATE TRANSDUCERS;

AUTOMATA THEORY; DECODING; OPTIMIZATION; REVIEWS; TRANSDUCERS; VITERBI ALGORITHM;

SPEECH RECOGNITION;

EID: 84872838032 PISSN: 1932121X EISSN: 19321678 Source Type: Book Series
DOI: 10.2200/S00462ED1V01Y201212SAP010 Document Type: Article

Times cited : (16)

References (131)

1
- 0028996974
- Language model representations for beam-search decoding
- DOI: 10.1109/ICASSP.1995.479666 33
- G. Antoniol, F. Brugnara, M. Cettolo, and M. Federico, "Language model representations for beam-search decoding," in Proc. ICASSP, 1995, pp. 588-591. DOI: 10.1109/ICASSP.1995.479666 33
- (1995) Proc. ICASSP , pp. 588-591
- Antoniol, G.¹ Brugnara, F.² Cettolo, M.³ Federico, M.⁴

2
- 0003415652
- Addison-Wesley Publishing Company
- A. V. Aho, J. E. Hopcroft, and J. D. Ullman, The Design and Analysis of Computer Algorithms. Addison-Wesley Publishing Company, 1974. 4
- (1974) The Design and Analysis of Computer Algorithms
- Aho, A.V.¹ Hopcroft, J.E.² Ullman, J.D.³

3
- 33745219793
- General indexation of weighted automata-application to spoken utterance retrieval
- C. Allauzen and M. Mohri, "General indexation of weighted automata-application to spoken utterance retrieval," in Proc. HLT-NAACL, 2004. 135
- (2004) Proc. HLT-NAACL
- Allauzen, C.¹ Mohri, M.²

4
- 85149121374
- Generalized algorithms for constructing statistical language models
- DOI: 10.3115/1075096.1075102 77, 130
- C. Allauzen, M. Mohri, and B. Roark, "Generalized algorithms for constructing statistical language models," in Proc. ACL, 2003, pp. 40-47. DOI: 10.3115/1075096.1075102 77, 130
- (2003) Proc. ACL , pp. 40-47
- Allauzen, C.¹ Mohri, M.² Roark, B.³

5
- 4544339437
- A generalized construction of integrated speech recognition transducers
- DOI: 10.1109/ICASSP.2004.1326097 91
- C. Allauzen, M. Mohri, M. Riley, and B. Roark, "A generalized construction of integrated speech recognition transducers," in Proc. ICASSP, vol. I, 2004, pp. 761-764. DOI: 10.1109/ICASSP.2004.1326097 91
- (2004) Proc. ICASSP , vol.1 , pp. 761-764
- Allauzen, C.¹ Mohri, M.² Riley, M.³ Roark, B.⁴

6
- 38149133882
- OpenFst: A general and efficient weighted finite-state transducer library
- DOI: 10.1007/978-3-540-76336-9_3 136
- C. Allauzen, M. Riley, J. Schalkwyk, W. Skut, and M. Mohri, "OpenFst: A general and efficient weighted finite-state transducer library," in Proc. of CIAA, 2007, pp. 11-23. DOI: 10.1007/978-3-540-76336-9_3 136
- (2007) Proc. of CIAA , pp. 11-23
- Allauzen, C.¹ Riley, M.² Schalkwyk, J.³ Skut, W.⁴ Mohri, M.⁵

7
- 70450183653
- A generalized composition algorithm for weighted finite-state transducers
- 71, 91, 95, 99, 104, 105, 107
- C. Allauzen, M. Riley, and J. Schalkwyk, "A generalized composition algorithm for weighted finite-state transducers," in Proc. Interspeech, 2009, pp. 1203-1206. 71, 91, 95, 99, 104, 105, 107
- (2009) Proc. Interspeech , pp. 1203-1206
- Allauzen, C.¹ Riley, M.² Schalkwyk, J.³

8
- 84855817752
- A filter-based algorithm for efficient composition of finite-state transducers
- DOI: 10.1142/S0129054111009033 105, 106
- C. Allauzen, M. Riley, and J. Schalkwyk, "A filter-based algorithm for efficient composition of finite-state transducers," International Journal of Foundations of Computer Science, 2011. DOI: 10.1142/S0129054111009033 105, 106
- (2011) International Journal of Foundations of Computer Science
- Allauzen, C.¹ Riley, M.² Schalkwyk, J.³

9
- 0026382117
- The forward-backward search algorithm
- DOI: 10.1109/ICASSP.1991.150435 4
- S. Austin, R. Schwartz, and P. Placeway, "The forward-backward search algorithm," in Proc. ICASSP, vol. 1, 1991, pp. 697-700. DOI: 10.1109/ICASSP.1991.150435 4
- (1991) Proc. ICASSP , vol.1 , pp. 697-700
- Austin, S.¹ Schwartz, R.² Placeway, P.³

10
- 0036460898
- An overview of decoding techniques for large vocabulary continuous speech recognition
- DOI: 10.1006/csla.2001.0185 4
- X. L. Aubert, "An overview of decoding techniques for large vocabulary continuous speech recognition," Computer Speech and Language, vol. 16, pp. 89-114, 2002. DOI: 10.1006/csla.2001.0185 4
- (2002) Computer Speech and Language , vol.16 , pp. 89-114
- Aubert, X.L.¹

11
- 84987256786
- An algorithm for connected word recognition
- 3
- J. S. Bridle, M. D. Brown, and R. M. Chamberlain, "An algorithm for connected word recognition," in Proc. ICASSP, 1982, pp. 899-902. 3
- (1982) Proc. ICASSP , pp. 899-902
- Bridle, J.S.¹ Brown, M.D.² Chamberlain, R.M.³

12
- 84865035841
- Minimization of automata
- abs/1010.5318
- J. Berstel, L. Boasson, O. Carton, and I. Fagnot, "Minimization of automata," CoRR, vol. abs/1010.5318, 2010. 61
- (2010) CoRR , vol.abs/1010.5318
- Berstel, J.¹ Boasson, L.² Carton, O.³ Fagnot, I.⁴

13
- 85135168435
- Improvements in tree-based language model representation
- F. Brugnara and M. Cettolo, "Improvements in tree-based language model representation," in Proc. EUROSPEECH, 1995, pp. 1797-1800. 33
- (1995) Proc. EUROSPEECH , pp. 1797-1800
- Brugnara, F.¹ Cettolo, M.²

14
- 0004009767
- New Jersey: Princeton Univ. Press
- R. Bellman and S. Dreyfus, Applied Dynamic Programming. New Jersey: Princeton Univ. Press, 1962. 2, 16
- (1962) Applied Dynamic Programming
- Bellman, R.¹ Dreyfus, S.²

15
- 85022919385
- Class-based n-gram models of natural language
- 130
- P. F. Brown, P. V. deSouza, R. L. Mercer, V. J. Della Pietra, and J. C. Lai, "Class-based n-gram models of natural language," Computational Linguistics, vol. 18(4), pp. 467-479, 1992. 130
- (1992) Computational Linguistics , vol.18 , Issue.4 , pp. 467-479
- Brown, P.F.¹ Desouza, P.V.² Mercer, R.L.³ Della Pietra, V.J.⁴ Lai, J.C.⁵

16
- 0026400222
- Decision trees for phonological rules in continuous speech
- DOI: 10.1109/ICASSP.1991.150308 18, 131
- L. R. Bahl, P. V. de Souza, and P. S. Gopalakrishnan, "Decision trees for phonological rules in continuous speech," in Proc. ICASSP, 1991, pp. 185-188. DOI: 10.1109/ICASSP.1991.150308 18, 131
- (1991) Proc. ICASSP , pp. 185-188
- Bahl, L.R.¹ De Souza, P.V.² Gopalakrishnan, P.S.³

17
- 70349521673
- Robust understanding in multimodal interfaces
- DOI: 10.1162/coli.08-022-R2-06-26 135
- S. Bangalore and M. Johnston, "Robust understanding in multimodal interfaces," Computational Linguistics, vol. 35, no. 3, pp. 345-397, 2009. DOI: 10.1162/coli.08-022-R2-06-26 135
- (2009) Computational Linguistics , vol.35 , Issue.3 , pp. 345-397
- Bangalore, S.¹ Johnston, M.²

18
- 0020719320
- Maximum likelihood approach to continuous speech recognition
- Mar DOI: 10.1109/TPAMI.1983.4767370 9
- L. R. Bahl, F. Jelinek, and R. L. Mercer, "Maximum likelihood approach to continuous speech recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-5, no. 2, pp. 179-190, Mar. 1983. DOI: 10.1109/TPAMI.1983.4767370 9
- (1983) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.PAMI-5 , Issue.2 , pp. 179-190
- Bahl, L.R.¹ Jelinek, F.² Mercer, R.L.³

19
- 0017216776
- Testing for the consecutive ones property, interval graphs, and graph planarity using pq-tree algorithms
- DOI: 10.1016/S0022-0000(76)80045-1 109
- K. Booth and G. Lueker, "Testing for the consecutive ones property, interval graphs, and graph planarity using pq-tree algorithms," Journal of Computer and System Sciences, vol. 13, pp. 335-379, 1976. DOI: 10.1016/S0022-0000(76)80045-1 109
- (1976) Journal of Computer and System Sciences , vol.13 , pp. 335-379
- Booth, K.¹ Lueker, G.²

20
- 0034854347
- Joint prosody prediction and unit selection for concatenative speech synthesis
- DOI: 10.1109/ICASSP.2001.941031 135
- I. Bulyko and M. Ostendorf, "Joint prosody prediction and unit selection for concatenative speech synthesis," in Proc. ICASSP, vol. 2, 2001, pp. 781-784. DOI: 10.1109/ICASSP.2001.941031 135
- (2001) Proc. ICASSP , vol.2 , pp. 781-784
- Bulyko, I.¹ Ostendorf, M.²

21
- 0036663562
- Efficient integrated response generation from multiple targets using weighted finite state transducers
- DOI: 10.1016/S0885-2308(02)00023-2 135
- I. Bulyko and M. Ostendorf, "Efficient integrated response generation from multiple targets using weighted finite state transducers," Computer Speech and Language, vol. 16(3-4), pp. 533-550, 2002. DOI: 10.1016/S0885-2308(02)00023-2 135
- (2002) Computer Speech and Language , vol.16 , Issue.3-4 , pp. 533-550
- Bulyko, I.¹ Ostendorf, M.²

22
- 84962861457
- Finite-state transducers for speech-input translation
- DOI: 10.1109/ASRU.2001.1034664 133
- F. Casacuberta, "Finite-state transducers for speech-input translation," in Proc. ASRU, 2001, pp. 375-380. DOI: 10.1109/ASRU.2001.1034664 133
- (2001) Proc. ASRU , pp. 375-380
- Casacuberta, F.¹

23
- 34547544207
- A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition
- DOI: 10.1109/ICASSP.2007.366920 95, 99
- O. Cheng, J. Dines, and M. M. Doss, "A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition," in Proc. ICASSP, 2007, pp. 348-351. DOI: 10.1109/ICASSP.2007.366920 95, 99
- (2007) Proc. ICASSP , pp. 348-351
- Cheng, O.¹ Dines, J.² Doss, M.M.³

24
- 84925661323
- Rational kernels: Theory and algorithms
- C. Cortes, P. Haffner, and M. Mohri, "Rational kernels: Theory and algorithms," The Journal of Machine Learning Research, vol. 5, pp. 1035-1062, 2004.
- (2004) The Journal of Machine Learning Research , vol.5 , pp. 1035-1062
- Cortes, C.¹ Haffner, P.² Mohri, M.³

25
- 84962787683
- Transducer composition for "on-the-fly" lexicon and language model integration
- DOI: 10.1109/ASRU.2001.1034667 95, 99
- D. Caseiro and I. Trancoso, "Transducer composition for 'on-the-fly' lexicon and language model integration," in Proc. ASRU, 2001, pp. 393-396. DOI: 10.1109/ASRU.2001.1034667 95, 99
- (2001) Proc. ASRU , pp. 393-396
- Caseiro, D.¹ Trancoso, I.²

26
- 0141480004
- A tail-sharing WFST composition for large vocabulary speech recognition
- DOI: 10.1109/ICASSP.2003.1198791 95
- D. Caseiro and I. Trancoso, "A tail-sharing WFST composition for large vocabulary speech recognition," in Proc. ICASSP, vol. I, 2003, pp. 356-359. DOI: 10.1109/ICASSP.2003.1198791 95
- (2003) Proc. ICASSP , vol.1 , pp. 356-359
- Caseiro, D.¹ Trancoso, I.²

27
- 34047273021
- A specialized on-the-fly algorithm for lexicon and language model composition
- DOI: 10.1109/TSA.2005.860838 99
- D. Caseiro and I. Trancoso, "A specialized on-the-fly algorithm for lexicon and language model composition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 4, pp. 1281-1291, 2006. DOI: 10.1109/TSA.2005.860838 99
- (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , Issue.4 , pp. 1281-1291
- Caseiro, D.¹ Trancoso, I.²

28
- 84966356313
- Grapheme-to-phone using finite-state transducers
- DOI: 10.1109/WSS.2002.1224412 135
- D. Caseiro, L. Trancoso, L. Oliveira, and C. Viana, "Grapheme-to-phone using finite-state transducers," in Proc. IEEE Workshop on Speech Synthesis, 2002, pp. 215-218. DOI: 10.1109/WSS.2002.1224412 135
- (2002) Proc. IEEE Workshop on Speech Synthesis , pp. 215-218
- Caseiro, D.¹ Trancoso, L.² Oliveira, L.³ Viana, C.⁴

29
- 44849131087
- The TITECH large vocabulary WFST speech recognition system
- DOI: 10.1109/ASRU.2007.4430153 136
- P. R. Dixon, D. Caseiro, T. Oonishi, and S. Furui, "The TITECH large vocabulary WFST speech recognition system," in Proc. ASRU, 2007, pp. 443-448. DOI: 10.1109/ASRU.2007.4430153 136
- (2007) Proc. ASRU , pp. 443-448
- Dixon, P.R.¹ Caseiro, D.² Oonishi, T.³ Furui, S.⁴

30
- 84962878172
- Incremental language models for speech recognition using finite-state transducers
- DOI: 10.1109/ASRU.2001.1034620 95, 110
- H. J. G. A. Dolfing and I. L. Hetherington, "Incremental language models for speech recognition using finite-state transducers," in Proc. ASRU, 2001, pp. 194-197. DOI: 10.1109/ASRU.2001.1034620 95, 110
- (2001) Proc. ASRU , pp. 194-197
- Dolfing, H.J.G.A.¹ Hetherington, I.L.²

31
- 84867588266
- A comparison of dynamic WFST decoding approaches
- Kyoto, Japan DOI: 10.1109/ICASSP.2012.6288847 93, 126
- P. R. Dixon, C. Hori, and H. Kashioka, "A comparison of dynamic WFST decoding approaches," in Proc. ICASSP, Kyoto, Japan, 2012, pp. 4209-4212. DOI: 10.1109/ICASSP.2012.6288847 93, 126
- (2012) Proc. ICASSP , pp. 4209-4212
- Dixon, P.R.¹ Hori, C.² Kashioka, H.³

32
- 85149140805
- Parameter estimation for probabilistic finite-state transducers
- DOI: 10.3115/1073083.1073085 133
- J. Eisner, "Parameter estimation for probabilistic finite-state transducers," in Proc. ACL, 2002, pp. 1-8. DOI: 10.3115/1073083.1073085 133
- (2002) Proc. ACL , pp. 1-8
- Eisner, J.¹

33
- 0023776398
- The DARPA 1000-word resource management database for continuous speech recognition
- DOI: 10.1109/ICASSP.1988.196669 1
- W. M. Fisher, J. Bernstein, and D. S. Pallett, "The DARPA 1000-word resource management database for continuous speech recognition," in Proc. ICASSP, vol. 1, 1988, pp. 651-654. DOI: 10.1109/ICASSP.1988.196669 1
- (1988) Proc. ICASSP , vol.1 , pp. 651-654
- Fisher, W.M.¹ Bernstein, J.² Pallett, D.S.³

34
- 33745207361
- A Japanese national project on spontaneous speech corpus and processing technology
- 83
- S. Furui, K. Maekawa, and H. Isahara, "A Japanese national project on spontaneous speech corpus and processing technology," in Proc. of ASR, 2000, pp. 244-248. 83
- (2000) Proc. of ASR , pp. 244-248
- Furui, S.¹ Maekawa, K.² Isahara, H.³

35
- 84872849317
- web page
- "AT&T FSM Library," web page http://www.itl.nist.gov/iad/mig/tests/rt/2009/index.html. 136
- AT&T FSM Library

36
- 0022667694
- Speaker-independent isolated word recognition using dynamic features of speech spectrum
- DOI: 10.1109/TASSP.1986.1164788 12
- S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 34, no. 1, pp. 52-59, 1986. DOI: 10.1109/TASSP.1986.1164788 12
- (1986) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.34 , Issue.1 , pp. 52-59
- Furui, S.¹

37
- 84867198032
- Silence models in weighted finite-state transducers
- 75
- P. Garner, "Silence models in weighted finite-state transducers," in Proc. Interspeech, Brisbane, Australia, 2008, pp. 1817-1820. 75
- (2008) Proc. Interspeech, Brisbane, Australia , pp. 1817-1820
- Garner, P.¹

38
- 0028996969
- A tree search strategy for large-vocabulary continuous speech recognition
- DOI: 10.1109/ICASSP.1995.479662 4
- P. S. Gopalakrishnan, L. R. Bahl, and R. L. Mercer, "A tree search strategy for large-vocabulary continuous speech recognition," in Proc. ICASSP, 1995, pp. 572-575. DOI: 10.1109/ICASSP.1995.479662 4
- (1995) Proc. ICASSP , pp. 572-575
- Gopalakrishnan, P.S.¹ Bahl, L.R.² Mercer, R.L.³

39
- 0000803388
- The population frequencies of species and the estimation of population parameters
- DOI: 10.2307/2333344 22
- I. J. Good, "The population frequencies of species and the estimation of population parameters," Biometrika, vol. 40, no. 3-4, pp. 237-264, 1953. DOI: 10.2307/2333344 22
- (1953) Biometrika , vol.40 , Issue.3-4 , pp. 237-264
- Good, I.J.¹

40
- 0004056285
- Prentice Hall 12
- X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall, 2001. 12
- (2001) Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
- Huang, X.¹ Acero, A.² Hon, H.-W.³

41
- 0033709098
- Tandem connectionist feature extraction for conventional HMM systems
- DOI: 10.1109/ICASSP.2000.862024 12
- H. Hermansky, D. P. W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. ICASSP, vol. 3, 2000, pp. 1635-1638. DOI: 10.1109/ICASSP.2000.862024 12
- (2000) Proc. ICASSP , vol.3 , pp. 1635-1638
- Hermansky, H.¹ Ellis, D.P.W.² Sharma, S.³

42
- 85009152019
- The MIT finite-state transducer toolkit for speech and language processing
- 136
- I. L. Hetherington, "The MIT finite-state transducer toolkit for speech and language processing," in Proc. Interspeech-ICSLP, 2004. 136
- (2004) Proc. Interspeech-ICSLP
- Hetherington, I.L.¹

43
- 33745188707
- A multi-pass, dynamic-vocabulary approach to real-time, large-vocabulary speech recognition
- I. L. Hetherington, "A multi-pass, dynamic-vocabulary approach to real-time, large-vocabulary speech recognition," in Proc. Interspeech-Eurospeech, 2005, pp. 545-548. 131
- (2005) Proc. Interspeech-Eurospeech , pp. 545-548
- Hetherington, I.L.¹

44
- 17444406567
- The ATIS spoken language systems pilot corpus
- Hidden Valley, Pennsylvania June DOI: 10.3115/116580.1166131
- C.T.Hemphill, J.J.Godfrey, and G.R.Doddington, "The ATIS spoken language systems pilot corpus," in DARPA Speech and Natural Language Workshop, Hidden Valley, Pennsylvania, June 1990. DOI: 10.3115/116580.1166131
- (1990) DARPA Speech and Natural Language Workshop
- Hemphill, C.T.¹ Godfrey, J.J.² Doddington, G.R.³

45
- 85009204481
- Speech summarization using weighted finite-state transducers
- 134
- T. Hori, C. Hori, and Y. Minami, "Speech summarization using weighted finite-state transducers," in Proc. Eurospeech, 2003, pp. 2817-2820. 134
- (2003) Proc. Eurospeech , pp. 2817-2820
- Hori, T.¹ Hori, C.² Minami, Y.³

46
- 85009063824
- Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition
- 6, 95, 110
- T. Hori, C. Hori, and Y. Minami, "Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition," in Proc. Interspeech-ICSLP, vol. 1, 2004, pp. 289-292. 6, 95, 110
- (2004) Proc. Interspeech-ICSLP , vol.1 , pp. 289-292
- Hori, T.¹ Hori, C.² Minami, Y.³

47
- 45849093239
- Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition
- DOI: 10.1109/TASL.2006.889790 6, 95, 110
- T. Hori, C. Hori, Y. Minami, and A. Nakamura, "Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 4, pp. 1352-1365, 2007. DOI: 10.1109/TASL.2006.889790 6, 95, 110
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.4 , pp. 1352-1365
- Hori, T.¹ Hori, C.² Minami, Y.³ Nakamura, A.⁴

48
- 83755196741
- WFST enabled solutions to ASR problems: Beyond HMM decoding
- DOI: 10.1109/TASL.2011.2162402 132
- B. Hoffmeister, G. Heigold, D. Rybach, R. Schluter, and H. Ney, "WFST enabled solutions to ASR problems: Beyond HMM decoding," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20(2), pp. 551-564, 2012. DOI: 10.1109/TASL.2011.2162402 132
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.2 , pp. 551-564
- Hoffmeister, B.¹ Heigold, G.² Rybach, D.³ Schluter, R.⁴ Ney, H.⁵

49
- 0003620778
- Addison-Wesley Publishing Company 4, 45, 75
- J. E. Hopcroft, R. Motwani, and J. D. Ullman, Introduction to Automata Theory, Languages, and Computation, 3rd ed. Addison-Wesley Publishing Company, 2006. 4, 45, 75
- (2006) Introduction to Automata Theory, Languages, and Computation, 3rd Ed
- Hopcroft, J.E.¹ Motwani, R.² Ullman, J.D.³

50
- 33646426591
- Generalized fast on-the-fly composition algorithm for WFST-based speech recognition
- 94, 95, 110, 130
- T. Hori and A. Nakamura, "Generalized fast on-the-fly composition algorithm for WFST-based speech recognition," in Proc. Interspeech-Eurospeech, 2005, pp. 557-560. 94, 95, 110, 130
- (2005) Proc. Interspeech-Eurospeech , pp. 557-560
- Hori, T.¹ Nakamura, A.²

51
- 84867199378
- Dialog management using weighted finite-state transducers
- DOI: 10.1109/ASRU.2009.5373350 136
- C. Hori, K. Ohtaki, T. Misu, H. Kashioka, and S. Nakamura, "Dialog management using weighted finite-state transducers," in Proc. Interspeech, 2008, pp. 211-214. DOI: 10.1109/ASRU.2009.5373350 136
- (2008) Proc. Interspeech , pp. 211-214
- Hori, C.¹ Ohtaki, K.² Misu, T.³ Kashioka, H.⁴ Nakamura, S.⁵

52
- 70349207745
- Statistical dialog management applied to WFST-based dialog systems
- DOI: 10.1109/ICASSP.2009.4960703 136
- C. Hori, K. Ohtaki, T. Misu, H. Kashioka, and S. Nakamura, "Statistical dialog management applied to WFST-based dialog systems," in Proc. ICASSP, 2009, pp. 4793-4796. DOI: 10.1109/ICASSP.2009.4960703 136
- (2009) Proc. ICASSP , pp. 4793-4796
- Hori, C.¹ Ohtaki, K.² Misu, T.³ Kashioka, H.⁴ Nakamura, S.⁵

53
- 33947677731
- Flexible multi-stream framework for speech recognition using multi-tape finite-state transducers
- DOI: 10.1109/ICASSP.2006.1660046 132
- I. L. Hetherington, H. Shu, and J. R. Glass, "Flexible multi-stream framework for speech recognition using multi-tape finite-state transducers," in Proc. ICASSP, 2006, pp. 417-420. DOI: 10.1109/ICASSP.2006.1660046 132
- (2006) Proc. ICASSP , pp. 417-420
- Hetherington, I.L.¹ Shu, H.² Glass, J.R.³

54
- 70349226871
- A multimedia retrieval system using speech input
- DOI: 10.1145/1647314.1647356 132
- G. Heigold, R. Schluter, and H. Ney, "A multimedia retrieval system using speech input," in Proc. ICASSP, 2009, pp. 3749-3752. DOI: 10.1145/1647314.1647356 132
- (2009) Proc. ICASSP , pp. 3749-3752
- Heigold, G.¹ Schluter, R.² Ney, H.³

55
- 0141480041
- Language model adaptation using WFST-based speaking-style translation
- DOI: 10.1109/ICASSP.2003.1198759 131
- T. Hori, D. Willett, and Y. Minami, "Language model adaptation using WFST-based speaking-style translation," in Proc. ICASSP, vol. I, 2003, pp. 228-231. DOI: 10.1109/ICASSP.2003.1198759 131
- (2003) Proc. ICASSP , vol.1 , pp. 228-231
- Hori, T.¹ Willett, D.² Minami, Y.³

56
- 77954609645
- Paraphrasing spontaneous speech using weighted finite-state transducers
- 134
- T. Hori, D. Willett, and Y. Minami, "Paraphrasing spontaneous speech using weighted finite-state transducers," in Proc. SSPR, 2003. 134
- (2003) Proc. SSPR
- Hori, T.¹ Willett, D.² Minami, Y.³

57
- 70349208656
- A flat direct model for speech recognition
- DOI: 10.1109/ICASSP.2009.4960470 133
- G. Heigold, G. Zweig, and P. Nguyen, "A flat direct model for speech recognition," in Proc. ICASSP, 2009, pp. 3861-3864. DOI: 10.1109/ICASSP.2009.4960470 133
- (2009) Proc. ICASSP , pp. 3861-3864
- Heigold, G.¹ Zweig, G.² Nguyen, P.³

58
- 0040261516
- Language modeling with sentence-level mixtures
- DOI: 10.3115/1075812.1075828 131
- R. Iyer, M. Ostendorf, and J. R. Rohlicek, "Language modeling with sentence-level mixtures," in Proc. Workshop on Human Language Technology, 1994, pp. 82-87. DOI: 10.3115/1075812.1075828 131
- (1994) Proc. Workshop on Human Language Technology , pp. 82-87
- Iyer, R.¹ Ostendorf, M.² Rohlicek, J.R.³

59
- 0016507833
- Design of a linguistic statistical decoder for the recognition of continuous speech
- DOI: 10.1109/TIT.1975.1055384 9
- F. Jelinek, L. R. Bahl, and R. L. Mercer, "Design of a linguistic statistical decoder for the recognition of continuous speech," IEEE Transactions on Information Theory, vol. IT-21, no. 3, pp. 250-256, 1975. DOI: 10.1109/TIT.1975.1055384 9
- (1975) IEEE Transactions on Information Theory , vol.21 , Issue.3 , pp. 250-256
- Jelinek, F.¹ Bahl, L.R.² Mercer, R.L.³

60
- 0003786003
- The MIT Press 1, 9
- F. Jelinek, Ed., Statistical Methods for Speech Recognition. The MIT Press, 1998. 1, 9
- (1998) Statistical Methods for Speech Recognition
- Jelinek, F.¹

61
- 0012357341
- A dynamic language model for speech recognition
- DOI: 10.3115/112405.112464 131
- F. Jelinek, B. Merialdo, S. Roukos, and M. Strauss, "A dynamic language model for speech recognition," in Proc. DARPA Workshop on Speech and Natural Language, 1991, pp. 293-295. DOI: 10.3115/112405.112464 131
- (1991) Proc. DARPA Workshop on Speech and Natural Language , pp. 293-295
- Jelinek, F.¹ Merialdo, B.² Roukos, S.³ Strauss, M.⁴

62
- 77950550412
- Development of a WFST based speech recognition system for a resource deficient language using machine translation
- 131
- A. T. Jensson, T. Oonishi, K. Iwano, and S. Furui, "Development of a WFST based speech recognition system for a resource deficient language using machine translation," in Proc. APSIPA ASC, 2009, pp. 50-56. 131
- (2009) Proc. APSIPA ASC , pp. 50-56
- Jensson, A.T.¹ Oonishi, T.² Iwano, K.³ Furui, S.⁴

63
- 85009198110
- Speech recognition with dynamic grammars using finite-state transducers
- 131
- J. J. Schalkwyk, I. L. Hetherington, and E. Story, "Speech recognition with dynamic grammars using finite-state transducers," in Proc. Eurospeech, 2003, pp. 1969-1972. 131
- (2003) Proc. Eurospeech , pp. 1969-1972
- Schalkwyk, J.J.¹ Hetherington, I.L.² Story, E.³

64
- 0032289099
- Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition
- DOI: 10.1016/S0167-6393(98)00061-2 12
- N. Kumar and H. G. Andreou, "Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition," Speech Communication, vol. 26, pp. 283-297, 1998. DOI: 10.1016/S0167-6393(98)00061-2 12
- (1998) Speech Communication , vol.26 , pp. 283-297
- Kumar, N.¹ Andreou, H.G.²

65
- 0023312404
- Estimation of probabilities from sparse data for the language model component of a speech recognizer
- DOI: 10.1109/TASSP.1987.1165125 22
- S. M. Katz, "Estimation of probabilities from sparse data for the language model component of a speech recognizer," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 35, no. 3, pp. 400-401, 1987. DOI: 10.1109/TASSP.1987.1165125 22
- (1987) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.35 , Issue.3 , pp. 400-401
- Katz, S.M.¹

66
- 0026372947
- A*-admissible heuristics for rapid lexical access
- DOI: 10.1109/ICASSP.1991.150433 4
- P. Kenny, R. Hollan, V. Gupta, M. Lennig, P. Mermelstein, and D. O'Shaughnessy, "A*-admissible heuristics for rapid lexical access," in Proc. ICASSP, 1991, pp. 689-692. DOI: 10.1109/ICASSP.1991.150433 4
- (1991) Proc. ICASSP , pp. 689-692
- Kenny, P.¹ Hollan, R.² Gupta, V.³ Lennig, M.⁴ Mermelstein, P.⁵ O'Shaughnessy, D.⁶

67
- 0028996960
- Improved decision trees for phonetic modeling
- DOI: 10.1007/11965152-26 131
- R. Kuhn, A. Lazarides, Y. Normandin, and J. Brousseau, "Improved decision trees for phonetic modeling," in Proc. ICASSP, vol. 1, 1995, pp. 552-555. DOI: 10.1007/11965152-26 131
- (1995) Proc. ICASSP , vol.1 , pp. 552-555
- Kuhn, R.¹ Lazarides, A.² Normandin, Y.³ Brousseau, J.⁴

68
- 84941164160
- FSA: An efficient and flexible C++ toolkit for finite state automata using on-demand computation
- 136
- S. Kanthak and H. Ney, "FSA: An efficient and flexible C++ toolkit for finite state automata using on-demand computation," in Proc. ACL, 2004, pp. 510-517. 136
- (2004) Proc. ACL , pp. 510-517
- Kanthak, S.¹ Ney, H.²

69
- 84865227975
- Structural classification methods based on weighted finite-state transducers for automatic speech recognition
- in press. 133
- Y. Kubo, S. Watanabe, T. Hori, and A. Nakamura, "Structural classification methods based on weighted finite-state transducers for automatic speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, 2012, in press. 133
- (2012) IEEE Transactions on Audio, Speech, and Language Processing
- Kubo, Y.¹ Watanabe, S.² Hori, T.³ Nakamura, A.⁴

70
- 0003539541
- PhD thesis, Carnegie Mellon University, April 17
- K.-F. Lee, "Large-vocabulary speaker-independent continuous speech recognition: the SPHINX system," PhD thesis, Carnegie Mellon University, April 1988. 17
- (1988) Large-vocabulary Speaker-independent Continuous Speech Recognition: The SPHINX System
- Lee, K.-F.¹

71
- 78049379945
- Language model combination and adaptation using weighted finite state transducers
- 131
- X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland, "Language model combination and adaptation using weighted finite state transducers," in Proc. ICASSP, 2010, pp. 5390-5393. 131
- (2010) Proc. ICASSP , pp. 5390-5393
- Liu, X.¹ Gales, M.J.F.² Hieronymus, J.L.³ Woodland, P.C.⁴

72
- 0004244302
- Prentice Hall 12
- L. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition. Prentice Hall, 1993. 12
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.-H.²

73
- 0003509020
- PhD thesis, Dept. of Computer Science, Carnegie-Mellon University, Pittsburgh, PA, USA 3, 27
- B. Lowerre, "The HARPY speech recognition system," PhD thesis, Dept. of Computer Science, Carnegie-Mellon University, Pittsburgh, PA, USA, 1976. 3, 27
- (1976) The HARPY Speech Recognition System
- Lowerre, B.¹

74
- 85135253868
- Efficient general lattice generation and rescoring
- 88, 94, 126
- A. Ljolje, F. Pereira, and M. Riley, "Efficient general lattice generation and rescoring," in Proc. Eurospeech, 1999, pp. 1251-1254. 88, 94, 126
- (1999) Proc. Eurospeech , pp. 1251-1254
- Ljolje, A.¹ Pereira, F.² Riley, M.³

75
- 78049355806
- Discriminatively estimated joint acoustic, duration, and language model for speech recognition
- DOI: 10.1109/ICASSP.2010.5495227 133
- M. Lehr and I. Shafran, "Discriminatively estimated joint acoustic, duration, and language model for speech recognition," in Proc. ICASSP, 2010, pp. 5542-5545. DOI: 10.1109/ICASSP.2010.5495227 133
- (2010) Proc. ICASSP , pp. 5542-5545
- Lehr, M.¹ Shafran, I.²

76
- 79952434203
- Deep belief networks for phone recognition
- 16
- A. Mohamed, G. Dahl, and G. Hinton, "Deep belief networks for phone recognition," in Proc. NIPS Workshop on Deep Learning for Speech Recognition, 2009. 16
- (2009) Proc. NIPS Workshop on Deep Learning for Speech Recognition
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

77
- 70450152774
- Juicer: A weighted finite-state transducer decoder
- 136
- D. Moore, J. Dines, M. Magimai Doss, J. Vepa, O. Cheng, and T. Hain, "Juicer: a weighted finite-state transducer decoder," Machine Learning for Multimodal Interaction, Lecture Notes in Computer Science, vol. 4299, pp. 285-296, 2006. 136
- (2006) Machine Learning for Multimodal Interaction, Lecture Notes in Computer Science , vol.4299 , pp. 285-296
- Moore, D.¹ Dines, J.² Magimai Doss, M.³ Vepa, J.⁴ Cheng, O.⁵ Hain, T.⁶

78
- 34547522070
- Discriminative training for large vocabulary speech recognition using minimum classification error
- DOI: 10.1109/TASL.2006.876778 132
- E. McDermott, T. J. Hazen, J. Le Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition using minimum classification error," IEEE Transactions on Audio, Speech and Language Processing, vol. 15, pp. 203-223, 2007. DOI: 10.1109/TASL.2006.876778 132
- (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , pp. 203-223
- McDermott, E.¹ Hazen, T.J.² Le Roux, J.³ Nakamura, A.⁴ Katagiri, S.⁵

79
- 84872856906
- web page 136
- "The MIT FST Toolkit," web page http://people.csail.mit.edu/ilh/fst. 136
- The MIT FST Toolkit

80
- 84892889057
- Generic epsilon-removal and input epsilon-normalization algorithms for weighted transducers
- DOI: 10.1142/S0129054102000996 65
- M. Mohri, "Generic epsilon-removal and input epsilon-normalization algorithms for weighted transducers," International Journal of Foundations of Computer Science, vol. 13(1), pp. 129-143, 2002. DOI: 10.1142/S0129054102000996 65
- (2002) International Journal of Foundations of Computer Science , vol.13 , Issue.1 , pp. 129-143
- Mohri, M.¹

81
- 70350376504
- Weighted automata algorithms
- M. Droste, W. Kuich, and H. Vogler, Eds. Springer-Verlag New York Inc. DOI: 10.1007/978-3-642-01492-5 56 57 61 65
- M. Mohri, "Weighted automata algorithms," in Handbook of Weighted Automata, M. Droste, W. Kuich, and H. Vogler, Eds. Springer-Verlag New York Inc., 2009. DOI: 10.1007/978-3-642-01492-5 56, 57, 61, 65
- (2009) Handbook of Weighted Automata
- Mohri, M.¹

82
- 21444449828
- Weighted automata in text and speech processing
- Budapest, Hungary 4
- M. Mohri, F. Pereira, and M. Riley, "Weighted automata in text and speech processing," in Proc. ECAI-96, Workshop on Extended Finite State Models of Language, Budapest, Hungary, 1996. 4
- (1996) Proc. ECAI-96, Workshop on Extended Finite State Models of Language
- Mohri, M.¹ Pereira, F.² Riley, M.³

83
- 0012306376
- The design principles of a weighted finite-state transducer library
- DOI: 10.1016/S0304-3975(99)00014-6 136
- M. Mohri, F. Pereira, and M. Riley, "The design principles of a weighted finite-state transducer library," Theoretical Computer Science, vol. 231(1), pp. 17-32, 2000. DOI: 10.1016/S0304-3975(99)00014-6 136
- (2000) Theoretical Computer Science , vol.231 , Issue.1 , pp. 17-32
- Mohri, M.¹ Pereira, F.² Riley, M.³

84
- 0036460907
- Weighted finite-state transducers in speech recognition
- DOI: 10.1006/csla.2001.0184 4 41 71 80 83 94 95
- M. Mohri, F. Pereira, and M. Riley, "Weighted finite-state transducers in speech recognition," Computer Speech and Language, vol. 16, pp. 69-88, 2002. DOI: 10.1006/csla.2001.0184 4, 41, 71, 80, 83, 94, 95
- (2002) Computer Speech and Language , vol.16 , pp. 69-88
- Mohri, M.¹ Pereira, F.² Riley, M.³

85
- 33646939678
- Weighted determinization and minimization for large vocabulary speech recognition
- 95
- M. Mohri and M. Riley, "Weighted determinization and minimization for large vocabulary speech recognition," in Proc. Eurospeech, vol. 1, 1997, pp. 131-134. 95
- (1997) Proc. Eurospeech , vol.1 , pp. 131-134
- Mohri, M.¹ Riley, M.²

86
- 85009070232
- A weight pushing algorithm for large vocabulary speech recognition
- 91
- M. Mohri and M. Riley, "A weight pushing algorithm for large vocabulary speech recognition," in Proc. Eurospeech, 2001, pp. 1603-1606. 91
- (2001) Proc. Eurospeech , pp. 1603-1606
- Mohri, M.¹ Riley, M.²

87
- 44849112578
- An algorithm for fast composition of weighted finite-state transducers
- DOI: 10.1109/ASRU.2007.4430156 95 99
- J. McDonough, E. Stoimenov, and D. Klakow, "An algorithm for fast composition of weighted finite-state transducers," in Proc. ASRU, 2007, pp. 461-466. DOI: 10.1109/ASRU.2007.4430156 95, 99
- (2007) Proc. ASRU , pp. 461-466
- McDonough, J.¹ Stoimenov, E.² Klakow, D.³

88
- 0021406359
- The use of a one-stage dynamic programming algorithm for connected word recognition
- Apr DOI: 10.1109/TASSP.1984.1164320 3
- H. Ney, "The use of a one-stage dynamic programming algorithm for connected word recognition," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-32, no. 2, pp. 263-271, Apr. 1984. DOI: 10.1109/TASSP.1984.1164320 3
- (1984) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-32 , Issue.2 , pp. 263-271
- Ney, H.¹

89
- 85017308347
- Improvements in beam search for 10000-word continuous speech recognition
- DOI: 10.1109/89.279287 27
- H. Ney, R. Haeb-Umbach, B. Tran, and M. Oerder, "Improvements in beam search for 10000-word continuous speech recognition," in Proc. ICASSP, vol. I, 1992, pp. 9-12. DOI: 10.1109/89.279287 27
- (1992) Proc. ICASSP , vol.1 , pp. 9-12
- Ney, H.¹ Haeb-Umbach, R.² Tran, B.³ Oerder, M.⁴

90
- 70349227632
- Generalization of specialized on-the-fly composition
- DOI: 10.1109/ICASSP.2009.4960584 95 99 102
- T. Oonishi, P. R. Dixon, K. Iwano, and S. Furui, "Generalization of specialized on-the-fly composition," in Proc. ICASSP, 2009, pp. 4317-4320. DOI: 10.1109/ICASSP.2009.4960584 95, 99, 102
- (2009) Proc. ICASSP , pp. 4317-4320
- Oonishi, T.¹ Dixon, P.R.² Iwano, K.³ Furui, S.⁴

91
- 84866849798
- Optimization of on-the-fly composition for WFST-based speech recognition decoders
- (in Japanese) 102 103
- T. Oonishi, P. R. Dixon, K. Iwano, and S. Furui, "Optimization of on-the-fly composition for WFST-based speech recognition decoders," IEICE Transactions on Information and Systems, vol. J92-D, no. 7, pp. 1026-1035, 2009, (in Japanese). 102, 103
- (2009) IEICE Transactions on Information and Systems , vol.J92-D , Issue.7 , pp. 1026-1035
- Oonishi, T.¹ Dixon, P.R.² Iwano, K.³ Furui, S.⁴

92
- 80051616419
- Round-robin duel discriminative language models in one-pass decoding with on-the-fly error correction
- DOI: 10.1109/ICASSP.2011.5947626 131
- T. Oba, T. Hori, A. Ito, and A. Nakamura, "Round-robin duel discriminative language models in one-pass decoding with on-the-fly error correction," in Proc. ICASSP, 2011, pp. 5588-5591. DOI: 10.1109/ICASSP.2011.5947626 131
- (2011) Proc. ICASSP , pp. 5588-5591
- Oba, T.¹ Hori, T.² Ito, A.³ Nakamura, A.⁴

93
- 0030719155
- A word graph algorithm for large vocabulary continuous speech recognition
- DOI: 10.1006/csla.1996.0022 4 33 38 94
- S. Ortmanns, H. Ney, and X. Aubert, "A word graph algorithm for large vocabulary continuous speech recognition," Computer Speech and Language, vol. 1, pp. 43-72, 1997. DOI: 10.1006/csla.1996.0022 4, 33, 38, 94
- (1997) Computer Speech and Language , vol.1 , pp. 43-72
- Ortmanns, S.¹ Ney, H.² Aubert, X.³

94
- 0030366694
- Language-model look-ahead for large vocabulary speech recognition
- DOI: 10.1109/ICSLP.1996.607215 103
- S. Ortmanns, H. Ney, and A. Eiden, "Language-model look-ahead for large vocabulary speech recognition," in Proc. ICSLP, 1996, pp. 2095-2098. DOI: 10.1109/ICSLP.1996.607215 103
- (1996) Proc. ICSLP , pp. 2095-2098
- Ortmanns, S.¹ Ney, H.² Eiden, A.³

95
- 84872842078
- web page 136
- "OpenFst Library," web page http://www.openfst.org/twiki/bin/view/FST/WebHome. 136
- OpenFst Library

96
- 0024934084
- Benchmark tests for DARPA resource management database performance evaluations
- DOI: 10.1109/ICASSP.1989.266482 1
- D. S. Pallett, "Benchmark tests for DARPA resource management database performance evaluations," in Proc. ICASSP, 1989, pp. 536-539. DOI: 10.1109/ICASSP.1989.266482 1
- (1989) Proc. ICASSP , pp. 536-539
- Pallett, D.S.¹

97
- 0026368475
- Algorithm for an optimal A* search and linearizing the search in the stack decoder
- DOI: 10.1109/ICASSP.1991.150434 4
- D. B. Paul, "Algorithm for an optimal A* search and linearizing the search in the stack decoder," in Proc. ICASSP, 1991, pp. 693-696. DOI: 10.1109/ICASSP.1991.150434 4
- (1991) Proc. ICASSP , pp. 693-696
- Paul, D.B.¹

98
- 78049502526
- The subspace Gaussian mixture model-A structured model for speech recognition
- April DOI: 10.1016/j.csl.2010.06.003 16
- D. Povey, L. Burget, M. Agarwal, P. Akyazi, F. Kai, A. Ghoshal, O. Glembek, N. Goel, M. Karafiat, A. Rastrow, R. C. Rosei, P. Schwarz, and S. Thomas, "The subspace Gaussian mixture model-A structured model for speech recognition," Computer Speech & Language, vol. 25, no. 2, pp. 404-439, April 2011. DOI: 10.1016/j.csl.2010.06.003 16
- (2011) Computer Speech & Language , vol.25 , Issue.2 , pp. 404-439
- Povey, D.¹ Burget, L.² Agarwal, M.³ Akyazi, P.⁴ Kai, F.⁵ Ghoshal, A.⁶ Glembek, O.⁷ Goel, N.⁸ Karafiat, M.⁹ Rastrow, A.¹⁰ Rosei, R.C.¹¹ Schwarz, P.¹² Thomas, S.¹³

99
- 6744225722
- DARPA ATIS test results June 1990
- R. Stern, Ed. Morgan Kaufmann Publishers, Inc., June 1
- D. S. Pallett, W. M. Fisher, J. G. Fiscus, and J. S. Garofolo, "DARPA ATIS test results June 1990," in Proc. Speech and Natural Language Workshop, R. Stern, Ed. Morgan Kaufmann Publishers, Inc., June 1990, pp. 114-121. 1
- (1990) Proc. Speech and Natural Language Workshop , pp. 114-121
- Pallett, D.S.¹ Fisher, W.M.² Fiscus, J.G.³ Garofolo, J.S.⁴

100
- 84858953642
- The Kaldi speech recognition toolkit
- 136
- D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlicek, Y. Qian, P. Schwarz, J. Silovsky, G. Stemmer, and K. Vesely, "The Kaldi speech recognition toolkit," in Proc. ASRU, 2011. 136
- (2011) Proc. ASRU
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Motlicek, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovsky, J.¹¹ Stemmer, G.¹² Vesely, K.¹³

101
- 0002837345
- Speech recognition by composition of weighted finite automata
- MIT Press 4
- F. Pereira and M. Riley, "Speech recognition by composition of weighted finite automata," in Finite-State Language Processing. MIT Press, 1996, pp. 431-453. 4
- (1996) Finite-State Language Processing , pp. 431-453
- Pereira, F.¹ Riley, M.²

102
- 0242312781
- Weighted rational transductions and their application to human language processing
- DOI: 10.3115/1075812.1075870 4
- F. Pereira, M. Riley, and R. Sproat, "Weighted rational transductions and their application to human language processing," in Proc. ARPA Workshop on Human Language Technology, 1994, pp. 249-254. DOI: 10.3115/1075812.1075870 4
- (1994) Proc. ARPA Workshop on Human Language Technology , pp. 249-254
- Pereira, F.¹ Riley, M.² Sproat, R.³

103
- 0036296863
- Minimum phone error and I-smoothing for improved discriminative training
- DOI: 10.1109/ICASSP.2002.5743665 13 132
- D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP, vol. I, 2002, pp. 105-108. DOI: 10.1109/ICASSP.2002.5743665 13, 132
- (2002) Proc. ICASSP , vol.1 , pp. 105-108
- Povey, D.¹ Woodland, P.C.²

104
- 0027113267
- Minimisation of acyclic deterministic automata in linear time
- DOI: 10.1016/0304-3975(92)90142-3 64
- D. Revuz, "Minimisation of acyclic deterministic automata in linear time," Theoretical Computer Science, vol. 92(1), pp. 181-189, 1992. DOI: 10.1016/0304-3975(92)90142-3 64
- (1992) Theoretical Computer Science , vol.92 , Issue.1 , pp. 181-189
- Revuz, D.¹

105
- 84878410921
- RASR-The RWTH Aachen university open source speech recognition toolkit
- 136
- D. Rybach, S. Hahn, P. Lehnen, D. Nolden, M. Sundermeyer, Z. Tuske, S. Wiesler, R. Schluter, and N. Ney, "RASR-The RWTH Aachen university open source speech recognition toolkit," in Proc. ASRU, 2011. 136
- (2011) Proc. ASRU
- Rybach, D.¹ Hahn, S.² Lehnen, P.³ Nolden, D.⁴ Sundermeyer, M.⁵ Tuske, Z.⁶ Wiesler, S.⁷ Schluter, R.⁸ Ney, N.⁹

106
- 0028194709
- Connectionist probability estimators in HMM speech recognition
- DOI: 10.1109/89.260359 16
- S. Renals, N. Morgan, H. Boulard, M. Cohen, and H. Franco, "Connectionist probability estimators in HMM speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 1, pp. 161-174, 1994. DOI: 10.1109/89.260359 16
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.1 , pp. 161-174
- Renals, S.¹ Morgan, N.² Boulard, H.³ Cohen, M.⁴ Franco, H.⁵

107
- 0002247642
- Transducer composition for context-dependent network expansion
- 71
- M. Riley, F. Pereira, and M. Mohri, "Transducer composition for context-dependent network expansion," in Proc. Eurospeech, 1997, pp. 1427-1430. 71
- (1997) Proc. Eurospeech , pp. 1427-1430
- Riley, M.¹ Pereira, F.² Mohri, M.³

108
- 0003845346
- A Bradford Book 44
- E. Roche and Y. Schabes, Eds., Finite-State Language Processing. A Bradford Book, 1997. 44
- (1997) Finite-State Language Processing
- Roche, E.¹ Schabes, Y.²

109
- 85149106909
- Discriminative language modeling with conditional random fields and the perceptron algorithm
- DOI: 10.3115/1218955.1218962 130
- B. Roark, M. Saraclar, M. Collins, and M. Johnson, "Discriminative language modeling with conditional random fields and the perceptron algorithm," in Proc. ACL, 2004. DOI: 10.3115/1218955.1218962 130
- (2004) Proc. ACL
- Roark, B.¹ Saraclar, M.² Collins, M.³ Johnson, M.⁴

110
- 84867593936
- Silence is golden: Modeling non-speech events in WFST-based dynamic network decoders
- Kyoto, Japan DOI: 10.1109/ICASSP.2012.6288846 75
- D. Rybach, R. Schluter, and H. Ney, "Silence is golden: modeling non-speech events in WFST-based dynamic network decoders," in Proc. ICASSP, Kyoto, Japan, 2012, pp. 4205-4208. DOI: 10.1109/ICASSP.2012.6288846 75
- (2012) Proc. ICASSP , pp. 4205-4208
- Rybach, D.¹ Schluter, R.² Ney, H.³

111
- 84872853828
- web page 136
- "The RWTH FSA Toolkit," web page http://www-i6.informatik.rwth-aachen.de/kanthak/fsa.html. 136
- The RWTH FSA Toolkit

112
- 0026390882
- A comparison of several approximate algorithms for finding multiple (N-best) sentence hypotheses
- DOI: 10.1109/ICASSP.1991.150436 39
- R. Schwartz and Y. Austin, "A comparison of several approximate algorithms for finding multiple (N-best) sentence hypotheses," in Proc. ICASSP, 1990, pp. 701-704. DOI: 10.1109/ICASSP.1991.150436 39
- (1990) Proc. ICASSP , pp. 701-704
- Schwartz, R.¹ Austin, Y.²

113
- 80053442098
- A similarity evaluation of speech patterns by dynamic programming (in Japanese)
- 2
- H. Sakoe and S. Chiba, "A similarity evaluation of speech patterns by dynamic programming (in Japanese)," in the Dig. 1970 Nat. Meeting, Inst. Electrn. Comm. Eng. Japan, July 1970, p. 136. 2
- (1970) The Dig. 1970 Nat. Meeting, Inst. Electrn. Comm. Eng. Japan, July , pp. 136
- Sakoe, H.¹ Chiba, S.²

114
- 0005670423
- A dynamic programming approach to continuous speech recognition
- Budapest, Hungary, Paper 20 C 13, August 2
- H. Sakoe and S. Chiba, "A dynamic programming approach to continuous speech recognition," in Proc. ICA, Budapest, Hungary, Paper 20 C 13, August 1971, pp. 65-68. 2
- (1971) Proc. ICA , pp. 65-68
- Sakoe, H.¹ Chiba, S.²

115
- 0025627406
- The N-best algorithm: an efficient and exact procedure for finding the N most likely sentence hypotheses
- DOI: 10.1109/ICASSP.1990.115542 4
- R. Schwartz and Y. Chow, "The N-best algorithm: an efficient and exact procedure for finding the N most likely sentence hypotheses," in Proc. ICASSP, 1990, pp. 81-84. DOI: 10.1109/ICASSP.1990.115542 4
- (1990) Proc. ICASSP , pp. 81-84
- Schwartz, R.¹ Chow, Y.²

116
- 0033896970
- Memory-efficient LVCSR search using a one-pass stack decoder
- January DOI: 10.1006/csla.1999.0135 4
- M. Schuster, "Memory-efficient LVCSR search using a one-pass stack decoder," Computer Speech & Language, vol. 14(1), pp. 47-77, January 2000. DOI: 10.1006/csla.1999.0135 4
- (2000) Computer Speech & Language , vol.14 , Issue.1 , pp. 47-77
- Schuster, M.¹

117
- 0026370988
- A tree-trellis based fast search for finding the N-best sentence hypotheses in continuous speech recognition
- DOI: 10.1109/ICASSP.1991.150437 4
- F. K. Soong and E.-F. Huang, "A tree-trellis based fast search for finding the N-best sentence hypotheses in continuous speech recognition," in Proc. ICASSP, vol. 1, 1991, pp. 705-708. DOI: 10.1109/ICASSP.1991.150437 4
- (1991) Proc. ICASSP , vol.1 , pp. 705-708
- Soong, F.K.¹ Huang, E.-F.²

118
- 85009292190
- EM training of finite-state transducers and its application to pronunciation modeling
- 132
- H. Shu and I. L. Hetherington, "EM training of finite-state transducers and its application to pronunciation modeling," in Proc. ICSLP, 2002, pp. 1293-1296. 132
- (2002) Proc. ICSLP , pp. 1293-1296
- Shu, H.¹ Hetherington, I.L.²

119
- 33645768509
- Efficient generation of high-order context-dependent weighted finite state transducers for speech recognition
- DOI: 10.1109/ICASSP.2005.1415085 132
- M. Schuster and T. Hori, "Efficient generation of high-order context-dependent weighted finite state transducers for speech recognition," in Proc. ICASSP, 2005, pp. 201-204. DOI: 10.1109/ICASSP.2005.1415085 132
- (2005) Proc. ICASSP , pp. 201-204
- Schuster, M.¹ Hori, T.²

120
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- 16
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech, 2011, pp. 437-440. 16
- (2011) Proc. Interspeech , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

121
- 33947638544
- Modeling polyphone context with weighted finite-state transducers
- DOI: 10.1109/ICASSP.2006.1659972 132
- E. Stoimenov and J. McDonough, "Modeling polyphone context with weighted finite-state transducers," in Proc. ICASSP, vol. I, 2006, pp. 121-124. DOI: 10.1109/ICASSP.2006.1659972 132
- (2006) Proc. ICASSP , vol.1 , pp. 121-124
- Stoimenov, E.¹ McDonough, J.²

122
- 0030362785
- Multilingual text analysis for text-to-speech synthesis
- DOI: 10.1017/S1351324997001654 135
- R. Sproat, "Multilingual text analysis for text-to-speech synthesis," in Proc. ICSLP, vol. 3, 1996, pp. 1365-1368. DOI: 10.1017/S1351324997001654 135
- (1996) Proc. ICSLP , vol.3 , pp. 1365-1368
- Sproat, R.¹

123
- 0012611072
- Entropy-based pruning of backoff language models
- 78
- A. Stolcke, "Entropy-based pruning of backoff language models," in Proc. DARPA Broadcast News Transcription and Understanding Workshop, 1998, pp. 270-274. 78
- (1998) Proc. DARPA Broadcast News Transcription and Understanding Workshop , pp. 270-274
- Stolcke, A.¹

124
- 0033906251
- MDL-based context-dependent subword modeling for speech recognition
- DOI: 10.1250/ast.21.79 19
- K. Shinoda and T. Watanabe, "MDL-based context-dependent subword modeling for speech recognition," Acoustic Science and Technology, vol. 21, no. 2, pp. 79-86, 2000. DOI: 10.1250/ast.21.79 19
- (2000) Acoustic Science and Technology , vol.21 , Issue.2 , pp. 79-86
- Shinoda, K.¹ Watanabe, T.²

125
- 0029765807
- Spontaneous dialogue speech recognition using cross-word context constrained word graphs
- DOI: 10.1109/ICASSP.1996.540311 126
- T. Shimizu, H. Yamamoto, H. Masataki, S. Matsunaga, and Y. Sagisaka, "Spontaneous dialogue speech recognition using cross-word context constrained word graphs," in Proc. ICASSP, 1996, pp. 145-148. DOI: 10.1109/ICASSP.1996.540311 126
- (1996) Proc. ICASSP , pp. 145-148
- Shimizu, T.¹ Yamamoto, H.² Masataki, H.³ Matsunaga, S.⁴ Sagisaka, Y.⁵

126
- 78049374440
- A discriminative model for continuous speech recognition based on weighted finite state transducers
- DOI: 10.1109/ICASSP.2010.5495096 133
- S. Watanabe, T. Hori, E. McDermott, and A. Nakamura, "A discriminative model for continuous speech recognition based on weighted finite state transducers," in Proc. ICASSP, 2010, pp. 4922-4925. DOI: 10.1109/ICASSP.2010.5495096 133
- (2010) Proc. ICASSP , pp. 4922-4925
- Watanabe, S.¹ Hori, T.² McDermott, E.³ Nakamura, A.⁴

127
- 85009110509
- Time and memory efficient Viterbi decoding for LVCSR using a precompiled search network
- 95, 110 111
- D. Willett, E. McDermott, Y. Minami, and S. Katagiri, "Time and memory efficient Viterbi decoding for LVCSR using a precompiled search network," in Proc. Eurospeech, 2001, pp. 847-850. 95, 110, 111
- (2001) Proc. Eurospeech , pp. 847-850
- Willett, D.¹ McDermott, E.² Minami, Y.³ Katagiri, S.⁴

128
- 3042741069
- Variational Bayesian estimation and clustering for speech recognition
- DOI: 10.1109/TSA.2004.828640 19
- S. Watanabe, Y. Minami, A. Nakamura, and N. Ueda, "Variational Bayesian estimation and clustering for speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 12, pp. 365-381, 2004. DOI: 10.1109/TSA.2004.828640 19
- (2004) IEEE Transactions on Speech and Audio Processing , vol.12 , pp. 365-381
- Watanabe, S.¹ Minami, Y.² Nakamura, A.³ Ueda, N.⁴

129
- 0002144369
- Tree-based state tying for high accuracy acoustics modeling
- DOI: 10.3115/1075812.1075885 18
- S. J. Young, J. J. Odell, and P. C. Woodland, "Tree-based state tying for high accuracy acoustics modeling," in Proc. ARPA Human Language Technology Workshop, 1994, pp. 307-312. DOI: 10.3115/1075812.1075885 18
- (1994) Proc. ARPA Human Language Technology Workshop , pp. 307-312
- Young, S.J.¹ Odell, J.J.² Woodland, P.C.³

130
- 84867598134
- A general discriminative training algorithm for speech recognition using weighted finite-state transducers
- Kyoto, Japan DOI: 10.1109/ICASSP.2012.6288849 132
- Y. Zhao, A. Ljolje, D. Caseiro, and B.-H. Juang, "A general discriminative training algorithm for speech recognition using weighted finite-state transducers," in Proc. ICASSP, Kyoto, Japan, 2012, pp. 4217-4220. DOI: 10.1109/ICASSP.2012.6288849 132
- (2012) Proc. ICASSP , pp. 4217-4220
- Zhao, Y.¹ Ljolje, A.² Caseiro, D.³ Juang, B.-H.⁴

131
- 77949370075
- A segmental CRF approach to large vocabulary continuous speech recognition
- DOI: 10.1109/ASRU.2009.5372916 133
- G. Zweig and P. Nguyen, "A segmental CRF approach to large vocabulary continuous speech recognition," in Proc. ASRU, 2009, pp. 152-157. DOI: 10.1109/ASRU.2009.5372916 133
- (2009) Proc. ASRU , pp. 152-157
- Zweig, G.¹ Nguyen, P.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.