메뉴 건너뛰기




Volumn 10, Issue , 2013, Pages 1-162

Speech recognition algorithms using weighted finite-state transducers

Author keywords

automaton; decoder; optimization; speech recognition; Viterbi algorithm; weighted finite state transducer

Indexed keywords

AUTOMATON; BEST MATCH; BLACK BOXES; COMPUTATIONAL COSTS; CONVENTIONAL METHODS; DECODER; DECODING ALGORITHM; DECODING PROCESS; DECODING SPEED; IMPLEMENTATION TECHNIQUES; RECOGNITION ALGORITHM; RECOGNITION ERROR; SEARCH PROBLEM; SPEECH RECOGNITION TECHNOLOGY; SPEECH RECOGNIZER; SPEECH SIGNALS; SPOKEN LANGUAGE PROCESSING; VOCABULARY SIZE; WEIGHTED FINITE-STATE TRANSDUCERS;

EID: 84872838032     PISSN: 1932121X     EISSN: 19321678     Source Type: Book Series    
DOI: 10.2200/S00462ED1V01Y201212SAP010     Document Type: Article
Times cited : (16)

References (131)
  • 1
    • 0028996974 scopus 로고
    • Language model representations for beam-search decoding
    • DOI: 10.1109/ICASSP.1995.47966633
    • G. Antoniol, F. Brugnara, M. Cettolo, and M. Frederico, "Language model representations for beam-search decoding," in Proc. ICASSP, 1995, pp. 588-591. DOI: 10.1109/ICASSP.1995.47966633
    • (1995) Proc. ICASSP , pp. 588-591
    • Antoniol, G.1    Brugnara, F.2    Cettolo, M.3    Frederico, M.4
  • 3
    • 33745219793 scopus 로고    scopus 로고
    • General indexation of weighted automata-application to spoken utterance retrieval
    • C. Allauzen and M. Mohri, "General indexation of weighted automata-application to spoken utterance retrieval," in Proc. HLT-NAACL, 2004. 135
    • (2004) Proc. HLT-NAACL , vol.135
    • Allauzen, C.1    Mohri, M.2
  • 4
    • 85149121374 scopus 로고    scopus 로고
    • Generalized algorithms for constructing statistical language models
    • DOI: 10.3115/1075096.107510277,130
    • C. Allauzen, M. Mohri, and B. Roark, "Generalized algorithms for constructing statistical language models," in Proc. ACL, 2003, pp. 40-47. DOI: 10.3115/1075096.1075102 77, 130
    • (2003) Proc. ACL , pp. 40-47
    • Allauzen, C.1    Mohri, M.2    Roark, B.3
  • 5
    • 4544339437 scopus 로고    scopus 로고
    • A generalized construction of integrated speech recognition transducers
    • DOI: 10.1109/CASSP.2004.132609791
    • C. Allauzen, M. Mohri, M. Riley, and B. Roark, "A generalized construction of integrated speech recognition transducers," in Proc. ICASSP, vol. I, 2004, pp. 761-764. DOI: 10.1109/CASSP.2004.132609791
    • (2004) Proc. ICASSP , vol.1 , pp. 761-764
    • Allauzen, C.1    Mohri, M.2    Riley, M.3    Roark, B.4
  • 6
    • 38149133882 scopus 로고    scopus 로고
    • OpenFst: A general and efficient weighted finite-state transducer library
    • DOI: 10.1007/978-3-540-76336-9-3136
    • C. Allauzen, M. Riley, J. Schalkwyk, W. Skut, and M. Mohri, "OpenFst: A general and efficient weighted finite-state transducer library," in Proc. of CIAA, 2007, pp. 11-23. DOI: 10.1007/978-3-540-76336- 9-3136
    • (2007) Proc. of CIAA , pp. 11-23
    • Allauzen, C.1    Riley, M.2    Schalkwyk, J.3    Skut, W.4    Mohri, M.5
  • 7
    • 70450183653 scopus 로고    scopus 로고
    • A generalized composition algorithm for weighted finite-state transducers
    • 71, 91 95 99 104 105 107
    • C. Allauzen, M. Riley, and J. Schalkwyk, "A generalized composition algorithm for weighted finite-state transducers," in Proc. Interspeech, 2009, pp. 1203-1206. 71, 91, 95, 99, 104, 105, 107
    • (2009) Proc. Interspeech , pp. 1203-1206
    • Allauzen, C.1    Riley, M.2    Schalkwyk, J.3
  • 8
    • 84855817752 scopus 로고    scopus 로고
    • A filter-based algorithm for efficient composition of finite-state transducers
    • DOI: 10.1142/S0129054111009033 105, 106
    • C. Allauzen, M. Riley, and J. Schalkwyk, "A filter-based algorithm for efficient composition of finite-state transducers," International Journal of Foundations of Computer Science, 2011. DOI: 10.1142/S0129054111009033 105, 106
    • (2011) International Journal of Foundations of Computer Science
    • Allauzen, C.1    Riley, M.2    Schalkwyk, J.3
  • 9
    • 0026382117 scopus 로고
    • The forward-backward search algorithm
    • DOI: 10.1109/ICASSP.1991.1504354
    • S. Austin, R. Schwartz, and P. Placeway, "The forward-backward search algorithm," in Proc. ICASSP, vol. 1, 1991, pp. 697-700. DOI: 10.1109/ICASSP.1991.1504354
    • (1991) Proc. ICASSP , vol.1 , pp. 697-700
    • Austin, S.1    Schwartz, R.2    Placeway, P.3
  • 10
    • 0036460898 scopus 로고    scopus 로고
    • An overview of decoding techniques for large vocabulary continuous speech recognition
    • DOI: 10.1006/csla.2001.01854
    • X.L.Aubert, "An overview of decoding techniques for large vocabulary continuous speech recognition," Computer Speech and Language, vol. 16, pp. 89-114, 2002. DOI: 10.1006/csla.2001.01854
    • (2002) Computer Speech and Language , vol.16 , pp. 89-114
    • Aubert, X.L.1
  • 11
    • 84987256786 scopus 로고
    • An algorithm for connected word recognition
    • 3
    • J. S. Bridle, M. D. Brown, and R. M. Chamberlain, "An algorithm for connected word recognition," in Proc. ICASSP, 1982, pp. 899-902. 3
    • (1982) Proc. ICASSP , pp. 899-902
    • Bridle, J.S.1    Brown, M.D.2    Chamberlain, R.M.3
  • 12
  • 13
    • 85135168435 scopus 로고
    • Improvements in tree-based language model representation
    • F. Brugnara and M. Cettolo, "Improvements in tree-based language model representation," in Proc. EUROSPEECH, 1995, pp. 1797-1800. 33
    • (1995) Proc. EUROSPEECH , vol.33 , pp. 1797-1800
    • Brugnara, F.1    Cettolo, M.2
  • 16
    • 0026400222 scopus 로고
    • Decision trees for phonological rules in continuous speech
    • DOI: 10.1109/ICASSP.1991.150308 18 131
    • L. R. Bahl, P. V. de Souza, and P. S. Gopalakrishman, "Decision trees for phonological rules in continuous speech," in Proc. ICASSP, 1991, pp. 185-188. DOI: 10.1109/ICASSP.1991.150308 18, 131
    • (1991) Proc. ICASSP , pp. 185-188
    • Bahl, L.R.1    De Souza, P.V.2    Gopalakrishman, P.S.3
  • 17
    • 70349521673 scopus 로고    scopus 로고
    • Robust understanding in multimodal interfaces
    • DOI: 10.1162/coli.08-022-R2-06-26135
    • S. Bangalore and M. Johnston, "Robust understanding in multimodal interfaces," Computer Linguistics, vol. 35, no. 3, pp. 345-397, 2009. DOI: 10.1162/coli.08-022-R2-06-26135
    • (2009) Computer Linguistics , vol.35 , Issue.3 , pp. 345-397
    • Bangalore, S.1    Johnston, M.2
  • 18
    • 0020719320 scopus 로고
    • Maximum likelihood approach to continuous speech recognition
    • Mar DOI: 10.1109/TPAMI.1983.47673709
    • L. R. Bahl, F. Jelinek, and R. L. Mercer, "Maximum likelihood approach to continuous speech recognition," IEEE Transactions on Patten Analysis and Machine Intelligence, vol. PAMI-5, no. 2, pp. 179-190, Mar. 1983. DOI: 10.1109/TPAMI.1983.47673709
    • (1983) IEEE Transactions on Patten Analysis and Machine Intelligence , vol.PAMI-5 , Issue.2 , pp. 179-190
    • Bahl, L.R.1    Jelinek, F.2    Mercer, R.L.3
  • 19
    • 0017216776 scopus 로고
    • Testing for the consecutive ones property, interval graphs, and graph planarity using pq-tree algorithms
    • DOI: 10.1016/S0022-0000(76)80045-1109
    • K. Booth and G. Lueker, "Testing for the consecutive ones property, interval graphs, and graph planarity using pq-tree algorithms," Journal of Computer and System Sciences, vol. 13, pp. 335-379, 1976. DOI: 10.1016/S0022-0000(76)80045-1109
    • (1976) Journal of Computer and System Sciences , vol.13 , pp. 335-379
    • Booth, K.1    Lueker, G.2
  • 20
    • 0034854347 scopus 로고    scopus 로고
    • Joint prosody prediction and unit selection for concatenative speech synthesis
    • DOI: 10.1109/ICASSP.2001.941031135
    • I. Bulyko and M. Ostendorf, "Joint prosody prediction and unit selection for concatenative speech synthesis," in Proc. ICASSP, vol. 2, 2001, pp. 781-784. DOI: 10.1109/ICASSP.2001.941031135
    • (2001) Proc. ICASSP , vol.2 , pp. 781-784
    • Bulyko, I.1    Ostendorf, M.2
  • 21
    • 0036663562 scopus 로고    scopus 로고
    • Efficient integrated response generation from multiple targets using weighted finite state transducers
    • DOI: 10.1016/S0885-2308(02)00023-2135
    • I. Bulyko and M.Ostendorf, "Efficient integrated response generation from multiple targets using weighted finite state transducers," Computer Speech and Language, vol. 16(3-4), pp. 533-550, 2002. DOI: 10.1016/S0885- 2308(02)00023-2135
    • (2002) Computer Speech and Language , vol.16 , Issue.3-4 , pp. 533-550
    • Bulyko, I.1    Ostendorf, M.2
  • 22
    • 84962861457 scopus 로고    scopus 로고
    • Finite-state transducers for speech-input translation
    • DOI: 10.1109/ASRU.2001.1034664133
    • F. Casacuberta, "Finite-state transducers for speech-input translation," in Proc. ASRU, 2001, pp. 375-380. DOI: 10.1109/ASRU.2001. 1034664133
    • (2001) Proc. ASRU , pp. 375-380
    • Casacuberta, F.1
  • 23
    • 34547544207 scopus 로고    scopus 로고
    • A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition
    • DOI: 10.1109/ICASSP.2007.36692095, 99
    • O. Cheng, J. Dines, and M. M. Doss, "A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition," in Proc. ICASSP, 2007, pp. 348-351. DOI: 10.1109/ICASSP.2007.36692095, 99
    • (2007) Proc. ICASSP , pp. 348-351
    • Cheng, O.1    Dines, J.2    Doss, M.M.3
  • 25
    • 84962787683 scopus 로고    scopus 로고
    • Transducer composition for "on-The-fly" lexicon and language model integration
    • DOI: 10.1109/ASRU.2001.103466795 99
    • D. Caseiro and I.Trancoso, "Transducer composition for "on-The-fly" lexicon and language model integration," in Proc. ASRU, 2001, pp. 393-396. DOI: 10.1109/ASRU.2001.103466795, 99
    • (2001) Proc. ASRU , pp. 393-396
    • Caseiro, D.1    Trancoso, I.2
  • 26
    • 0141480004 scopus 로고    scopus 로고
    • A tail-sharing WFST composition for large vocabulary speech recognition
    • DOI: 10.1109/ICASSP.2003.119879195
    • D. Caseiro and I. Trancoso, "A tail-sharing WFST composition for large vocabulary speech recognition," in Proc. ICASSP, vol. I, 2003, pp. 356-359. DOI: 10.1109/ICASSP.2003.119879195
    • (2003) Proc. ICASSP , vol.1 , pp. 356-359
    • Caseiro, D.1    Trancoso, I.2
  • 27
    • 34047273021 scopus 로고    scopus 로고
    • A specialized on-The-fly algorithm for lexicon and language model composition
    • DOI: 10.1109/TSA.2005.86083899
    • D. Caseiro and I. Trancoso, "A specialized on-The-fly algorithm for lexicon and language model composition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 4, pp. 1281-1291, 2006. DOI: 10.1109/TSA.2005.86083899
    • (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , Issue.4 , pp. 1281-1291
    • Caseiro, D.1    Trancoso, I.2
  • 29
    • 44849131087 scopus 로고    scopus 로고
    • The TITECH large vocabulary WFST speech recognition system
    • DOI: 10.1109/ASRU.2007.4430153136
    • P.R. Dixon, D. Caseiro, T. Oonishi, and S. Furui, "The TITECH large vocabulary WFST speech recognition system," in Proc. ASRU, 2007, pp. 443-448. DOI: 10.1109/ASRU.2007.4430153136
    • (2007) Proc. ASRU , pp. 443-448
    • Dixon, P.R.1    Caseiro, D.2    Oonishi, T.3    Furui, S.4
  • 30
    • 84962878172 scopus 로고    scopus 로고
    • Incremental language models for speech recognition using finite-state transducers
    • DOI: 10.1109/ASRU.2001.1034620 95, 110
    • H. J. G. A. Dolfing and I. L. Hetherington, "Incremental language models for speech recognition using finite-state transducers," in Proc. ASRU, 2001, pp. 194-197. DOI: 10.1109/ASRU.2001.1034620 95, 110
    • (2001) Proc. ASRU , pp. 194-197
    • Dolfing, H.J.G.A.1    Hetherington, I.L.2
  • 31
    • 84867588266 scopus 로고    scopus 로고
    • A comparison of dynamic WFST decoding approaches
    • Kyoto, Japan DOI: 10.1109/ICASSP.2012.6288847 93, 126
    • P. R. Dixon, C. Hori, and H. Kashioka, "A comparison of dynamic WFST decoding approaches," in Proc. ICASSP, Kyoto, Japan, 2012, pp. 4209-4212. DOI: 10.1109/ICASSP.2012.6288847 93, 126
    • (2012) Proc. ICASSP , pp. 4209-4212
    • Dixon, P.R.1    Hori, C.2    Kashioka, H.3
  • 32
    • 85149140805 scopus 로고    scopus 로고
    • Parameter estimation for probabilistic finite-state transducers
    • DOI: 10.3115/1073083.1073085133
    • J. Eisner, "Parameter estimation for probabilistic finite-state transducers," in Proc. ACL, 2002, pp. 1-8. DOI: 10.3115/1073083.1073085133
    • (2002) Proc. ACL , pp. 1-8
    • Eisner, J.1
  • 33
    • 0023776398 scopus 로고
    • The DARPA 1000-word resource management database for continuous speech recognition
    • DOI: 10.1109/ICASSP.1988.1966691
    • W. M. Fisher, J. Bernstein, and D. S. Pallett, "The DARPA 1000-word resource management database for continuous speech recognition," in Proc. ICASSP, vol. 1, 1988, pp. 651-654. DOI: 10.1109/ICASSP.1988.1966691
    • (1988) Proc. ICASSP , vol.1 , pp. 651-654
    • Fisher, W.M.1    Bernstein, J.2    Pallett, D.S.3
  • 34
    • 33745207361 scopus 로고    scopus 로고
    • A Japanese national project on spontaneous speech corpus and processing technology
    • 83
    • S.Furui, K.Maekawa, and H. Isahara, "A Japanese national project on spontaneous speech corpus and processing technology," in Proc. of ASR, 2000, pp. 244-248. 83
    • (2000) Proc. of ASR , pp. 244-248
    • Furui, S.1    Maekawa, K.2    Isahara, H.3
  • 35
    • 84872849317 scopus 로고    scopus 로고
    • web page
    • "AT&T FSM Library," web page http://www.itl.nist.gov/iad/ mig/tests/rt/2009/index.html. 136
    • AT&T FSM Library
  • 36
    • 0022667694 scopus 로고
    • Speaker-independent isolated word recognition using dynamic features of speech spectrum
    • DOI: 10.1109/TASSP.1986.116478812
    • S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 34, no. 1, pp. 52-59, 1986. DOI: 10.1109/TASSP.1986.116478812
    • (1986) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.34 , Issue.1 , pp. 52-59
    • Furui, S.1
  • 37
    • 84867198032 scopus 로고    scopus 로고
    • Silence models in weighted finite-state transducers
    • 75
    • P.Garner, "Silence models in weighted finite-state transducers," in Proc.Interspeech, Brisbane, Australia, 2008, pp. 1817-1820. 75
    • (2008) Proc.Interspeech, Brisbane, Australia , pp. 1817-1820
    • Garner, P.1
  • 38
    • 0028996969 scopus 로고
    • A tree search strategy for large-vocabulary continuous speech recognition
    • DOI: 10.1109/ICASSP.1995.4796624
    • P. S. Gopalakrishnan, L. R. Bahl, and R. L. Mercer, "A tree search strategy for large-vocabulary continuous speech recognition," in Proc. ICASSP, vol. 572-575, 1995. DOI: 10.1109/ICASSP.1995.4796624
    • (1995) Proc. ICASSP , pp. 572-575
    • Gopalakrishnan, P.S.1    Bahl, L.R.2    Mercer, R.L.3
  • 39
    • 0000803388 scopus 로고
    • The population frequencies of species and the estimation of population parameters
    • DOI: 10.2307/233334422
    • I. J.Good, "The population frequencies of species and the estimation of population parameters," Biometrika, vol. 40, no. 3-4, pp. 237-264, 1953. DOI: 10.2307/233334422
    • (1953) Biometrika , vol.40 , Issue.3-4 , pp. 237-264
    • Good, I.J.1
  • 41
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • DOI: 10.1109/ICASSP.2000.86202412
    • H. Hermansky, D. P. W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. ICASSP, vol. 3, 2000, pp. 1635-1638. DOI: 10.1109/ICASSP.2000.86202412
    • (2000) Proc. ICASSP , vol.3 , pp. 1635-1638
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 42
    • 85009152019 scopus 로고    scopus 로고
    • The MIT finite-state transducer toolkit for speech and language processing
    • 136
    • I. L. Hetherington, "The MIT finite-state transducer toolkit for speech and language processing," in Proc. Interspeech-ICSLP, 2004. 136
    • (2004) Proc. Interspeech-ICSLP
    • Hetherington, I.L.1
  • 43
    • 33745188707 scopus 로고    scopus 로고
    • A multi-pass, dynamic-vocabulary approach to real-time, large-vocabulary speech recognition
    • I. L. Hetherington, "A multi-pass, dynamic-vocabulary approach to real-time, large-vocabulary speech recognition," in Proc. Interspeech-Eurospeech, 2005, pp. 545-548. 131
    • (2005) Proc. Interspeech-Eurospeech , vol.131 , pp. 545-548
    • Hetherington, I.L.1
  • 45
    • 85009204481 scopus 로고    scopus 로고
    • Speech summarization using weighted finite-state transducers
    • 134
    • T. Hori, C. Hori, and Y. Minami, "Speech summarization using weighted finite-state transducers," in Proc. Eurospeech, 2003, pp. 2817-2820. 134
    • (2003) Proc. Eurospeech , pp. 2817-2820
    • Hori, T.1    Hori, C.2    Minami, Y.3
  • 46
    • 85009063824 scopus 로고    scopus 로고
    • Fast on-The-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition
    • 6 95 110
    • T.Hori, C.Hori, and Y.Minami, "Fast on-The-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition," in Proc. Interspeech-ICSLP, vol. 1, 2004, pp. 289-292. 6, 95, 110
    • (2004) Proc. Interspeech-ICSLP , vol.1 , pp. 289-292
    • Hori, T.1    Hori, C.2    Minami, Y.3
  • 47
    • 45849093239 scopus 로고    scopus 로고
    • Efficient WFST-based one-pass decoding with on-The-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition
    • DOI: 10.1109/TASL.2006.889790 6 95 110
    • T. Hori, C. Hori, Y. Minami, and A. Nakamura, "Efficient WFST-based one-pass decoding with on-The-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition," IEEETransactions on Audio, Speech, and Language Processing, vol.15, no.4, pp.1352-1365, 2007.DOI: 10.1109/TASL.2006.889790 6, 95, 110
    • (2007) IEEETransactions on Audio, Speech, and Language Processing , vol.15 , Issue.4 , pp. 1352-1365
    • Hori, T.1    Hori, C.2    Minami, Y.3    Nakamura, A.4
  • 50
    • 33646426591 scopus 로고    scopus 로고
    • Generalized fast on-The-fly composition algorithm for WFST-based speech recognition
    • 94 95 110 130
    • T. Hori and A. Nakamura, "Generalized fast on-The-fly composition algorithm for WFST-based speech recognition," in Proc. Interspeech- Eurospeech, 2005, pp. 557-560. 94, 95, 110, 130
    • (2005) Proc. Interspeech-Eurospeech , pp. 557-560
    • Hori, T.1    Nakamura, A.2
  • 51
    • 84867199378 scopus 로고    scopus 로고
    • Dialog management using weighted finite-state transducers
    • DOI: 10.1109/ASRU.2009.5373350136
    • C.Hori, K.Ohtaki, T.Misu, H.Kashioka, andS.Nakamura, "Dialog management using weighted finite-state transducers," in Proc. Interspeech, 2008, pp. 211-214. DOI: 10.1109/ASRU.2009.5373350136
    • (2008) Proc. Interspeech , pp. 211-214
    • Hori, C.1    Ohtaki, K.2    Misu, T.3    Nakamura, S.4
  • 52
    • 70349207745 scopus 로고    scopus 로고
    • Statistical dialog management applied to WFST-based dialog systems
    • DOI: 10.1109/ICASSP.2009.4960703136
    • C. Hori, K. Ohtaki, T. Misu, H. Kashioka, and S. Nakamura, "Statistical dialog management applied to WFST-based dialog systems," in Proc. ICASSP, 2009, pp. 4793-4796. DOI: 10.1109/ICASSP.2009.4960703136
    • (2009) Proc. ICASSP , pp. 4793-4796
    • Hori, C.1    Ohtaki, K.2    Misu, T.3    Kashioka, H.4    Nakamura, S.5
  • 53
    • 33947677731 scopus 로고    scopus 로고
    • Flexible multi-stream framework for speech recognition using multi-tape finite-state transducers
    • DOI: 10.1109/ICASSP.2006.1660046132
    • I. L. Hetherington, H. Shu, and J. R. Glass, "Flexible multi-stream framework for speech recognition using multi-tape finite-state transducers," in Proc. ICASSP, 2006, pp. 417-420. DOI: 10.1109/ICASSP.2006. 1660046132
    • (2006) Proc. ICASSP , pp. 417-420
    • Hetherington, I.L.1    Shu, H.2    Glass, J.R.3
  • 54
    • 70349226871 scopus 로고    scopus 로고
    • A multimedia retrieval system using speech input
    • DOI: 10.1145/1647314.1647356132
    • G. Heigold, R. Schluter, and H. Ney, "A multimedia retrieval system using speech input," in Proc. ICASSP, 2009, pp. 3749-3752. DOI: 10.1145/1647314.1647356132
    • (2009) Proc. ICASSP , pp. 3749-3752
    • Heigold, G.1    Schluter, R.2    Ney, H.3
  • 55
    • 0141480041 scopus 로고    scopus 로고
    • Language model adaptation using WFST-based speaking-style translation
    • DOI: 10.1109/ICASSP.2003.1198759131
    • T. Hori, D. Willett, and Y. Minami, "Language model adaptation using WFST-based speaking-style translation," in Proc. ICASSP, vol. I, 2003, pp. 228-231. DOI: 10.1109/ICASSP.2003.1198759131
    • (2003) Proc. ICASSP , vol.1 , pp. 228-231
    • Hori, T.1    Willett, D.2    Minami, Y.3
  • 56
    • 77954609645 scopus 로고    scopus 로고
    • Paraphrasing spontaneous speech using weighted finite-state transducers
    • 134
    • T. Hori, D. Willett, and Y. Minami, "Paraphrasing spontaneous speech using weighted finite-state transducers," in Proc. SSPR, 2003. 134
    • (2003) Proc. SSPR
    • Hori, T.1    Willett, D.2    Minami, Y.3
  • 57
    • 70349208656 scopus 로고    scopus 로고
    • Aflat direct model for speech recognition
    • DOI: 10.1109/ICASSP.2009.4960470133
    • G.Heigold, G.Zweig, andP.Nguyen, "Aflat direct model for speech recognition," in Proc. ICASSP, 2009, pp. 3861-3864. DOI: 10.1109/ICASSP.2009.4960470133
    • (2009) Proc. ICASSP , pp. 3861-3864
    • Heigold, G.1    Nguyen, P.2
  • 59
    • 0016507833 scopus 로고
    • Design of a linguistic statistical decoder for the recognition of continuous speech
    • DOI: 10.1109/TIT.1975.10553849
    • F. Jelinek, L. R. Bahl, and R. L. Mercer, "Design of a linguistic statistical decoder for the recognition of continuous speech," IEEE Transactions on Information Theory, vol. IT-21, no. 3, pp. 250-256, 1975. DOI: 10.1109/TIT.1975.10553849
    • (1975) IEEE Transactions on Information Theory , vol.21 , Issue.3 , pp. 250-256
    • Jelinek, F.1    Bahl, L.R.2    Mercer, R.L.3
  • 62
    • 77950550412 scopus 로고    scopus 로고
    • Development of a WFST based speech recognition system for a resource deficient language using machine translation
    • 131
    • A. T. Jensson, T. Oonishi, K. Iwano, and S. Furui, "Development of a WFST based speech recognition system for a resource deficient language using machine translation," in Proc. APSIPA ASC, 2009, pp. 50-56. 131
    • (2009) Proc. APSIPA ASC , pp. 50-56
    • Jensson, A.T.1    Oonishi, T.2    Iwano, K.3    Furui, S.4
  • 63
    • 85009198110 scopus 로고    scopus 로고
    • Speech recognition with dynamic grammars using finite-state transducers
    • 131
    • J. J. Schalkwyk, I. L. Hetherington, and E. Story, "Speech recognition with dynamic grammars using finite-state transducers," in Proc. Eurospeech, 2003, pp. 1969-1972. 131
    • (2003) Proc. Eurospeech , pp. 1969-1972
    • Schalkwyk, J.J.1    Hetherington, I.L.2    Story, E.3
  • 64
    • 0032289099 scopus 로고    scopus 로고
    • Heteroscendastic discriminant analysis and reduced rank HMMs for improved speech recognition
    • DOI: 10.1016/S0167-6393(98)00061-212
    • N. Kumar and H. G. Andreou, "Heteroscendastic discriminant analysis and reduced rank HMMs for improved speech recognition," Speech Communication, vol. 26, pp. 283-297, 1998. DOI: 10.1016/S0167-6393(98)00061-212
    • (1998) Speech Communication , vol.26 , pp. 283-297
    • Kumar, N.1    Andreou, H.G.2
  • 65
    • 0023312404 scopus 로고
    • Estimation of probabilities from sparse data for the language model component of a speech recognizer
    • DOI: 10.1109/TASSP.1987.116512522
    • S. M. Katz, "Estimation of probabilities from sparse data for the language model component of a speech recognizer," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 35, no. 3, pp. 400-401, 1987. DOI: 10.1109/TASSP.1987.116512522
    • (1987) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.35 , Issue.3 , pp. 400-401
    • Katz, S.M.1
  • 67
    • 0028996960 scopus 로고
    • Improved decision trees for phonetic modeling
    • DOI: 10.1007/11965152-26131
    • R. Kuhn, A. Lazarides, Y. Normandin, and J. Brousseau, "Improved decision trees for phonetic modeling," in Proc. ICASSP, vol. 1, 1995, pp. 552-555. DOI: 10.1007/11965152-26131
    • (1995) Proc. ICASSP , vol.1 , pp. 552-555
    • Kuhn, R.1    Lazarides, A.2    Normandin, Y.3    Brousseau, J.4
  • 68
    • 84941164160 scopus 로고    scopus 로고
    • FSA: An efficient and flexible C++ toolkit for finite state automata using on-demand computation
    • 136
    • S. Kanthak and H. Ney, "FSA: An efficient and flexible C++ toolkit for finite state automata using on-demand computation," in Proc. ACL, 2004, pp. 510-517. 136
    • (2004) Proc. ACL , pp. 510-517
    • Kanthak, S.1    Ney, H.2
  • 71
    • 78049379945 scopus 로고    scopus 로고
    • Language model combination and adaptation using weighted finite state transducers
    • 131
    • X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland, "Language model combination and adaptation using weighted finite state transducers," in Proc. ICASSP, 2010, pp. 5390-5393. 131
    • (2010) Proc. ICASSP , pp. 5390-5393
    • Liu, X.1    Gales, M.J.F.2    Hieronymus, J.L.3    Woodland, P.C.4
  • 73
    • 0003509020 scopus 로고
    • PhD theses, Dept. of Computer Science, Carnegie-Mellon University, Pittsburgh, PA, USA 3, 27
    • B. Lowerre, "The HARPY speech recognition system," PhD theses, Dept. of Computer Science, Carnegie-Mellon University, Pittsburgh, PA, USA, 1976. 3, 27
    • (1976) The HARPY Speech Recognition System
    • Lowerre, B.1
  • 74
    • 85135253868 scopus 로고    scopus 로고
    • Efficient general lattice generation and rescor-ing
    • 88 94 126
    • A. Ljolje, F. Pereira, and M. Riley, "Efficient general lattice generation and rescor-ing," in Proc. Eurospeech, 1999, pp. 1251-1254. 88, 94, 126
    • (1999) Proc. Eurospeech , pp. 1251-1254
    • Ljolje, A.1    Pereira, F.2    Riley, M.3
  • 75
    • 78049355806 scopus 로고    scopus 로고
    • Discriminatively estimated joint acoustic, duration, and language model for speech recognition
    • DOI: 10.1109/ICASSP.2010.5495227133
    • M. Lehr and I. Shafran, "Discriminatively estimated joint acoustic, duration, and language model for speech recognition," in Proc. ICASSP, 2010, pp. 5542-5545. DOI: 10.1109/ICASSP.2010.5495227133
    • (2010) Proc. ICASSP , pp. 5542-5545
    • Lehr, M.1    Shafran, I.2
  • 78
    • 34547522070 scopus 로고    scopus 로고
    • Discriminative training for large vocabulary speech recognition using minimum classification error
    • DOI: 10.1109/TASL.2006.876778132
    • E. McDermott, T. J. Hazen, J. Le Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition using minimum classification error," IEEE Transactions on Audio, Speech and Language Processing, vol. 15, pp. 203-223, 2007. DOI: 10.1109/TASL.2006. 876778132
    • (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , pp. 203-223
    • McDermott, E.1    Hazen, T.J.2    Le Roux, J.3    Nakamura, A.4    Katagiri, S.5
  • 79
    • 84872856906 scopus 로고    scopus 로고
    • web page 136
    • "The MIT FST Toolkit," web page http://people.csail.mit.edu/ ilh/fst. 136
    • The MIT FST Toolkit
  • 80
    • 84892889057 scopus 로고    scopus 로고
    • Generic epsilon-removal and input epsilon-normalization algorithms for weighted transducers
    • DOI: 10.1142/S012905410200099665
    • M. Mohri, "Generic epsilon-removal and input epsilon-normalization algorithms for weighted transducers," International Journal of Foundations of Computer Science, vol. 13(1), pp. 129-143, 2002. DOI: 10.1142/ S012905410200099665
    • (2002) International Journal of Foundations of Computer Science , vol.13 , Issue.1 , pp. 129-143
    • Mohri, M.1
  • 81
    • 70350376504 scopus 로고    scopus 로고
    • Weighted automata algorithms
    • M. Droste, W. Kuich, and H. Vogler, Eds. Springer-Verlag New York Inc. DOI: 10.1007/978-3-642-01492-5 56 57 61 65
    • M. Mohri, "Weighted automata algorithms," in Handbook of Weighted Automata, M. Droste, W. Kuich, and H. Vogler, Eds. Springer-Verlag New York Inc., 2009. DOI: 10.1007/978-3-642-01492-5 56, 57, 61, 65
    • (2009) Handbook of Weighted Automata
    • Mohri, M.1
  • 83
    • 0012306376 scopus 로고    scopus 로고
    • The design principles of a weighted finite-state transducer library
    • DOI: 10.1016/S0304-3975(99)00014-6136
    • M. Mohri, F. Pereira, and M. Riley, "The design principles of a weighted finite-state transducer library," Theoretical Computer Science, vol. 231(1), pp. 17-32, 2000. DOI: 10.1016/S0304-3975(99)00014-6136
    • (2000) Theoretical Computer Science , vol.231 , Issue.1 , pp. 17-32
    • Mohri, M.1    Pereira, F.2    Riley, M.3
  • 84
    • 0036460907 scopus 로고    scopus 로고
    • Weighted finite-state transducers in speech recognition
    • DOI: 10.1006/csla.2001.0184 4 41 71 80 83 94 95
    • M. Mohri, F. Pereira, and M. Riley, "Weighted finite-state transducers in speech recognition," Computer Speech and Language, vol. 16, pp. 69-88, 2002. DOI: 10.1006/csla.2001.0184 4, 41, 71, 80, 83, 94, 95
    • (2002) Computer Speech and Language , vol.16 , pp. 69-88
    • Mohri, M.1    Pereira, F.2    Riley, M.3
  • 85
    • 33646939678 scopus 로고    scopus 로고
    • Weighted determinization and minimization for large vocabulary speech recognition
    • 95
    • M. Mohri and M. Riley, "Weighted determinization and minimization for large vocabulary speech recognition," in Proc. Eurospeech, vol. 1, 1997, pp. 131-134. 95
    • (1997) Proc. Eurospeech , vol.1 , pp. 131-134
    • Mohri, M.1    Riley, M.2
  • 86
    • 85009070232 scopus 로고    scopus 로고
    • A weight pushing algorithm for large vocabulary speech recognition
    • 91
    • M. Mohri and M.Riley, "A weight pushing algorithm for large vocabulary speech recognition," in Proc. Eurospeech, 2001, pp. 1603-1606. 91
    • (2001) Proc. Eurospeech , pp. 1603-1606
    • Mohri, M.1    Riley, M.2
  • 87
    • 44849112578 scopus 로고    scopus 로고
    • An algorithm for fast composition of weighted finite-state transducers
    • DOI: 10.1109/ASRU.2007.4430156 95 99
    • J. McDonough, E. Stoimenov, and D. Klakow, "An algorithm for fast composition of weighted finite-state transducers," in Proc. ASRU, 2007, pp. 461-466. DOI: 10.1109/ASRU.2007.4430156 95, 99
    • (2007) Proc. ASRU , pp. 461-466
    • McDonough, J.1    Stoimenov, E.2    Klakow, D.3
  • 88
    • 0021406359 scopus 로고
    • The use of a one-stage dynamic programming algorithm for connected word recognition
    • Apr DOI: 10.1109/TASSP.1984.11643203
    • H. Ney, "The use of a one-stage dynamic programming algorithm for connected word recognition," IEEETransactions on Acoustics, Speech, and Signal Processing, vol. ASSP-32, no. 2, pp. 263-271, Apr. 1984. DOI: 10.1109/TASSP.1984.11643203
    • (1984) IEEETransactions on Acoustics, Speech, and Signal Processing , vol.ASSP-32 , Issue.2 , pp. 263-271
    • Ney, H.1
  • 89
    • 85017308347 scopus 로고
    • Improvementsin beam search for 10000-word continuous speech recognition
    • DOI: 10.1109/89.27928727
    • H.Ney, R.Haeb-Umbach, B.Tran, and M.Oerder, "Improvementsin beam search for 10000-word continuous speech recognition," in Proc. ICASSP, vol. I, 1992, pp. 9-12. DOI: 10.1109/89.27928727
    • (1992) Proc. ICASSP , vol.1 , pp. 9-12
    • Ney, H.1    Haeb-Umbach, R.2    Tran, B.3    Oerder, M.4
  • 90
    • 70349227632 scopus 로고    scopus 로고
    • Generalization of specialized on-The-fly composition
    • DOI: 10.1109/ICASSP.2009.4960584 95 99 102
    • T. Oonishi, P. R. Dixon, K. Iwano, and S. Furui, "Generalization of specialized on-The-fly composition," in Proc. ICASSP, 2009, pp. 4317-4320. DOI: 10.1109/ICASSP.2009.4960584 95, 99, 102
    • (2009) Proc. ICASSP , pp. 4317-4320
    • Oonishi, T.1    Dixon, P.R.2    Iwano, K.3    Furui, S.4
  • 91
    • 84866849798 scopus 로고    scopus 로고
    • Optimization of on-The-fly composition for WFST-based speech recognition decoders
    • (in Japanese) 102 103
    • T. Oonishi, P. R. Dixon, K. Iwano, and S. Furui, "Optimization of on-The-fly composition for WFST-based speech recognition decoders," IEICE Transactions on Information and Systems, vol. J92-D, no. 7, pp. 1026-1035, 2009, (in Japanese). 102, 103
    • (2009) IEICE Transactions on Information and Systems , vol.J92-D , Issue.7 , pp. 1026-1035
    • Oonishi, T.1    Dixon, P.R.2    Iwano, K.3    Furui, S.4
  • 92
    • 80051616419 scopus 로고    scopus 로고
    • Round-robin duel discriminative language models in one-pass decoding with on-The-fly error correction
    • DOI: 10.1109/ICASSP.2011.5947626131
    • T. Oba, T. Hori, A. Ito, and A. Nakamura, "Round-robin duel discriminative language models in one-pass decoding with on-The-fly error correction," in Proc. ICASSP, 2011, pp. 5588-5591. DOI: 10.1109/ICASSP.2011.5947626131
    • (2011) Proc. ICASSP , pp. 5588-5591
    • Oba, T.1    Hori, T.2    Ito, A.3    Nakamura, A.4
  • 93
    • 0030719155 scopus 로고    scopus 로고
    • A word graph algorithm for large vocabulary continuous speech recognition
    • DOI: 10.1006/csla.1996.0022 4 33 38 94
    • S.Ortmanns, H.Ney, and X.Aubert, "A word graph algorithm for large vocabulary continuous speech recognition," Computer Speech and Language, vol. 1, pp. 43-72, 1997. DOI: 10.1006/csla.1996.0022 4, 33, 38, 94
    • (1997) Computer Speech and Language , vol.1 , pp. 43-72
    • Ortmanns, S.1    Ney, H.2    Aubert, X.3
  • 94
    • 0030366694 scopus 로고    scopus 로고
    • Language-model look-ahead for large vocabulary speech recognition
    • DOI: 10.1109/ICSLP.1996.607215103
    • S. Ortmanns, H. Ney, and A. Eiden, "Language-model look-ahead for large vocabulary speech recognition," in Proc. ICSLP, 1996, pp. 2095-2098. DOI: 10.1109/ICSLP.1996.607215103
    • (1996) Proc. ICSLP , pp. 2095-2098
    • Ortmanns, S.1    Ney, H.2    Eiden, A.3
  • 95
    • 84872842078 scopus 로고    scopus 로고
    • web page 136
    • "OpenFst Library," web page http://www.openfst.org/twiki/bin/ view/FST/WebHome. 136
    • OpenFst Library
  • 96
    • 0024934084 scopus 로고
    • Benchmark tests for DARPA resource management database performance evaluations
    • DOI: 10.1109/ICASSP.1989.2664821
    • D. S. Pallett, "Benchmark tests for DARPA resource management database performance evaluations," in Proc. ICASSP, 1989, pp. 536-539. DOI: 10.1109/ICASSP.1989.2664821
    • (1989) Proc. ICASSP , pp. 536-539
    • Pallett, D.S.1
  • 97
    • 0026368475 scopus 로고
    • Algorithm for an optimal A&z.ast; Search and linearizing the search in the stack decoder
    • DOI: 10.1109/ICASSP.1991.1504344
    • D. B. Paul, "Algorithm for an optimal A&z.ast; search and linearizing the search in the stack decoder," in Proc. ICASSP, 1991, pp. 693-696. DOI: 10.1109/ICASSP.1991.1504344
    • (1991) Proc. ICASSP , pp. 693-696
    • Paul, D.B.1
  • 101
    • 0002837345 scopus 로고    scopus 로고
    • Speech recognition by composition of weighted finite automata
    • MIT Press 4
    • F. Pereira and M. Riley, "Speech recognition by composition of weighted finite automata," in Finite-State Language Processing. MIT Press, 1996, pp. 431-453. 4
    • (1996) Finite-State Language Processing , pp. 431-453
    • Pereira, F.1    Riley, M.2
  • 102
    • 0242312781 scopus 로고
    • Weighted rational transductions and their application to human language processing
    • DOI: 10.3115/1075812.10758704
    • F. Pereira, M. Riley, and R. Sproat, "Weighted rational transductions and their application to human language processing," in Proc. ARPA Workshop on Human Language technology, 1994, pp. 249-254. DOI: 10.3115/1075812.10758704
    • (1994) Proc. ARPA Workshop on Human Language Technology , pp. 249-254
    • Pereira, F.1    Riley, M.2    Sproat, R.3
  • 103
    • 0036296863 scopus 로고    scopus 로고
    • Minimum phone error and I-smoothing for improved discriminative training
    • DOI: 10.1109/ICASSP.2002.5743665 13 132
    • D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP, vol. I, 2002, pp. 105-108. DOI: 10.1109/ICASSP.2002.5743665 13, 132
    • (2002) Proc. ICASSP , vol.1 , pp. 105-108
    • Povey, D.1    Woodland, P.C.2
  • 104
    • 0027113267 scopus 로고
    • Minimisation of acyclic deterministic automata in linear time
    • DOI: 10.1016/0304-3975(92)90142-364
    • D. Revuz, "Minimisation of acyclic deterministic automata in linear time," Theoretical Computer Science, vol. 92(1), pp. 181-189, 1992. DOI: 10.1016/0304-3975(92)90142-364
    • (1992) Theoretical Computer Science , vol.92 , Issue.1 , pp. 181-189
    • Revuz, D.1
  • 107
    • 0002247642 scopus 로고    scopus 로고
    • Transducer composition for context-dependent network expansion
    • 71
    • M. Riley, F. Pereira, and M. Mohri, "Transducer composition for context-dependent network expansion," in Proc. Eurospeech, 1997, pp. 1427-1430. 71
    • (1997) Proc. Eurospeech , pp. 1427-1430
    • Riley, M.1    Pereira, F.2    Mohri, M.3
  • 109
    • 85149106909 scopus 로고    scopus 로고
    • Discriminative language modeling with conditional random fields and the perceptron algorithm
    • DOI: 10.3115/1218955.1218962130
    • B. Roark, M. Saraclar, M. Collins, and M. Johnson, "Discriminative language modeling with conditional random fields and the perceptron algorithm," in Proc. ACL, 2004. DOI: 10.3115/1218955.1218962130
    • (2004) Proc. ACL
    • Roark, B.1    Saraclar, M.2    Collins, M.3    Johnson, M.4
  • 110
    • 84867593936 scopus 로고    scopus 로고
    • Silence is golden: Modeling non-speech events in WFST-based dynamic network decoders
    • Kyoto, Japan DOI: 10.1109/ICASSP.2012.628884675
    • D. Rybach, R. Schluter, and H. Ney, "Silence is golden: modeling non-speech events in WFST-based dynamic network decoders," in Proc. ICASSP, Kyoto, Japan, 2012, pp. 4205-4208. DOI: 10.1109/ICASSP.2012.628884675
    • (2012) Proc. ICASSP , pp. 4205-4208
    • Rybach, D.1    Schluter, R.2    Ney, H.3
  • 111
    • 84872853828 scopus 로고    scopus 로고
    • web page 136
    • "The RWTH FSA Toolkit," web page http://www-i6.informatik.rwth- aachen.de/kanthak/fsa.html. 136
    • The RWTH FSA Toolkit
  • 112
    • 0026390882 scopus 로고
    • A comparison of several approximate algorithms for finding multiple (N-best) sentence hypotheses
    • DOI: 10.1109/ICASSP.1991.15043639
    • R. Schwartz and Y. Austin, "A comparison of several approximate algorithms for finding multiple (N-best) sentence hypotheses," in Proc. ICASSP, 1990, pp. 701-704. DOI: 10.1109/ICASSP.1991.15043639
    • (1990) Proc. ICASSP , pp. 701-704
    • Schwartz, R.1    Austin, Y.2
  • 114
    • 0005670423 scopus 로고
    • A dynamic programming approach to continuous speech recognition
    • Budapest, Hungary, Paper 20 C 13, August 2
    • H. Sakoe and S. Chiba, "A dynamic programming approach to continuous speech recognition," in Proc. ICA, Budapest, Hungary, Paper 20 C 13, August 1971, pp. 65-68. 2
    • (1971) Proc. ICA , pp. 65-68
    • Sakoe, H.1    Chiba, S.2
  • 115
    • 0025627406 scopus 로고
    • The N-best algorithm:an efficient and exact procedure for finding the N most likely sentence hypotheses
    • DOI: 10.1109/ICASSP.1990.1155424
    • R.Schwartz andY.Chow, "The N-best algorithm:an efficient and exact procedure for finding the N most likely sentence hypotheses," in Proc. ICASSP, 1990, pp. 81-84. DOI: 10.1109/ICASSP.1990.1155424
    • (1990) Proc. ICASSP , pp. 81-84
    • Chow, Y.1
  • 116
    • 0033896970 scopus 로고    scopus 로고
    • Memory-efficient LVCSR search using a one-pass stack decoder
    • January DOI: 10.1006/csla.1999.01354
    • M. Schuster, "Memory-efficient LVCSR search using a one-pass stack decoder," Computer Speech & Language, vol. 14(1), pp. 47-77, January 2000. DOI: 10.1006/csla.1999.01354
    • (2000) Computer Speech & Language , vol.14 , Issue.1 , pp. 47-77
    • Schuster, M.1
  • 117
    • 0026370988 scopus 로고
    • A tree-trellis based fast search for finding the N-best sentence hypotheses in continuous speech recognition
    • DOI: 10.1109/ICASSP.1991.1504374
    • F. K. Soong and E.-F. Huang, "A tree-trellis based fast search for finding the N-best sentence hypotheses in continuous speech recognition," in Proc. ICASSP, vol. 1, 1991, pp. 705-708. DOI: 10.1109/ICASSP.1991.1504374
    • (1991) Proc. ICASSP , vol.1 , pp. 705-708
    • Soong, F.K.1    Huang, E.-F.2
  • 118
    • 85009292190 scopus 로고    scopus 로고
    • EM training of finite-state transducers and its application to pronunciation modeling
    • 132
    • H. Shu and I. L. Hetherington, "EM training of finite-state transducers and its application to pronunciation modeling," in Proc. ICSLP, 2002, pp. 1293-1296. 132
    • (2002) Proc. ICSLP , pp. 1293-1296
    • Shu, H.1    Hetherington, I.L.2
  • 119
    • 33645768509 scopus 로고    scopus 로고
    • Efficient generation of high-order context-dependent weighted finite state transducers for speech recognition
    • DOI: 10.1109/ICASSP.2005.1415085132
    • M. Schuster and T. Hori, "Efficient generation of high-order context-dependent weighted finite state transducers for speech recognition," in Proc. ICASSP, 2005, pp. 201-204. DOI: 10.1109/ICASSP.2005. 1415085132
    • (2005) Proc. ICASSP , pp. 201-204
    • Schuster, M.1    Hori, T.2
  • 120
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • 16
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech, 2011, pp. 437-440. 16
    • (2011) Proc. Interspeech , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 121
    • 33947638544 scopus 로고    scopus 로고
    • Modeling polyphone context with weighted finite-state transducers
    • DOI: 10.1109/ICASSP.2006.1659972132
    • E. Stoimenov and J. McDonough, "Modeling polyphone context with weighted finite-state transducers," in Proc. ICASSP, vol. I, 2006, pp. 121-124. DOI: 10.1109/ICASSP.2006.1659972132
    • (2006) Proc. ICASSP , vol.1 , pp. 121-124
    • Stoimenov, E.1    McDonough, J.2
  • 122
    • 0030362785 scopus 로고    scopus 로고
    • Multilingual text analysis for text-to-speech synthesis
    • DOI: 10.1017/S1351324997001654135
    • R. Sproat, "Multilingual text analysis for text-to-speech synthesis," in Proc. ICSLP, vol. 3, 1996, pp. 1365-1368. DOI: 10.1017/S1351324997001654135
    • (1996) Proc. ICSLP , vol.3 , pp. 1365-1368
    • Sproat, R.1
  • 124
    • 0033906251 scopus 로고    scopus 로고
    • MDL-based context-dependent subword modeling for speech recognition
    • DOI: 10.1250/ast.21.7919
    • K.Shinoda andT.Watanabe, "MDL-based context-dependent subword modeling for speech recognition," Acoustic Science and Technology, vol. 21, no. 2, pp. 79-86, 2000. DOI: 10.1250/ast.21.7919
    • (2000) Acoustic Science and Technology , vol.21 , Issue.2 , pp. 79-86
    • Shinoda, K.1    Watanabe, T.2
  • 125
    • 0029765807 scopus 로고    scopus 로고
    • Spontaneous dialogue speech recognition using cross-word context constrained word graphs
    • DOI: 10.1109/ICASSP.1996.540311126
    • T. Shimizu, H. Yamamoto, H. Masataki, S. Matsunaga, and Y. Sagisaka, "Spontaneous dialogue speech recognition using cross-word context constrained word graphs," in Proc. ICASSP, 1996, pp. 145-148. DOI: 10.1109/ICASSP.1996.540311126
    • (1996) Proc. ICASSP , pp. 145-148
    • Shimizu, T.1    Yamamoto, H.2    Masataki, H.3    Matsunaga, S.4    Sagisaka, Y.5
  • 126
    • 78049374440 scopus 로고    scopus 로고
    • A discriminative model for continuous speech recognition based on weighted finite state transducers
    • DOI: 10.1109/ICASSP.2010.5495096133
    • S.Watanabe, T. Hori, E. McDermott, and A. Nakamura, "A discriminative model for continuous speech recognition based on weighted finite state transducers," in Proc. ICASSP, 2010, pp. 4922-4925. DOI: 10.1109/ICASSP.2010.5495096133
    • (2010) Proc. ICASSP , pp. 4922-4925
    • Watanabe, S.1    Hori, T.2    McDermott, E.3    Nakamura, A.4
  • 127
    • 85009110509 scopus 로고    scopus 로고
    • Time and memory efficient Viterbi decoding for LVCSR using a precompiled search network
    • 95, 110 111
    • D. Willett, E. McDermott, Y. Minami, and S. Katagiri, "Time and memory efficient Viterbi decoding for LVCSR using a precompiled search network," in Proc. Eurospeech, 2001, pp. 847-850. 95, 110, 111
    • (2001) Proc. Eurospeech , pp. 847-850
    • Willett, D.1    McDermott, E.2    Minami, Y.3    Katagiri, S.4
  • 128
    • 3042741069 scopus 로고    scopus 로고
    • Variational Bayesian estimation and clustering for speech recognition
    • DOI: 10.1109/TSA.2004.82864019
    • S. Watanabe, Y. Minami, A. Nakamura, and N. Ueda, "Variational Bayesian estimation and clustering for speech recognition," IEEE Transactions on Speech and Audio Processing, vol.12, pp.365-381, 2004.DOI: 10.1109/TSA.2004.82864019
    • (2004) IEEE Transactions on Speech and Audio Processing , vol.12 , pp. 365-381
    • Watanabe, S.1    Minami, Y.2    Nakamura, A.3    Ueda, N.4
  • 129
    • 0002144369 scopus 로고
    • Tree-based state tying for high accuracy acoustics modeling
    • DOI: 10.3115/1075812.107588518
    • S. J. Young, J. J. Odell, and P. C. Woodland, "Tree-based state tying for high accuracy acoustics modeling," in Proc. ARPA Human Language Technology Workshop, 1994, pp. 307-312. DOI: 10.3115/1075812.107588518
    • (1994) Proc. ARPA Human Language Technology Workshop , pp. 307-312
    • Young, S.J.1    Odell, J.J.2    Woodland, P.C.3
  • 130
    • 84867598134 scopus 로고    scopus 로고
    • A general discriminative training algorithm for speech recognition using weighted finite-state transducers
    • Kyoto, Japan DOI: 10.1109/ICASSP.2012.6288849132
    • Y. Zhao, A. Ljolje, D. Caseiro, and B.-H. Juang, "A general discriminative training algorithm for speech recognition using weighted finite-state transducers," in Proc. ICASSP, Kyoto, Japan, 2012, pp. 4217-4220. DOI: 10.1109/ICASSP.2012.6288849132
    • (2012) Proc. ICASSP , pp. 4217-4220
    • Zhao, Y.1    Ljolje, A.2    Caseiro, D.3    Juang, B.-H.4
  • 131
    • 77949370075 scopus 로고    scopus 로고
    • A segmental CRF approach to large vocabulary continuous speech recognition
    • DOI: 10.1109/ASRU.2009.5372916133
    • G. Zweig and P. Nguyen, "A segmental CRF approach to large vocabulary continuous speech recognition," in Proc. ASRU, 2009, pp. 152-157. DOI: 10.1109/ASRU.2009.5372916133
    • (2009) Proc. ASRU , pp. 152-157
    • Zweig, G.1    Nguyen, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.