메뉴 건너뛰기




Volumn , Issue , 2012, Pages 4209-4212

A comparison of dynamic WFST decoding approaches

Author keywords

on the fly composition; Speech recognition; WFST

Indexed keywords

DYNAMIC CONSTRUCTION; ERROR RATE; LARGE VOCABULARY SPEECH RECOGNITION; LOOK-AHEAD; MEMORY USAGE; ON-THE-FLY; REAL TIME; WFST;

EID: 84867588266     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2012.6288847     Document Type: Conference Paper
Times cited : (13)

References (27)
  • 2
    • 70450183653 scopus 로고    scopus 로고
    • A generalized composition algorithm for weighted finite-state transducers
    • C. Allauzen, M. Riley, and J. Schalkwyk, "A generalized composition algorithm for weighted finite-state transducers," in Proc. Interspeech, 2000, pp. 1203-1206.
    • Proc. Interspeech, 2000 , pp. 1203-1206
    • Allauzen, C.1    Riley, M.2    Schalkwyk, J.3
  • 3
    • 84962787683 scopus 로고    scopus 로고
    • Transducer composition for on-the-fly lexicon and language model integration
    • D. Caseiro and I. Trancoso, "Transducer composition for on-the-fly lexicon and language model integration," in Proc. ASRU, 2001, pp. 393-396.
    • Proc. ASRU, 2001 , pp. 393-396
    • Caseiro, D.1    Trancoso, I.2
  • 4
    • 85009252175 scopus 로고    scopus 로고
    • Using dynamic WFST composition for recognizing broadcast news
    • D. Caseiro and I. Trancoso, "Using dynamic WFST composition for recognizing broadcast news," in Proc. ICSLP, 2002, pp. 1301-1304.
    • Proc. ICSLP, 2002 , pp. 1301-1304
    • Caseiro, D.1    Trancoso, I.2
  • 5
    • 85009063824 scopus 로고    scopus 로고
    • Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition
    • T. Hori, C. Hori, and Y. Minami, "Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition," in Proc. Interspeech, 2004, pp. 289-292.
    • Proc. Interspeech, 2004 , pp. 289-292
    • Hori, T.1    Hori, C.2    Minami, Y.3
  • 6
    • 34547517191 scopus 로고    scopus 로고
    • Generalized fast on-the-fly composition algorithm for WFST-based speech recognition
    • T. Hori and A. Nakamura, "Generalized fast on-the-fly composition algorithm for WFST-based speech recognition," in Proc. Interspeech, 2005, pp. 847-850.
    • Proc. Interspeech, 2005 , pp. 847-850
    • Hori, T.1    Nakamura, A.2
  • 7
    • 34047273021 scopus 로고    scopus 로고
    • A specialized on-the-fly algorithm for lexicon and language model composition
    • D. A. Caseiro and I. Trancoso, "A specialized on-the-fly algorithm for lexicon and language model composition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 4, pp. 1281-1291, 2006.
    • (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , Issue.4 , pp. 1281-1291
    • Caseiro, D.A.1    Trancoso, I.2
  • 8
    • 45849093239 scopus 로고    scopus 로고
    • Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition
    • T. Hori, C. Hori, Y. Minami, and A. Nakamura, "Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition," IEEE Transactions on Audio, Speech and Language Processing, vol. 15, pp. 1352-1365, 2007.
    • (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , pp. 1352-1365
    • Hori, T.1    Hori, C.2    Minami, Y.3    Nakamura, A.4
  • 9
    • 84867223796 scopus 로고    scopus 로고
    • Implementation and evaluation of fast on-the-fly WFST composition algorithms
    • T. Oonishi, P. R. Dixon, K. Iwano, and S. Furui, "Implementation and evaluation of fast on-the-fly WFST composition algorithms," in Proc. Interspeech, 2008, pp. 2110-2113.
    • Proc. Interspeech, 2008 , pp. 2110-2113
    • Oonishi, T.1    Dixon, P.R.2    Iwano, K.3    Furui, S.4
  • 11
    • 79959854261 scopus 로고    scopus 로고
    • On-the-fly lattice rescoring for real-time automatic speech recognition
    • H. Sak, M. Saraclar, and T. Gungor, "On-the-fly lattice rescoring for real-time automatic speech recognition," in Proc. Interspeech, 2010, pp. 2450-2453.
    • Proc. Interspeech, 2010 , pp. 2450-2453
    • Sak, H.1    Saraclar, M.2    Gungor, T.3
  • 12
    • 80051622488 scopus 로고    scopus 로고
    • A comparative analysis of dynamic network decoding
    • D. Rybach, R. Schluter, and H. Ney, "A comparative analysis of dynamic network decoding," in Proc. ICASPP, 2011, pp. 5184-5187.
    • Proc. ICASPP, 2011 , pp. 5184-5187
    • Rybach, D.1    Schluter, R.2    Ney, H.3
  • 13
    • 80051634911 scopus 로고    scopus 로고
    • A comparison of two LVR search optimization techniques
    • S. Kanthak, H. Ney, M. Riley, and M. Mohri, "A comparison of two LVR search optimization techniques," in Proc. ICSLP, 2002, pp. 1309-1312.
    • Proc. ICSLP, 2002 , pp. 1309-1312
    • Kanthak, S.1    Ney, H.2    Riley, M.3    Mohri, M.4
  • 14
    • 77949347726 scopus 로고    scopus 로고
    • Dynamic network decoding revisited
    • H. Soltau and G. Saon, "Dynamic network decoding revisited," in Proc. ASRU, 2009, pp. 276-281.
    • (2009) Proc. ASRU , pp. 276-281
    • Soltau, H.1    Saon, G.2
  • 15
    • 79959851726 scopus 로고    scopus 로고
    • An empirical comparison of the t3, juicer, hdecode and sphinx3 decoders
    • J. R. Novak, P. R. Dixon, and S. Furui, "An empirical comparison of the t3, juicer, hdecode and sphinx3 decoders," in Proc. Interspeech, 2010, pp. 1890-1893.
    • Proc. Interspeech, 2010 , pp. 1890-1893
    • Novak, J.R.1    Dixon, P.R.2    Furui, S.3
  • 16
    • 0002247642 scopus 로고    scopus 로고
    • Transducer composition for context-dependent network expansion
    • M. Riley, F. Pereira, and M. Mohri, "Transducer composition for context-dependent network expansion," in Proc. Eurospeech, 1997, pp. 1427-1430.
    • Proc. Eurospeech, 1997 , pp. 1427-1430
    • Riley, M.1    Pereira, F.2    Mohri, M.3
  • 17
    • 84892168937 scopus 로고    scopus 로고
    • Full expansion of context-dependent networks in large vocabulary speech recognition
    • M. Mohri, M. Riley, D. Hindle, A. Ljolje, and F. Pereira, "Full expansion of context-dependent networks in large vocabulary speech recognition," in Proc. ICASSP, 1998, pp. 393-396.
    • Proc. ICASSP, 1998 , pp. 393-396
    • Mohri, M.1    Riley, M.2    Hindle, D.3    Ljolje, A.4    Pereira, F.5
  • 20
    • 84962878172 scopus 로고    scopus 로고
    • Incremental language models for speech recognition using finite-state transducers
    • H. J. G. A. Dolfing and I. L. Hetherington, "Incremental language models for speech recognition using finite-state transducers," in Proc ASRU, 2001, pp. 194-197.
    • Proc ASRU, 2001 , pp. 194-197
    • Dolfing, H.J.G.A.1    Hetherington, I.L.2
  • 21
    • 0036298116 scopus 로고    scopus 로고
    • Recent advances in efficient decoding combining on-line transducer composition and smoothed language model incorporation
    • D.Willett and S. Katagiri, "Recent advances in efficient decoding combining on-line transducer composition and smoothed language model incorporation," in Proc. ICASSP, 2002, pp. 713-716.
    • Proc. ICASSP, 2002 , pp. 713-716
    • Willett, D.1    Katagiri, S.2
  • 22
    • 34547544207 scopus 로고    scopus 로고
    • A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition
    • O. Cheng, J. Dines, and M. M. Doss, "A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition," in Proc ICASSP, 2007, pp. 348-351.
    • Proc ICASSP, 2007 , pp. 348-351
    • Cheng, O.1    Dines, J.2    Doss, M.M.3
  • 23
    • 84867624751 scopus 로고    scopus 로고
    • An algorithm for fast composition of weighted finite-state transducers
    • J. McDonough, E. Stoimenov, and D. Klakow, "An algorithm for fast composition of weighted finite-state transducers," in Proc. ASRU, 2007, pp. 1-4.
    • Proc. ASRU, 2007 , pp. 1-4
    • McDonough, J.1    Stoimenov, E.2    Klakow, D.3
  • 25
    • 84867199458 scopus 로고    scopus 로고
    • Iterative language model estimation: Efficient data structure & algorithms
    • H. Bo-June and J. Glass, "Iterative language model estimation: Efficient data structure & algorithms," in Proc. Interspeech, 2008, pp. 841-844.
    • Proc. Interspeech, 2008 , pp. 841-844
    • Bo-June, H.1    Glass, J.2
  • 26
    • 67349176526 scopus 로고    scopus 로고
    • The Titech large vocabulary WFST speech recognition system
    • P. R. Dixon, D. A. Caseiro, T. Oonishi, and S. Furui, "The Titech large vocabulary WFST speech recognition system," in Proc. ASRU, 2007, pp. 1301-1304.
    • Proc. ASRU, 2007 , pp. 1301-1304
    • Dixon, P.R.1    Caseiro, D.A.2    Oonishi, T.3    Furui, S.4
  • 27
    • 38149133882 scopus 로고    scopus 로고
    • OpenFst: A general and efficient weighted finite-state transducer library
    • C. Allauzen, M. Riley, J. Schalkwyk, W. Skut, and M. Mohri, "OpenFst: A general and efficient weighted finite-state transducer library," in Proc. of CIAA 2007, 2007, pp. 11-23.
    • (2007) Proc. of CIAA 2007 , pp. 11-23
    • Allauzen, C.1    Riley, M.2    Schalkwyk, J.3    Skut, W.4    Mohri, M.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.