메뉴 건너뛰기




Volumn 15, Issue 4, 2007, Pages 1352-1365

Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition

Author keywords

On the fly composition; Speech recognition; Weighted finite state transducer (WFST)

Indexed keywords

HIGH-ACCURACY; LARGE VOCABULARIES; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; ON-THE-FLY COMPOSITION; ONE PASS; SEARCH ALGORITHMS; SEARCH METHODS; SEARCH SPACES; SPEECH TRANSCRIPTIONS; VITERBI SEARCHES; WEIGHTED FINITE-STATE TRANSDUCER (WFST);

EID: 45849093239     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.889790     Document Type: Article
Times cited : (153)

References (25)
  • 1
    • 0141479119 scopus 로고    scopus 로고
    • Deriving disambiguous queries in a spoken interactive ODQA system
    • C. Hori, T. Hori, H. Isozaki, E. Maeda, S. Katagiri, and S. Furui, "Deriving disambiguous queries in a spoken interactive ODQA system," in Proc. ICASSP, 2003, vol. I, pp. 624-627.
    • (2003) Proc. ICASSP , vol.1 , pp. 624-627
    • Hori, C.1    Hori, T.2    Isozaki, H.3    Maeda, E.4    Katagiri, S.5    Furui, S.6
  • 2
    • 0036460907 scopus 로고    scopus 로고
    • Weighted finite-state transducers in speech recognition
    • M. Mohri, F. Pereira, and M. Riley, "Weighted finite-state transducers in speech recognition," Comput. Speech Lang., vol. 16, pp. 69-88, 2002.
    • (2002) Comput. Speech Lang , vol.16 , pp. 69-88
    • Mohri, M.1    Pereira, F.2    Riley, M.3
  • 3
    • 0002247642 scopus 로고    scopus 로고
    • Transducer composition for contextdependent network expansion
    • M. Riley, F. Pereira, and M. Mohri, "Transducer composition for contextdependent network expansion," in Proc. Eurospeech, 1997, vol. 3, pp. 1427-1430.
    • (1997) Proc. Eurospeech , vol.3 , pp. 1427-1430
    • Riley, M.1    Pereira, F.2    Mohri, M.3
  • 4
    • 33646939678 scopus 로고    scopus 로고
    • Weighted determinization and minimization for large vocabulary speech recognition
    • M. Mohri and M. Riley, "Weighted determinization and minimization for large vocabulary speech recognition," in Proc. Eurospeech, 1997, vol. 1, pp. 131-134.
    • (1997) Proc. Eurospeech , vol.1 , pp. 131-134
    • Mohri, M.1    Riley, M.2
  • 5
    • 84962822365 scopus 로고    scopus 로고
    • A finite-state approach to machine translation
    • S. Bangalore and G. Riccardi, "A finite-state approach to machine translation," in Proc. ASRU, 2001, pp. 381-388.
    • (2001) Proc. ASRU , pp. 381-388
    • Bangalore, S.1    Riccardi, G.2
  • 6
    • 84962861457 scopus 로고    scopus 로고
    • Finite-state transducers for speech-input translation
    • F. Casacuberta, "Finite-state transducers for speech-input translation," in Proc. ASRU, 2001, pp. 375-380.
    • (2001) Proc. ASRU , pp. 375-380
    • Casacuberta, F.1
  • 7
    • 85009204481 scopus 로고    scopus 로고
    • Speech summarization using weighted finite-state transducers
    • T. Hori, C. Hori, and Y. Minami, "Speech summarization using weighted finite-state transducers," in Proc. Eurospeech, 2003, pp. 2817-2820.
    • (2003) Proc. Eurospeech , pp. 2817-2820
    • Hori, T.1    Hori, C.2    Minami, Y.3
  • 8
    • 84880839432 scopus 로고    scopus 로고
    • A rational design for a weighted finite-state transducer library
    • Proc. Int. Workshop Implementing Automata 1997
    • M. Mohri, F. Pereira, and M. Riley, "A rational design for a weighted finite-state transducer library," in Proc. Int. Workshop Implementing Automata 1997, 1997, vol. 1436, Lecture Notes in Computer Science, pp. 144-158.
    • (1997) Lecture Notes in Computer Science , vol.1436 , pp. 144-158
    • Mohri, M.1    Pereira, F.2    Riley, M.3
  • 9
    • 84962878172 scopus 로고    scopus 로고
    • Incremental language models for speech recognition using finite-state transducers
    • H. J. G. A. Dolfing and I. L. Hetherington, "Incremental language models for speech recognition using finite-state transducers," in Proc. ASRU, 2001, pp. 194-197.
    • (2001) Proc. ASRU , pp. 194-197
    • Dolfing, H.J.G.A.1    Hetherington, I.L.2
  • 10
    • 0036298116 scopus 로고    scopus 로고
    • Recent advances in efficient decoding combining on-line transducer composition and smoothed language model incorporation
    • D.Willett and S. Katagiri, "Recent advances in efficient decoding combining on-line transducer composition and smoothed language model incorporation," in Proc. ICASSP, 2002, vol. I, pp. 713-716.
    • (2002) Proc. ICASSP , vol.1 , pp. 713-716
    • Willett, D.1    Katagiri, S.2
  • 11
    • 84962787683 scopus 로고    scopus 로고
    • Transducer composition for on-the-fly lexicon and language model integration
    • D. Caseiro and I. Trancoso, "Transducer composition for on-the-fly lexicon and language model integration," in Proc. ASRU, 2001, pp. 393-396.
    • (2001) Proc. ASRU , pp. 393-396
    • Caseiro, D.1    Trancoso, I.2
  • 12
    • 0141480004 scopus 로고    scopus 로고
    • A tail-sharing WFST composition for large vocabulary speech recognition
    • [12] --, "A tail-sharing WFST composition for large vocabulary speech recognition," in ICASSP, 2003, vol. I, pp. 356-359.
    • (2003) ICASSP , vol.1 , pp. 356-359
    • Caseiro, D.1    Trancoso, I.2
  • 13
    • 0030719155 scopus 로고    scopus 로고
    • A word graph algorithm for large vocabulary continuous speech recognition
    • S. Ortmanns, H. Ney, and X. Aubert, "A word graph algorithm for large vocabulary continuous speech recognition," Comput. Speech Lang., vol. 11, pp. 43-72, 1996.
    • (1996) Comput. Speech Lang , vol.11 , pp. 43-72
    • Ortmanns, S.1    Ney, H.2    Aubert, X.3
  • 14
    • 85135253868 scopus 로고    scopus 로고
    • Efficient general lattice generation and rescoring
    • A. Ljolje, F. Pereira, and M. Riley, "Efficient general lattice generation and rescoring," in Proc. Eurospeech, 1999, pp. 1251-1254.
    • (1999) Proc. Eurospeech , pp. 1251-1254
    • Ljolje, A.1    Pereira, F.2    Riley, M.3
  • 15
    • 0029765807 scopus 로고    scopus 로고
    • Spontaneous dialogue speech recognition using cross-word context constrained word graphs
    • T. Shimizu, H. Yamamoto, H. Masataki, S. Matsunaga, and Y. Sagisaka, "Spontaneous dialogue speech recognition using cross-word context constrained word graphs," in Proc. ICASSP, 1996, pp. 145-148.
    • (1996) Proc. ICASSP , pp. 145-148
    • Shimizu, T.1    Yamamoto, H.2    Masataki, H.3    Matsunaga, S.4    Sagisaka, Y.5
  • 16
    • 0029770143 scopus 로고    scopus 로고
    • Minimizing search errors due to delayed bigrams in real-time speech recognition systems
    • M.Woszczyna and M. Finke, "Minimizing search errors due to delayed bigrams in real-time speech recognition systems," in Proc. ICASSP, 1996, pp. 137-140.
    • (1996) Proc. ICASSP , pp. 137-140
    • Woszczyna, M.1    Finke, M.2
  • 17
    • 85128392820 scopus 로고    scopus 로고
    • The BBN single-phonetic-tree fast-match algorithm
    • L. Nguyen and R. Schwartz, "The BBN single-phonetic-tree fast-match algorithm," in Proc. ICSLP, 1998, pp. 1827-1830.
    • (1998) Proc. ICSLP , pp. 1827-1830
    • Nguyen, L.1    Schwartz, R.2
  • 18
    • 0026390882 scopus 로고
    • A comparison of several approximate algorithms for finding multiple (N-BEST) sentence hypotheses
    • R. Schwartz and S. Austin, "A comparison of several approximate algorithms for finding multiple (N-BEST) sentence hypotheses," in Proc. ICASSP, 1991, pp. 701-704.
    • (1991) Proc. ICASSP , pp. 701-704
    • Schwartz, R.1    Austin, S.2
  • 19
    • 0142007749 scopus 로고    scopus 로고
    • Improved phoneme- historydependent search method for large-vocabulary continuous-speech recognition
    • T. Hori, Y. Noda, and S. Matsunaga, "Improved phoneme- historydependent search method for large-vocabulary continuous-speech recognition," IEICE Trans. Info. Syst., vol. E86-D, no. 6, pp. 1059-1067, 2003.
    • (2003) IEICE Trans. Info. Syst , vol.E86-D , Issue.6 , pp. 1059-1067
    • Hori, T.1    Noda, Y.2    Matsunaga, S.3
  • 20
    • 3042854734 scopus 로고    scopus 로고
    • Benchmark test for speech recognition using the corpus of spontaneous Japanese
    • T. Kawahara, H. Nanjo, T. Shinozaki, and S. Furui, "Benchmark test for speech recognition using the corpus of spontaneous Japanese," in Proc. SSPR, 2003, pp. 135-138.
    • (2003) Proc. SSPR , pp. 135-138
    • Kawahara, T.1    Nanjo, H.2    Shinozaki, T.3    Furui, S.4
  • 21
    • 64149119992 scopus 로고    scopus 로고
    • NTT speech recognizer with outLook on the next generation: SOLON
    • T. Hori, "NTT speech recognizer with outLook on the next generation: SOLON," in Proc. Commun. Scene Anal., 2004.
    • (2004) Proc. Commun. Scene Anal
    • Hori, T.1
  • 22
    • 1642296635 scopus 로고    scopus 로고
    • Efficient support vector classifiers for named entity recognition
    • H. Isozaki et al., "Efficient support vector classifiers for named entity recognition," in Proc. COLING, 2002, pp. 390-396.
    • (2002) Proc. COLING , pp. 390-396
    • Isozaki, H.1
  • 24
    • 85009271609 scopus 로고    scopus 로고
    • Towards automatic closed captioning: Low latency real time broadcast news transcription
    • M. Saraclar,M. Riley, E. Bocchieri, and V. Goffin, "Towards automatic closed captioning: Low latency real time broadcast news transcription," in Proc. ICSLP, 2002, pp. 1741-1744.
    • (2002) Proc. ICSLP , pp. 1741-1744
    • Saraclar, M.1    Riley, M.2    Bocchieri, E.3    Goffin, V.4
  • 25
    • 0027297381 scopus 로고
    • Vector quantization for the efficient computation of continuous density likelihoods
    • E. Bocchieri, "Vector quantization for the efficient computation of continuous density likelihoods," in Proc. ICASSP, 1993, vol. II, pp. 692-695.
    • (1993) Proc. ICASSP , vol.2 , pp. 692-695
    • Bocchieri, E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.