SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 4, 2007, Pages 1352-1365

Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition

(4) Hori, Takaaki a,b Hori, Chiori a,c Minami, Yasuhiro a Nakamura, Atsushi a

a Japan (Japan)

b MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

c CARNEGIE MELLON UNIVERSITY (United States)

Author keywords

On the fly composition; Speech recognition; Weighted finite state transducer (WFST)

Indexed keywords

HIGH-ACCURACY; LARGE VOCABULARIES; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; ON-THE-FLY COMPOSITION; ONE PASS; SEARCH ALGORITHMS; SEARCH METHODS; SEARCH SPACES; SPEECH TRANSCRIPTIONS; VITERBI SEARCHES; WEIGHTED FINITE-STATE TRANSDUCER (WFST);

CONTINUOUS SPEECH RECOGNITION; FAULT DETECTION; IMAGE CODING; LEARNING ALGORITHMS; PIEZOELECTRIC TRANSDUCERS; SPEECH ANALYSIS; SPEECH TRANSMISSION; TRANSCRIPTION; VOCABULARY CONTROL;

DECODING;

EID: 45849093239 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.889790 Document Type: Article

Times cited : (154)

References (25)

1
- 0141479119
- Deriving disambiguous queries in a spoken interactive ODQA system
- C. Hori, T. Hori, H. Isozaki, E. Maeda, S. Katagiri, and S. Furui, "Deriving disambiguous queries in a spoken interactive ODQA system," in Proc. ICASSP, 2003, vol. I, pp. 624-627.
- (2003) Proc. ICASSP , vol.1 , pp. 624-627
- Hori, C.¹ Hori, T.² Isozaki, H.³ Maeda, E.⁴ Katagiri, S.⁵ Furui, S.⁶

2
- 0036460907
- Weighted finite-state transducers in speech recognition
- M. Mohri, F. Pereira, and M. Riley, "Weighted finite-state transducers in speech recognition," Comput. Speech Lang., vol. 16, pp. 69-88, 2002.
- (2002) Comput. Speech Lang , vol.16 , pp. 69-88
- Mohri, M.¹ Pereira, F.² Riley, M.³

3
- 0002247642
- Transducer composition for contextdependent network expansion
- M. Riley, F. Pereira, and M. Mohri, "Transducer composition for contextdependent network expansion," in Proc. Eurospeech, 1997, vol. 3, pp. 1427-1430.
- (1997) Proc. Eurospeech , vol.3 , pp. 1427-1430
- Riley, M.¹ Pereira, F.² Mohri, M.³

4
- 33646939678
- Weighted determinization and minimization for large vocabulary speech recognition
- M. Mohri and M. Riley, "Weighted determinization and minimization for large vocabulary speech recognition," in Proc. Eurospeech, 1997, vol. 1, pp. 131-134.
- (1997) Proc. Eurospeech , vol.1 , pp. 131-134
- Mohri, M.¹ Riley, M.²

5
- 84962822365
- A finite-state approach to machine translation
- S. Bangalore and G. Riccardi, "A finite-state approach to machine translation," in Proc. ASRU, 2001, pp. 381-388.
- (2001) Proc. ASRU , pp. 381-388
- Bangalore, S.¹ Riccardi, G.²

6
- 84962861457
- Finite-state transducers for speech-input translation
- F. Casacuberta, "Finite-state transducers for speech-input translation," in Proc. ASRU, 2001, pp. 375-380.
- (2001) Proc. ASRU , pp. 375-380
- Casacuberta, F.¹

7
- 85009204481
- Speech summarization using weighted finite-state transducers
- T. Hori, C. Hori, and Y. Minami, "Speech summarization using weighted finite-state transducers," in Proc. Eurospeech, 2003, pp. 2817-2820.
- (2003) Proc. Eurospeech , pp. 2817-2820
- Hori, T.¹ Hori, C.² Minami, Y.³

8
- 84880839432
- A rational design for a weighted finite-state transducer library
- Proc. Int. Workshop Implementing Automata 1997
- M. Mohri, F. Pereira, and M. Riley, "A rational design for a weighted finite-state transducer library," in Proc. Int. Workshop Implementing Automata 1997, 1997, vol. 1436, Lecture Notes in Computer Science, pp. 144-158.
- (1997) Lecture Notes in Computer Science , vol.1436 , pp. 144-158
- Mohri, M.¹ Pereira, F.² Riley, M.³

9
- 84962878172
- Incremental language models for speech recognition using finite-state transducers
- H. J. G. A. Dolfing and I. L. Hetherington, "Incremental language models for speech recognition using finite-state transducers," in Proc. ASRU, 2001, pp. 194-197.
- (2001) Proc. ASRU , pp. 194-197
- Dolfing, H.J.G.A.¹ Hetherington, I.L.²

10
- 0036298116
- Recent advances in efficient decoding combining on-line transducer composition and smoothed language model incorporation
- D.Willett and S. Katagiri, "Recent advances in efficient decoding combining on-line transducer composition and smoothed language model incorporation," in Proc. ICASSP, 2002, vol. I, pp. 713-716.
- (2002) Proc. ICASSP , vol.1 , pp. 713-716
- Willett, D.¹ Katagiri, S.²

11
- 84962787683
- Transducer composition for on-the-fly lexicon and language model integration
- D. Caseiro and I. Trancoso, "Transducer composition for on-the-fly lexicon and language model integration," in Proc. ASRU, 2001, pp. 393-396.
- (2001) Proc. ASRU , pp. 393-396
- Caseiro, D.¹ Trancoso, I.²

12
- 0141480004
- A tail-sharing WFST composition for large vocabulary speech recognition
- [12] --, "A tail-sharing WFST composition for large vocabulary speech recognition," in ICASSP, 2003, vol. I, pp. 356-359.
- (2003) ICASSP , vol.1 , pp. 356-359
- Caseiro, D.¹ Trancoso, I.²

13
- 0030719155
- A word graph algorithm for large vocabulary continuous speech recognition
- S. Ortmanns, H. Ney, and X. Aubert, "A word graph algorithm for large vocabulary continuous speech recognition," Comput. Speech Lang., vol. 11, pp. 43-72, 1996.
- (1996) Comput. Speech Lang , vol.11 , pp. 43-72
- Ortmanns, S.¹ Ney, H.² Aubert, X.³

14
- 85135253868
- Efficient general lattice generation and rescoring
- A. Ljolje, F. Pereira, and M. Riley, "Efficient general lattice generation and rescoring," in Proc. Eurospeech, 1999, pp. 1251-1254.
- (1999) Proc. Eurospeech , pp. 1251-1254
- Ljolje, A.¹ Pereira, F.² Riley, M.³

15
- 0029765807
- Spontaneous dialogue speech recognition using cross-word context constrained word graphs
- T. Shimizu, H. Yamamoto, H. Masataki, S. Matsunaga, and Y. Sagisaka, "Spontaneous dialogue speech recognition using cross-word context constrained word graphs," in Proc. ICASSP, 1996, pp. 145-148.
- (1996) Proc. ICASSP , pp. 145-148
- Shimizu, T.¹ Yamamoto, H.² Masataki, H.³ Matsunaga, S.⁴ Sagisaka, Y.⁵

16
- 0029770143
- Minimizing search errors due to delayed bigrams in real-time speech recognition systems
- M.Woszczyna and M. Finke, "Minimizing search errors due to delayed bigrams in real-time speech recognition systems," in Proc. ICASSP, 1996, pp. 137-140.
- (1996) Proc. ICASSP , pp. 137-140
- Woszczyna, M.¹ Finke, M.²

17
- 85128392820
- The BBN single-phonetic-tree fast-match algorithm
- L. Nguyen and R. Schwartz, "The BBN single-phonetic-tree fast-match algorithm," in Proc. ICSLP, 1998, pp. 1827-1830.
- (1998) Proc. ICSLP , pp. 1827-1830
- Nguyen, L.¹ Schwartz, R.²

18
- 0026390882
- A comparison of several approximate algorithms for finding multiple (N-BEST) sentence hypotheses
- R. Schwartz and S. Austin, "A comparison of several approximate algorithms for finding multiple (N-BEST) sentence hypotheses," in Proc. ICASSP, 1991, pp. 701-704.
- (1991) Proc. ICASSP , pp. 701-704
- Schwartz, R.¹ Austin, S.²

19
- 0142007749
- Improved phoneme- historydependent search method for large-vocabulary continuous-speech recognition
- T. Hori, Y. Noda, and S. Matsunaga, "Improved phoneme- historydependent search method for large-vocabulary continuous-speech recognition," IEICE Trans. Info. Syst., vol. E86-D, no. 6, pp. 1059-1067, 2003.
- (2003) IEICE Trans. Info. Syst , vol.E86-D , Issue.6 , pp. 1059-1067
- Hori, T.¹ Noda, Y.² Matsunaga, S.³

20
- 3042854734
- Benchmark test for speech recognition using the corpus of spontaneous Japanese
- T. Kawahara, H. Nanjo, T. Shinozaki, and S. Furui, "Benchmark test for speech recognition using the corpus of spontaneous Japanese," in Proc. SSPR, 2003, pp. 135-138.
- (2003) Proc. SSPR , pp. 135-138
- Kawahara, T.¹ Nanjo, H.² Shinozaki, T.³ Furui, S.⁴

21
- 64149119992
- NTT speech recognizer with outLook on the next generation: SOLON
- T. Hori, "NTT speech recognizer with outLook on the next generation: SOLON," in Proc. Commun. Scene Anal., 2004.
- (2004) Proc. Commun. Scene Anal
- Hori, T.¹

22
- 1642296635
- Efficient support vector classifiers for named entity recognition
- H. Isozaki et al., "Efficient support vector classifiers for named entity recognition," in Proc. COLING, 2002, pp. 390-396.
- (2002) Proc. COLING , pp. 390-396
- Isozaki, H.¹

23
- 0012611072
- Entropy-based pruning of backoff language models
- A. Stolcke, "Entropy-based pruning of backoff language models," in Proc. DARPA Broadcast News Transcription and UnderstandingWorkshop, 1998, pp. 270-274.
- (1998) Proc. DARPA Broadcast News Transcription and UnderstandingWorkshop , pp. 270-274
- Stolcke, A.¹

24
- 85009271609
- Towards automatic closed captioning: Low latency real time broadcast news transcription
- M. Saraclar,M. Riley, E. Bocchieri, and V. Goffin, "Towards automatic closed captioning: Low latency real time broadcast news transcription," in Proc. ICSLP, 2002, pp. 1741-1744.
- (2002) Proc. ICSLP , pp. 1741-1744
- Saraclar, M.¹ Riley, M.² Bocchieri, E.³ Goffin, V.⁴

25
- 0027297381
- Vector quantization for the efficient computation of continuous density likelihoods
- E. Bocchieri, "Vector quantization for the efficient computation of continuous density likelihoods," in Proc. ICASSP, 1993, vol. II, pp. 692-695.
- (1993) Proc. ICASSP , vol.2 , pp. 692-695
- Bocchieri, E.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.