SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

2012 International Workshop on Spoken Language Translation, IWSLT 2012

Volumn , Issue , 2012, Pages 87-90

The 2012 KIT and KIT-NAIST English ASR Systems for the IWSLT Evaluation

(12) Saam, Christian a Mohr, Christian a Kilgour, Kevin a Heck, Michael a Sperber, Matthias a Kubo, Keigo b Stüker, Sebastian a Sakti, Sakriani b Neubig, Graham b Toda, Tomoki b Nakamura, Satoshi b Waibel, Alex a

a KARLSRUHE INSTITUTE OF TECHNOLOGY (Germany)

b NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

Author keywords

evaluation system; IWSLT; speech recognition; system development; TED talks

Indexed keywords

CONFUSION NETWORKS; EVALUATION SYSTEM; FRONT END; IWSLT; SPEECH-TO-TEXT SYSTEM; SYSTEM DEVELOPMENT; TED TALK;

SPEECH RECOGNITION;

EID: 84906235762 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (5)

References (17)

1
- 85045373614
- Overview of the iwslt 2012 evaluation campaign
- Hong Kong, December 6-7
- M. Federico, L. Bentivogli, M. Paul, and S. Stüker, “Overview of the iwslt 2012 evaluation campaign,” in Proceedings of the International Workshop on Spoken Language Translation (IWSLT) 2012, Hong Kong, December 6-7 2012.
- (2012) Proceedings of the International Workshop on Spoken Language Translation (IWSLT) 2012
- Federico, M.¹ Bentivogli, L.² Paul, M.³ Stüker, S.⁴

2
- 85133205526
- The 2011 kit english asr system for the iwslt evaluation
- San Francisco, December 8-9
- S. Stüker, K. Kilgour, C. Saam, and A. Waibel, “The 2011 kit english asr system for the iwslt evaluation,” in Proceedings of the International Workshop on Spoken Language Translation (IWSLT) 2011, San Francisco, December 8-9 2011.
- (2011) Proceedings of the International Workshop on Spoken Language Translation (IWSLT) 2011
- Stüker, S.¹ Kilgour, K.² Saam, C.³ Waibel, A.⁴

3
- 85133179054
- The kit-naist (contrastive) english asr system for iwslt 2012
- Hong Kong, December 6-7
- M. Heck, K. Kubo, M. Sperber, S. Sakti, S. Stüker, K. Kilgour, C. Mohr, C. Saam, G. Neubig, T. Toda, S. Nakamura, and A. Waibel, “The kit-naist (contrastive) english asr system for iwslt 2012,” in Proceedings of the International Workshop on Spoken Language Translation (IWSLT) 2012, Hong Kong, December 6-7 2012.
- (2012) Proceedings of the International Workshop on Spoken Language Translation (IWSLT) 2012
- Heck, M.¹ Kubo, K.² Sperber, M.³ Sakti, S.⁴ Stüker, S.⁵ Kilgour, K.⁶ Mohr, C.⁷ Saam, C.⁸ Neubig, G.⁹ Toda, T.¹⁰ Nakamura, S.¹¹ Waibel, A.¹²

4
- 85032772258
- Minimum variance distortionless response spectralestimation, review and refinements
- September
- M. Wölfel and J. McDonough, “Minimum variance distortionless response spectralestimation, review and refinements,” IEEE Signal Processing Magazine, vol. 22, no. 5, pp. 117-126, September 2005.
- (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 117-126
- Wölfel, M.¹ McDonough, J.²

5
- 0030705337
- Speaker normalization based on frequency warping
- Munich, Germany, April
- P. Zhan and M. Westphal, “Speaker normalization based on frequency warping,” in ICASSP, Munich, Germany, April 1997.
- (1997) ICASSP
- Zhan, P.¹ Westphal, M.²

6
- 85009097225
- On using mlp features in lvcsr
- Citeseer
- Q. Zhu, B. Chen, N. Morgan, and A. Stolcke, “On using mlp features in lvcsr,” in Proceedings of ICSLP. Citeseer, 2004.
- (2004) Proceedings of ICSLP
- Zhu, Q.¹ Chen, B.² Morgan, N.³ Stolcke, A.⁴

7
- 84890452769
- K. Kilgour, C. Saam, C. Mohr, S. Stüker, and A. Waibel, “The 2011 kit quaero speech-to-text system for spanish,” 2011.
- (2011) The 2011 kit quaero speech-to-text system for spanish
- Kilgour, K.¹ Saam, C.² Mohr, C.³ Stüker, S.⁴ Waibel, A.⁵

8
- 0003571407
- Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom, Tech. Rep. HCRC/TR-83
- A. W. Black and P. A. Taylor, “The Festival Speech Synthesis System: System documentation,” Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom, Tech. Rep. HCRC/TR-83, 1997.
- (1997) The Festival Speech Synthesis System: System documentation
- Black, A. W.¹ Taylor, P. A.²

9
- 41049105254
- Joint-sequence models for grapheme-to-phoneme conversion
- May
- M. Bisani and H. Ney, “Joint-sequence models for grapheme-to-phoneme conversion,” Speech Communication, vol. 50, May 2008.
- (2008) Speech Communication , vol.50
- Bisani, M.¹ Ney, H.²

10
- 84891308106
- Srilm - an extensible language modeling toolkit
- A. Stolcke, “Srilm - an extensible language modeling toolkit,” in ICSLP, 2002.
- (2002) ICSLP
- Stolcke, A.¹

11
- 84883091818
- Arxiv preprint cs/0306022
- A. Venkataraman and W. Wang, “Techniques for effective vocabulary selection,” Arxiv preprint cs/0306022, 2003.
- (2003) Techniques for effective vocabulary selection
- Venkataraman, A.¹ Wang, W.²

12
- 84962868641
- A one-pass decoder based on polymorphic linguistic context assignment
- H. Soltau, F. Metze, C. Fuegen, and A. Waibel, “A one-pass decoder based on polymorphic linguistic context assignment,” in ASRU, 2001.
- (2001) ASRU
- Soltau, H.¹ Metze, F.² Fuegen, C.³ Waibel, A.⁴

13
- 44849122416
- Cross-system adaptation and combination for continuous speech recognition: The influence of phoneme set and acoustic front-end
- Pittsburgh, PA, USA: ISCA, September
- S. Stüker, C. Fügen, S. Burger, and M. Wölfel, “Cross-system adaptation and combination for continuous speech recognition: The influence of phoneme set and acoustic front-end,” in Proceedings of the 9th International Conference on Spoken Language Processing (Interspeech 2006, ICSLP). Pittsburgh, PA, USA: ISCA, September 2006, pp. 521-524.
- (2006) Proceedings of the 9th International Conference on Spoken Language Processing (Interspeech 2006, ICSLP) , pp. 521-524
- Stüker, S.¹ Fügen, C.² Burger, S.³ Wölfel, M.⁴

14
- 0030638031
- A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (rover)
- Santa Barbara, CA, USA: IEEE, December
- J. Fiscus, “A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (rover),” in Proceedings the IEEE Workshop on Automatic Speech Recognition and Understanding. Santa Barbara, CA, USA: IEEE, December 1997, pp. 347-354.
- (1997) Proceedings the IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 347-354
- Fiscus, J.¹

15
- 0034296009
- Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
- October
- L. Mangu, E. Brill, and A. Stolcke, “Finding consensus in speech recognition: Word error minimization and other applications of confusion networks,” Computer Speech and Language, vol. 14, no. 4, pp. 373-400, October 2000.
- (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 373-400
- Mangu, L.¹ Brill, E.² Stolcke, A.³

16
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models
- C. Leggetter and P. Woodland, “Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models,” Computer Speech and Language, vol. 9, pp. 171-185, 1995.
- (1995) Computer Speech and Language , vol.9 , pp. 171-185
- Leggetter, C.¹ Woodland, P.²

17
- 0029375590
- Speaker adaptation using constrained estimation of gaussian mixtures
- V. Digalakis, D. Rtischev, and L. Neumeyer, “Speaker adaptation using constrained estimation of gaussian mixtures,” Speech and Audio Processing, IEEE Transactions on, vol. 3, no. 5, pp. 357-366, 1995.
- (1995) Speech and Audio Processing, IEEE Transactions on , vol.3 , Issue.5 , pp. 357-366
- Digalakis, V.¹ Rtischev, D.² Neumeyer, L.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.