메뉴 건너뛰기




Volumn , Issue , 2012, Pages 87-90

The 2012 KIT and KIT-NAIST English ASR Systems for the IWSLT Evaluation

Author keywords

evaluation system; IWSLT; speech recognition; system development; TED talks

Indexed keywords

CONFUSION NETWORKS; EVALUATION SYSTEM; FRONT END; IWSLT; SPEECH-TO-TEXT SYSTEM; SYSTEM DEVELOPMENT; TED TALK;

EID: 84906235762     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (5)

References (17)
  • 4
    • 85032772258 scopus 로고    scopus 로고
    • Minimum variance distortionless response spectralestimation, review and refinements
    • September
    • M. Wölfel and J. McDonough, “Minimum variance distortionless response spectralestimation, review and refinements,” IEEE Signal Processing Magazine, vol. 22, no. 5, pp. 117-126, September 2005.
    • (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 117-126
    • Wölfel, M.1    McDonough, J.2
  • 5
    • 0030705337 scopus 로고    scopus 로고
    • Speaker normalization based on frequency warping
    • Munich, Germany, April
    • P. Zhan and M. Westphal, “Speaker normalization based on frequency warping,” in ICASSP, Munich, Germany, April 1997.
    • (1997) ICASSP
    • Zhan, P.1    Westphal, M.2
  • 8
    • 0003571407 scopus 로고    scopus 로고
    • Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom, Tech. Rep. HCRC/TR-83
    • A. W. Black and P. A. Taylor, “The Festival Speech Synthesis System: System documentation,” Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom, Tech. Rep. HCRC/TR-83, 1997.
    • (1997) The Festival Speech Synthesis System: System documentation
    • Black, A. W.1    Taylor, P. A.2
  • 9
    • 41049105254 scopus 로고    scopus 로고
    • Joint-sequence models for grapheme-to-phoneme conversion
    • May
    • M. Bisani and H. Ney, “Joint-sequence models for grapheme-to-phoneme conversion,” Speech Communication, vol. 50, May 2008.
    • (2008) Speech Communication , vol.50
    • Bisani, M.1    Ney, H.2
  • 10
    • 84891308106 scopus 로고    scopus 로고
    • Srilm - an extensible language modeling toolkit
    • A. Stolcke, “Srilm - an extensible language modeling toolkit,” in ICSLP, 2002.
    • (2002) ICSLP
    • Stolcke, A.1
  • 12
    • 84962868641 scopus 로고    scopus 로고
    • A one-pass decoder based on polymorphic linguistic context assignment
    • H. Soltau, F. Metze, C. Fuegen, and A. Waibel, “A one-pass decoder based on polymorphic linguistic context assignment,” in ASRU, 2001.
    • (2001) ASRU
    • Soltau, H.1    Metze, F.2    Fuegen, C.3    Waibel, A.4
  • 14
    • 0030638031 scopus 로고    scopus 로고
    • A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (rover)
    • Santa Barbara, CA, USA: IEEE, December
    • J. Fiscus, “A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (rover),” in Proceedings the IEEE Workshop on Automatic Speech Recognition and Understanding. Santa Barbara, CA, USA: IEEE, December 1997, pp. 347-354.
    • (1997) Proceedings the IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 347-354
    • Fiscus, J.1
  • 15
    • 0034296009 scopus 로고    scopus 로고
    • Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
    • October
    • L. Mangu, E. Brill, and A. Stolcke, “Finding consensus in speech recognition: Word error minimization and other applications of confusion networks,” Computer Speech and Language, vol. 14, no. 4, pp. 373-400, October 2000.
    • (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 373-400
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 16
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models
    • C. Leggetter and P. Woodland, “Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models,” Computer Speech and Language, vol. 9, pp. 171-185, 1995.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.1    Woodland, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.