메뉴 건너뛰기




Volumn 3, Issue , 2006, Pages 1229-1232

Advances in lecture recognition: The ISL RT-06S evaluation system

Author keywords

CHIL; Distant speech; Lectures; RT 06S; Speech recognition

Indexed keywords

MICROPHONES;

EID: 44949181081     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (15)

References (23)
  • 2
    • 33947687166 scopus 로고    scopus 로고
    • Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination
    • M. Wölfel and J. McDonough, "Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination," in INTERSPEECH, 2005.
    • (2005) INTERSPEECH
    • Wölfel, M.1    McDonough, J.2
  • 3
    • 84887145372 scopus 로고    scopus 로고
    • Issues in Meeting Transcription -The ISL Meeting Transcription System
    • F. Metze, Q. Jin, C. Fügen, K. Laskowski, Y. Pan, and T. Schultz, "Issues in Meeting Transcription -The ISL Meeting Transcription System," in ICSLP, 2004.
    • (2004) ICSLP
    • Metze, F.1    Jin, Q.2    Fügen, C.3    Laskowski, K.4    Pan, Y.5    Schultz, T.6
  • 5
    • 44949186919 scopus 로고    scopus 로고
    • Minimum Variance Distortionless Response Spectral Estimation Review and Refinements
    • September
    • M. Wölfel and J. McDonough, "Minimum Variance Distortionless Response Spectral Estimation Review and Refinements," IEEE Signal Processing Magazine, September 2005.
    • (2005) IEEE Signal Processing Magazine
    • Wölfel, M.1    McDonough, J.2
  • 7
    • 33646805430 scopus 로고    scopus 로고
    • Alternate Phone Models for Conversational Speech
    • L. Lamel and J.-L. Gauvain, "Alternate Phone Models for Conversational Speech," in ICASSP, 2005.
    • (2005) ICASSP
    • Lamel, L.1    Gauvain, J.-L.2
  • 8
    • 85009080849 scopus 로고    scopus 로고
    • Speaker Segmentation and Clustering in Meetings
    • Q. Jin and T. Schultz, "Speaker Segmentation and Clustering in Meetings," in ICSLP, 2004.
    • (2004) ICSLP
    • Jin, Q.1    Schultz, T.2
  • 9
    • 85022115603 scopus 로고    scopus 로고
    • Linguistic data consortium
    • "Linguistic data consortium," http://www.ldc.upenn.edu.
  • 10
    • 0016495091 scopus 로고
    • Linear prediction: A tutorial review
    • J. Makhoul, "Linear prediction: A tutorial review," in Proc. of the IEEE, 1975, vol. 63(4), pp. 561-580.
    • (1975) Proc. of the IEEE , vol.63 , Issue.4 , pp. 561-580
    • Makhoul, J.1
  • 11
    • 84962868641 scopus 로고    scopus 로고
    • A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment
    • H. Soltau, F. Metze, C. Fügen, and A. Waibel, "A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment," in ASRU, 2001.
    • (2001) ASRU
    • Soltau, H.1    Metze, F.2    Fügen, C.3    Waibel, A.4
  • 12
    • 4243460174 scopus 로고    scopus 로고
    • Semi-tied covariance matrices
    • M. J. F. Gales, "Semi-tied covariance matrices," in ICASSP, 1998.
    • (1998) ICASSP
    • Gales, M.J.F.1
  • 13
    • 0036294871 scopus 로고    scopus 로고
    • On Maximum Mutual Information Speaker-Adapted Training
    • J. McDonough, T. Schaaf, and A. Waibel, "On Maximum Mutual Information Speaker-Adapted Training," in ICASSP, 2002.
    • (2002) ICASSP
    • McDonough, J.1    Schaaf, T.2    Waibel, A.3
  • 14
    • 0032639647 scopus 로고    scopus 로고
    • A Statistical Text-to-Phone Function Using Ngrams and Rules
    • W. M. Fisher, "A Statistical Text-to-Phone Function Using Ngrams and Rules," in ICASSP, 1999.
    • (1999) ICASSP
    • Fisher, W.M.1
  • 15
    • 85022109131 scopus 로고    scopus 로고
    • I. Bulyko, M. Ostendorf, and A. Stolcke, Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures, in Proc. HLT-NAACL, 2003, Comp., pp. 7-9.
    • I. Bulyko, M. Ostendorf, and A. Stolcke, "Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures," in Proc. HLT-NAACL, 2003, vol. Comp., pp. 7-9.
  • 16
    • 84891308106 scopus 로고    scopus 로고
    • SRILM - An Extensible Language Modeling Toolkit
    • A. Stoicke, "SRILM - An Extensible Language Modeling Toolkit," in ICSLP, 2002.
    • (2002) ICSLP
    • Stoicke, A.1
  • 17
    • 0003396042 scopus 로고    scopus 로고
    • An Empirical Study of Smoothing Techniques for Language Modeling,
    • Tech. Rep. TR-10-98, Computer Science Group, Harvard University
    • S. F Chen and J. Goodman, "An Empirical Study of Smoothing Techniques for Language Modeling," Tech. Rep. TR-10-98, Computer Science Group, Harvard University, 1998.
    • (1998)
    • Chen, S.F.1    Goodman, J.2
  • 18
    • 0003571407 scopus 로고    scopus 로고
    • The Festival Speech Synthesis System: System documentation,
    • Tech. Rep. HCRC/TR-83, Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom
    • A. W. Black and P. A. Taylor, "The Festival Speech Synthesis System: System documentation," Tech. Rep. HCRC/TR-83, Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom, 1997.
    • (1997)
    • Black, A.W.1    Taylor, P.A.2
  • 19
    • 0030705337 scopus 로고    scopus 로고
    • Speaker Normalization Based on Frequency Warping
    • P. Zhan and M. Westphal, "Speaker Normalization Based on Frequency Warping," in ICASSP, 1997.
    • (1997) ICASSP
    • Zhan, P.1    Westphal, M.2
  • 20
    • 0003454539 scopus 로고    scopus 로고
    • Maximum Likelihood Linear Transformations for HMM-based Speech Recognition,
    • Tech. Rep, Cambridge University, Cambridge, United Kingdom
    • M. J. F. Gales, "Maximum Likelihood Linear Transformations for HMM-based Speech Recognition," Tech. Rep., Cambridge University, Cambridge, United Kingdom, 1997.
    • (1997)
    • Gales, M.J.F.1
  • 21
    • 0029288633 scopus 로고
    • Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models
    • C. J. Leggetter and P. C. Woodland, "Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models," Computer Speech and Language, vol. 9, pp. 171-185, 1995.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 22
    • 85135271674 scopus 로고    scopus 로고
    • Finding Consensus among Words: Lattice-based Word Error Minimization
    • L. Mangu, E. Brill, and A. Stolcke, "Finding Consensus among Words: Lattice-based Word Error Minimization," in EUROSPEECH, 1999.
    • (1999) EUROSPEECH
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 23
    • 44949114262 scopus 로고    scopus 로고
    • Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures
    • M. Wölfel, C. Fügen, S. Ikbal, and J. W. McDonough, "Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures," in INTERSPEECH, 2006.
    • (2006) INTERSPEECH
    • Wölfel, M.1    Fügen, C.2    Ikbal, S.3    McDonough, J.W.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.