Volume 4299 LNCS, 2006, Pages 407-418

The ISL RT-06S speech-to-text system

Author keywords

[No Author keywords available]

Indexed keywords

CURRENT SYSTEM; INTERACTIVE SYSTEM; LANGUAGE MODEL; NATIONAL INSTITUTE OF STANDARDS AND TECHNOLOGY; PREVIOUS YEAR; SPEECH SEGMENTATION; SPEECH-TO-TEXT SYSTEM

EID: 70349220516     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/11965152_36     Document Type: Conference Paper
Times cited: 6

References (31)
  • 2
    • 33947687166
    • Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination
    • M. Wölfel and J. McDonough, "Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination," in INTERSPEECH, 2005.
    • (2005) INTERSPEECH
    • Wölfel, M.1    McDonough, J.2
  • 3
    • 84887145372
    • Issues in Meeting Transcription - The ISL Meeting Transcription System
    • F. Metze, Q. Jin, C. Fügen, K. Laskowski, Y. Pan, and T. Schultz, "Issues in Meeting Transcription - The ISL Meeting Transcription System," in ICSLP, 2004.
    • (2004) ICSLP
    • Metze, F.1    Jin, Q.2    Fügen, C.3    Laskowski, K.4    Pan, Y.5    Schultz, T.6
  • 4
    • 44949186919
    • Minimum Variance Distortionless Response Spectral Estimation Review and Refinements
    • September
    • M. Wölfel and J. McDonough, "Minimum Variance Distortionless Response Spectral Estimation Review and Refinements," IEEE Signal Processing Magazine, September 2005.
    • (2005) IEEE Signal Processing Magazine
    • Wölfel, M.1    McDonough, J.2
  • 5
    • 44849122416
    • Cross-System Adaptation and Combination for Continuous Speech Recognition: The Influence of Phoneme Set and Acoustic Front-End
    • S. Stüker, C. Fügen, S. Burger, and M. Wölfel, "Cross-System Adaptation and Combination for Continuous Speech Recognition: The Influence of Phoneme Set and Acoustic Front-End," in INTERSPEECH, 2006.
    • (2006) INTERSPEECH
    • Stüker, S.1    Fügen, C.2    Burger, S.3    Wölfel, M.4
  • 6
    • 85009080849
    • Speaker Segmentation and Clustering in Meetings
    • Q. Jin and T. Schultz, "Speaker Segmentation and Clustering in Meetings," in ICSLP, 2004.
    • (2004) ICSLP
    • Jin, Q.1    Schultz, T.2
  • 8
    • 0016495091
    • Linear Prediction: A Tutorial Review
    • J. Makhoul, "Linear Prediction: A Tutorial Review," Proc. of the IEEE, vol. 63, no. 4, pp. 561-580, 1975.
    • (1975) Proc. of the IEEE , vol.63 , Issue.4 , pp. 561-580
    • Makhoul, J.1
  • 10
    • 0141469852
    • Multispeaker Speech Activity Detection for the ICSI Meeting Recorder
    • T. Pfau, D. P. W. Ellis, and A. Stolcke, "Multispeaker Speech Activity Detection for the ICSI Meeting Recorder," in Proc. ASRU, 2001.
    • (2001) Proc. ASRU
    • Pfau, T.1    Ellis, D.P.W.2    Stolcke, A.3
  • 12
    • 33947615205
    • Unsupervised Learning of Overlapped Speech Model Parameters for Multichannel Speech Activity Detection in Meetings
    • K. Laskowski and T. Schultz, "Unsupervised Learning of Overlapped Speech Model Parameters for Multichannel Speech Activity Detection in Meetings," in Proc. ICASSP, 2006.
    • (2006) Proc. ICASSP
    • Laskowski, K.1    Schultz, T.2
  • 13
    • 33947640630
    • Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap
    • Ö. Çetin and E. Shriberg, "Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap," in Proc. ICASSP, 2006.
    • (2006) Proc. ICASSP
    • Çetin, O.1    Shriberg, E.2
  • 14
    • 84962868641
    • A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment
    • H. Soltau, F. Metze, C. Fügen, and A. Waibel, "A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment," in ASRU, 2001.
    • (2001) ASRU
    • Soltau, H.1    Metze, F.2    Fügen, C.3    Waibel, A.4
  • 15
    • 4243460174
    • Semi-tied covariance matrices
    • M. J. F. Gales, "Semi-tied covariance matrices," in ICASSP, 1998.
    • (1998) ICASSP
    • Gales, M.J.F.1
  • 16
    • 0036294871
    • On Maximum Mutual Information Speaker-Adapted Training
    • J. McDonough, T. Schaaf, and A. Waibel, "On Maximum Mutual Information Speaker-Adapted Training," in ICASSP, 2002.
    • (2002) ICASSP
    • McDonough, J.1    Schaaf, T.2    Waibel, A.3
  • 17
    • 0032639647
    • A Statistical Text-to-Phone Function Using Ngrams and Rules
    • W. M. Fisher, "A Statistical Text-to-Phone Function Using Ngrams and Rules," in ICASSP, 1999.
    • (1999) ICASSP
    • Fisher, W.M.1
  • 18
    • 84891308106
    • SRILM - An Extensible Language Modeling Toolkit
    • A. Stolcke, "SRILM - An Extensible Language Modeling Toolkit," in ICSLP, 2002.
    • (2002) ICSLP
    • Stolcke, A.1
  • 19
    • 0003396042
    • An Empirical Study of Smoothing Techniques for Language Modeling
    • Computer Science Group, Harvard University, Tech. Rep. TR-10-98
    • S. F. Chen and J. Goodman, "An Empirical Study of Smoothing Techniques for Language Modeling," Computer Science Group, Harvard University, Tech. Rep. TR-10-98, 1998.
    • (1998)
    • Chen, S.F.1    Goodman, J.2
  • 20
    • 44949090835
    • Getting more Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures
    • I. Bulyko, M. Ostendorf, and A. Stolcke, "Getting more Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures," in Proc. HLT-NAACL, 2003.
    • (2003) Proc. HLT-NAACL
    • Bulyko, I.1    Ostendorf, M.2    Stolcke, A.3
  • 22
    • 85009223249
    • Techniques for Effective Vocabulary Selection
    • A. Venkataraman and W. Wang, "Techniques for Effective Vocabulary Selection," in Proc. Eurospeech, 2003.
    • (2003) Proc. Eurospeech
    • Venkataraman, A.1    Wang, W.2
  • 23
    • 0003571407
    • Human Communication Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kingdom, Tech. Rep. HCRC/TR-83
    • A. W. Black and P. A. Taylor, "The Festival Speech Synthesis System: System documentation," Human Communication Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kingdom, Tech. Rep. HCRC/TR-83, 1997.
    • (1997) The Festival Speech Synthesis System: System documentation
    • Black, A.W.1    Taylor, P.A.2
  • 24
    • 0030705337
    • Speaker Normalization Based on Frequency Warping
    • P. Zhan and M. Westphal, "Speaker Normalization Based on Frequency Warping," in ICASSP, 1997.
    • (1997) ICASSP
    • Zhan, P.1    Westphal, M.2
  • 26
    • 0029288633
    • Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models
    • C. J. Leggetter and P. C. Woodland, "Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models," Computer Speech and Language, vol. 9, pp. 171-185, 1995.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 28
    • 33646805430
    • Alternate Phone Models for Conversational Speech
    • L. Lamel and J.-L. Gauvain, "Alternate Phone Models for Conversational Speech," in ICASSP, 2005.
    • (2005) ICASSP
    • Lamel, L.1    Gauvain, J.-L.2
  • 29
    • 85135271674
    • Finding Consensus among Words: Lattice-based Word Error Minimization
    • L. Mangu, E. Brill, and A. Stolcke, "Finding Consensus among Words: Lattice-based Word Error Minimization," in EUROSPEECH, 1999.
    • (1999) EUROSPEECH
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 30
    • 44949114262
    • Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures
    • M. Wölfel, C. Fügen, S. Ikbal, and J. W. McDonough, "Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures," in INTERSPEECH, 2006.
    • (2006) INTERSPEECH
    • Wölfel, M.1    Fügen, C.2    Ikbal, S.3    McDonough, J.W.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.