SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 4299 LNCS, Issue , 2006, Pages 407-418

The ISL RT-06S speech-to-text system

(9) Fügen, Christian a Ikbal, Shajith a Kraft, Florian a Kumatani, Kenichi a Laskowski, Kornel a McDonough, John W a Ostendorf, Mari a,b Stüker, Sebastian a Wölfel, Matthias a

a UNIVERSITY OF KARLSRUHE (Germany)

b University of Washington (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CURRENT SYSTEM; INTERACTIVE SYSTEM; LANGUAGE MODEL; NATIONAL INSTITUTE OF STANDARDS AND TECHNOLOGY; PREVIOUS YEAR; SPEECH SEGMENTATION; SPEECH-TO-TEXT SYSTEM;

COMPUTATIONAL LINGUISTICS; INTERACTIVE COMPUTER SYSTEMS; LEARNING SYSTEMS; USER INTERFACES;

MICROPHONES;

EID: 70349220516 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/11965152_36 Document Type: Conference Paper

Times cited : (6)

References (31)

1
- 33947687235
- Open Domain Speech Recognition & Translation: Lectures and Speeches
- C. Fügen, M. Kolss, D. Bernreuther, M. Paulik, S. Stüker, S. Vogel, and A. Waibel, "Open Domain Speech Recognition & Translation: Lectures and Speeches," in ICASSP, 2006.
- (2006) ICASSP
- Fügen, C.¹ Kolss, M.² Bernreuther, D.³ Paulik, M.⁴ Stüker, S.⁵ Vogel, S.⁶ Waibel, A.⁷

2
- 33947687166
- Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination
- M. Wölfel and J. McDonough, "Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination," in INTERSPEECH, 2005.
- (2005) INTERSPEECH
- Wölfel, M.¹ McDonough, J.²

3
- 84887145372
- Issuesin Meeting Transcription - The ISL Meeting Transcription System
- F. Metze, Q. Jin, C. Fügen, K. Laskowski, Y. Pan, and T. Schultz, "Issuesin Meeting Transcription - The ISL Meeting Transcription System," in ICSLP, 2004.
- (2004) ICSLP
- Metze, F.¹ Jin, Q.² Fügen, C.³ Laskowski, K.⁴ Pan, Y.⁵ Schultz, T.⁶

4
- 44949186919
- Minimum Variance Distortionless Response Spectral Estimation Review and Refinements
- September
- M. Wölfel and J. McDonough, "Minimum Variance Distortionless Response Spectral Estimation Review and Refinements," IEEE Signal Processing Magazine, September 2005.
- (2005) IEEE Signal Processing Magazine
- Wölfel, M.¹ McDonough, J.²

5
- 44849122416
- Cross-System Adaptation and Combination for Continuous Speech Recognition: The Influence of Phoneme Set and Acoustic Front-End
- S. Stüker, C. Fügen, S. Burger, and M. Wölfel, "Cross-System Adaptation and Combination for Continuous Speech Recognition: The Influence of Phoneme Set and Acoustic Front-End," in INTERSPEECH, 2006.
- (2006) INTERSPEECH
- Stüker, S.¹ Fügen, C.² Burger, S.³ Wölfel, M.⁴

6
- 85009080849
- Speaker Segmentation and Clustering in Meetings
- Q. Jin and T. Schultz, "Speaker Segmentation and Clustering in Meetings," in ICSLP, 2004.
- (2004) ICSLP
- Jin, Q.¹ Schultz, T.²

7
- 44849131194
- The ISL TC-STAR Spring 2006 ASR Evaluation Systems
- S. Stüker, C. Fügen, R. Hsiao, S. Ikbal, Q. Jin, F. Kraft, M. Paulik, and M. W. M. Raab, Y.-C. Tam, "The ISL TC-STAR Spring 2006 ASR Evaluation Systems," in TC-Star Workshop on Speech-to-Speech Translation, 2006.
- (2006) TC-Star Workshop on Speech-to-Speech Translation
- Stüker, S.¹ Fügen, C.² Hsiao, R.³ Ikbal, S.⁴ Jin, Q.⁵ Kraft, F.⁶ Paulik, M.⁷ Raab, M.W.M.⁸ Tam, Y.-C.⁹

8
- 0016495091
- Linear Prediction: A Tutorial Review
- J. Makhoul, "Linear Prediction: A Tutorial Review," Proc. of the IEEE, vol. 63, no. 4, pp. 561-580, 1975.
- (1975) Proc. of the IEEE , vol.63 , Issue.4 , pp. 561-580
- Makhoul, J.¹

9
- 44949181081
- Advances in Lecture Recognition: The ISL RT-06S Evaluation System
- C. Fügen, M. Wölfel, J. W. McDonough, S. Ikbal, F. Kraft, K. Laskowski, M. Ostendorf, S. Stüker, and K. Kumatani, "Advances in Lecture Recognition: The ISL RT-06S Evaluation System," in INTERSPEECH, 2006.
- (2006) INTERSPEECH
- Fügen, C.¹ Wölfel, M.² McDonough, J.W.³ Ikbal, S.⁴ Kraft, F.⁵ Laskowski, K.⁶ Ostendorf, M.⁷ Stüker, S.⁸ Kumatani, K.⁹

10
- 0141469852
- Multispeaker Speech Activity Detection for the ICSI Meeting Recorder
- T. Pfau, D. P. W. Ellis, and A. Stolcke, "Multispeaker Speech Activity Detection for the ICSI Meeting Recorder," in Proc. ASRU, 2001.
- (2001) Proc. ASRU
- Pfau, T.¹ Ellis, D.P.W.² Stolcke, A.³

11
- 11144232847
- Speech and Crosstalk Detection in Multichannel Audio
- S. N. Wrigley, G. J. Brown, V. Wan, and S. Renals, "Speech and Crosstalk Detection in Multichannel Audio," IEEE Trans on Speech and Audio Processing, vol. 13, pp. 84-91, 2005.
- (2005) IEEE Trans on Speech and Audio Processing , vol.13 , pp. 84-91
- Wrigley, S.N.¹ Brown, G.J.² Wan, V.³ Renals, S.⁴

12
- 33947615205
- Unsupervised Learning of Overlapped Speech Model Parameters for Multichannel Speech Activity Detection in Meetings
- K. Laskowski and T. Schultz, "Unsupervised Learning of Overlapped Speech Model Parameters for Multichannel Speech Activity Detection in Meetings," in Proc. ICASSP, 2006.
- (2006) Proc. ICASSP
- Laskowski, K.¹ Schultz, T.²

13
- 33947640630
- Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap
- Ö. Çetin and E. Shriberg, "Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap," in Proc. ICASSP, 2006.
- (2006) Proc. ICASSP
- Çetin, O.¹ Shriberg, E.²

14
- 84962868641
- A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment
- H. Soltau, F. Metze, C. Fügen, and A. Waibel, "A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment," in ASRU, 2001.
- (2001) ASRU
- Soltau, H.¹ Metze, F.² Fügen, C.³ Waibel, A.⁴

15
- 4243460174
- Semi-tied covariance matrices
- M. J. F. Gales, "Semi-tied covariance matrices," in ICASSP, 1998.
- (1998) ICASSP
- Gales, M.J.F.¹

16
- 0036294871
- On Maximum Mutual Information Speaker-Adapted Training
- J. McDonough, T. Schaaf, and A. Waibel, "On Maximum Mutual Information Speaker-Adapted Training," in ICASSP, 2002.
- (2002) ICASSP
- McDonough, J.¹ Schaaf, T.² Waibel, A.³

17
- 0032639647
- A Statistical Text-to-Phone Function Using Ngrams and Rules
- W. M. Fisher, "A Statistical Text-to-Phone Function Using Ngrams and Rules," in ICASSP, 1999.
- (1999) ICASSP
- Fisher, W.M.¹

18
- 84891308106
- SRILM - An Extensible Language Modeling Toolkit
- A. Stolcke, "SRILM - An Extensible Language Modeling Toolkit," in ICSLP, 2002.
- (2002) ICSLP
- Stolcke, A.¹

19
- 0003396042
- An Empirical Study of Smoothing Techniques for Language Modeling
- Computer Science Group, Harvard University, Tech. Rep. TR-10-98
- S. F. Chen and J. Goodman, "An Empirical Study of Smoothing Techniques for Language Modeling," Computer Science Group, Harvard University, Tech. Rep. TR-10-98, 1998.
- (1998)
- Chen, S.F.¹ Goodman, J.²

20
- 44949090835
- Getting more Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures
- I. Bulyko, M. Ostendorf, and A. Stolcke, "Getting more Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures," in Proc. HLT-NAACL, 2003.
- (2003) Proc. HLT-NAACL
- Bulyko, I.¹ Ostendorf, M.² Stolcke, A.³

21
- 33745563257
- International Computer Science Institute, Berkeley, CA, USA, Tech. Rep. TR-05-006
- Ö. Çetin and A. Stolcke, "Language Modeling in the ICSI-SRI Spring 2005 Meeting Speech Recognition Evaluation System," International Computer Science Institute, Berkeley, CA, USA, Tech. Rep. TR-05-006, 2005.
- (2005) Language Modeling in the ICSI-SRI Spring 2005 Meeting Speech Recognition Evaluation System
- Çetin, O.¹ Stolcke, A.²

22
- 85009223249
- Techniques for Effective Vocabulary Selection
- A. Venkataraman and W. Wang, "Techniques for Effective Vocabulary Selection," in Proc. Eurospeech, 2003.
- (2003) Proc. Eurospeech
- Venkataraman, A.¹ Wang, W.²

23
- 0003571407
- Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom, Tech. Rep. HCRC/TR-83
- A. W. Black and P. A. Taylor, "The Festival Speech Synthesis System: System documentation," Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom, Tech. Rep. HCRC/TR-83, 1997.
- (1997) The Festival Speech Synthesis System: System documentation
- Black, A.W.¹ Taylor, P.A.²

24
- 0030705337
- Speaker Normalization Based on Frequency Warping
- P. Zhan and M. Westphal, "Speaker Normalization Based on Frequency Warping," in ICASSP, 1997.
- (1997) ICASSP
- Zhan, P.¹ Westphal, M.²

25
- 0003454539
- Cambridge University, Cambridge, United Kingdom, Tech. Rep
- M. J. F. Gales, "Maximum Likelihood Linear Transformations for HMM-based Speech Recognition," Cambridge University, Cambridge, United Kingdom, Tech. Rep., 1997.
- (1997) Maximum Likelihood Linear Transformations for HMM-based Speech Recognition
- Gales, M.J.F.¹

26
- 0029288633
- Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models
- C. J. Leggetter and P. C. Woodland, "Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models," Computer Speech and Language, vol. 9, pp. 171-185, 1995.
- (1995) Computer Speech and Language , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

27
- 33745225118
- The ISL RT04 Mandarin Broadcast News Evaluation System
- H. Yu, Y.-C. Tam, T. Schaaf, S. Stüker, Q. Jin, M. Noamany, and T. Schultz, "The ISL RT04 Mandarin Broadcast News Evaluation System," in EARS Rich Transcription Workshop, 2004.
- (2004) EARS Rich Transcription Workshop
- Yu, H.¹ Tam, Y.-C.² Schaaf, T.³ Stüker, S.⁴ Jin, Q.⁵ Noamany, M.⁶ Schultz, T.⁷

28
- 33646805430
- Alternate Phone Models for Conversational Speech
- L. Lamel and J.-L. Gauvain, "Alternate Phone Models for Conversational Speech," in ICASSP, 2005.
- (2005) ICASSP
- Lamel, L.¹ Gauvain, J.-L.²

29
- 85135271674
- Finding Consensus among Words: Lattice-based Word Error Minimization
- L. Mangu, E. Brill, and A. Stolcke, "Finding Consensus among Words: Lattice-based Word Error Minimization," in EUROSPEECH, 1999.
- (1999) EUROSPEECH
- Mangu, L.¹ Brill, E.² Stolcke, A.³

30
- 44949114262
- Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures
- M. Wölfel, C. Fügen, S. Ikbal, and J. W. McDonough, "Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures," in INTERSPEECH, 2006.
- (2006) INTERSPEECH
- Wölfel, M.¹ Fügen, C.² Ikbal, S.³ McDonough, J.W.⁴

31
- 34250167213
- "CHIL - Computers in the Human Interaction Loop," http://chil.server.de.
- CHIL - Computers in the Human Interaction Loop

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.