SCOPUS Information Search Platform

INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP

Volume 3, 2006, Pages 1229-1232

Advances in lecture recognition: The ISL RT-06S evaluation system

(9) Fügen, Christian a; Wölfel, Matthias a; McDonough, John W. a; Ikbal, Shajith a; Kraft, Florian a; Laskowski, Kornel a; Ostendorf, Mari a,b; Stüker, Sebastian a; Kumatani, Kenichi a

a University of Karlsruhe (Germany)

b University of Washington (United States)

Author keywords

CHIL; Distant speech; Lectures; RT 06S; Speech recognition

Indexed keywords

MICROPHONES;

ACOUSTIC AND LANGUAGE MODELS; CHIL; INTERACTIVE SYSTEM; LECTURE RECOGNITION; LECTURES; NATIONAL INSTITUTE OF STANDARDS AND TECHNOLOGY; RICH TRANSCRIPTIONS; RT-06S;

SPEECH RECOGNITION;

EID: 44949181081 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (15)

References (23)

1
- 33947687235
- Open Domain Speech Recognition & Translation: Lectures and Speeches
- C. Fügen, M. Kolss, D. Bernreuther, M. Paulik, S. Stüker, S. Vogel, and A. Waibel, "Open Domain Speech Recognition & Translation: Lectures and Speeches," in ICASSP, 2006.
- (2006) ICASSP
- Fügen, C.¹ Kolss, M.² Bernreuther, D.³ Paulik, M.⁴ Stüker, S.⁵ Vogel, S.⁶ Waibel, A.⁷

2
- 33947687166
- Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination
- M. Wölfel and J. McDonough, "Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination," in INTERSPEECH, 2005.
- (2005) INTERSPEECH
- Wölfel, M.¹ McDonough, J.²

3
- 84887145372
- Issues in Meeting Transcription - The ISL Meeting Transcription System
- F. Metze, Q. Jin, C. Fügen, K. Laskowski, Y. Pan, and T. Schultz, "Issues in Meeting Transcription - The ISL Meeting Transcription System," in ICSLP, 2004.
- (2004) ICSLP
- Metze, F.¹ Jin, Q.² Fügen, C.³ Laskowski, K.⁴ Pan, Y.⁵ Schultz, T.⁶

4
- 34250692938
- "CHIL - Computers in the Human Interaction Loop," http://chil.server.de.
- CHIL - Computers in the Human Interaction Loop

5
- 44949186919
- Minimum Variance Distortionless Response Spectral Estimation Review and Refinements
- September
- M. Wölfel and J. McDonough, "Minimum Variance Distortionless Response Spectral Estimation Review and Refinements," IEEE Signal Processing Magazine, September 2005.
- (2005) IEEE Signal Processing Magazine
- Wölfel, M.¹ McDonough, J.²

6
- 33745225118
- The ISL RT04 Mandarin Broadcast News Evaluation System
- November
- H. Yu, Y.-C. Tam, T. Schaaf, S. Stüker, Q. Jin, M. Noamany, and T. Schultz, "The ISL RT04 Mandarin Broadcast News Evaluation System," in EARS Rich Transcription Workshop, November 2004.
- (2004) EARS Rich Transcription Workshop
- Yu, H.¹ Tam, Y.-C.² Schaaf, T.³ Stüker, S.⁴ Jin, Q.⁵ Noamany, M.⁶ Schultz, T.⁷

7
- 33646805430
- Alternate Phone Models for Conversational Speech
- L. Lamel and J.-L. Gauvain, "Alternate Phone Models for Conversational Speech," in ICASSP, 2005.
- (2005) ICASSP
- Lamel, L.¹ Gauvain, J.-L.²

8
- 85009080849
- Speaker Segmentation and Clustering in Meetings
- Q. Jin and T. Schultz, "Speaker Segmentation and Clustering in Meetings," in ICSLP, 2004.
- (2004) ICSLP
- Jin, Q.¹ Schultz, T.²

9
- 85022115603
- Linguistic data consortium
- "Linguistic data consortium," http://www.ldc.upenn.edu.

10
- 0016495091
- Linear prediction: A tutorial review
- J. Makhoul, "Linear prediction: A tutorial review," in Proc. of the IEEE, 1975, vol. 63(4), pp. 561-580.
- (1975) Proc. of the IEEE , vol.63 , Issue.4 , pp. 561-580
- Makhoul, J.¹

11
- 84962868641
- A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment
- H. Soltau, F. Metze, C. Fügen, and A. Waibel, "A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment," in ASRU, 2001.
- (2001) ASRU
- Soltau, H.¹ Metze, F.² Fügen, C.³ Waibel, A.⁴

12
- 4243460174
- Semi-tied covariance matrices
- M. J. F. Gales, "Semi-tied covariance matrices," in ICASSP, 1998.
- (1998) ICASSP
- Gales, M.J.F.¹

13
- 0036294871
- On Maximum Mutual Information Speaker-Adapted Training
- J. McDonough, T. Schaaf, and A. Waibel, "On Maximum Mutual Information Speaker-Adapted Training," in ICASSP, 2002.
- (2002) ICASSP
- McDonough, J.¹ Schaaf, T.² Waibel, A.³

14
- 0032639647
- A Statistical Text-to-Phone Function Using Ngrams and Rules
- W. M. Fisher, "A Statistical Text-to-Phone Function Using Ngrams and Rules," in ICASSP, 1999.
- (1999) ICASSP
- Fisher, W.M.¹

15
- 85022109131
- Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures
- I. Bulyko, M. Ostendorf, and A. Stolcke, "Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures," in Proc. HLT-NAACL, 2003, vol. Comp., pp. 7-9.

16
- 84891308106
- SRILM - An Extensible Language Modeling Toolkit
- A. Stolcke, "SRILM - An Extensible Language Modeling Toolkit," in ICSLP, 2002.
- (2002) ICSLP
- Stolcke, A.¹

17
- 0003396042
- An Empirical Study of Smoothing Techniques for Language Modeling
- Tech. Rep. TR-10-98, Computer Science Group, Harvard University
- S. F. Chen and J. Goodman, "An Empirical Study of Smoothing Techniques for Language Modeling," Tech. Rep. TR-10-98, Computer Science Group, Harvard University, 1998.
- (1998)
- Chen, S.F.¹ Goodman, J.²

18
- 0003571407
- The Festival Speech Synthesis System: System documentation
- Tech. Rep. HCRC/TR-83, Human Communication Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kingdom
- A. W. Black and P. A. Taylor, "The Festival Speech Synthesis System: System documentation," Tech. Rep. HCRC/TR-83, Human Communication Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kingdom, 1997.
- (1997)
- Black, A.W.¹ Taylor, P.A.²

19
- 0030705337
- Speaker Normalization Based on Frequency Warping
- P. Zhan and M. Westphal, "Speaker Normalization Based on Frequency Warping," in ICASSP, 1997.
- (1997) ICASSP
- Zhan, P.¹ Westphal, M.²

20
- 0003454539
- Maximum Likelihood Linear Transformations for HMM-based Speech Recognition
- Tech. Rep., Cambridge University, Cambridge, United Kingdom
- M. J. F. Gales, "Maximum Likelihood Linear Transformations for HMM-based Speech Recognition," Tech. Rep., Cambridge University, Cambridge, United Kingdom, 1997.
- (1997)
- Gales, M.J.F.¹

21
- 0029288633
- Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models
- C. J. Leggetter and P. C. Woodland, "Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models," Computer Speech and Language, vol. 9, pp. 171-185, 1995.
- (1995) Computer Speech and Language , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

22
- 85135271674
- Finding Consensus among Words: Lattice-based Word Error Minimization
- L. Mangu, E. Brill, and A. Stolcke, "Finding Consensus among Words: Lattice-based Word Error Minimization," in EUROSPEECH, 1999.
- (1999) EUROSPEECH
- Mangu, L.¹ Brill, E.² Stolcke, A.³

23
- 44949114262
- Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures
- M. Wölfel, C. Fügen, S. Ikbal, and J. W. McDonough, "Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures," in INTERSPEECH, 2006.
- (2006) INTERSPEECH
- Wölfel, M.¹ Fügen, C.² Ikbal, S.³ McDonough, J.W.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.