-
1
-
-
33947687235
-
Open Domain Speech Recognition & Translation: Lectures and Speeches
-
C. Fügen, M. Kolss, D. Bernreuther, M. Paulik, S. Stüker, S. Vogel, and A. Waibel, "Open Domain Speech Recognition & Translation: Lectures and Speeches," in ICASSP, 2006.
-
(2006)
ICASSP
-
-
Fügen, C.1
Kolss, M.2
Bernreuther, D.3
Paulik, M.4
Stüker, S.5
Vogel, S.6
Waibel, A.7
-
2
-
-
33947687166
-
Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination
-
M. Wölfel and J. McDonough, "Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination," in INTERSPEECH, 2005.
-
(2005)
INTERSPEECH
-
-
Wölfel, M.1
McDonough, J.2
-
3
-
-
84887145372
-
Issuesin Meeting Transcription - The ISL Meeting Transcription System
-
F. Metze, Q. Jin, C. Fügen, K. Laskowski, Y. Pan, and T. Schultz, "Issuesin Meeting Transcription - The ISL Meeting Transcription System," in ICSLP, 2004.
-
(2004)
ICSLP
-
-
Metze, F.1
Jin, Q.2
Fügen, C.3
Laskowski, K.4
Pan, Y.5
Schultz, T.6
-
4
-
-
44949186919
-
Minimum Variance Distortionless Response Spectral Estimation Review and Refinements
-
September
-
M. Wölfel and J. McDonough, "Minimum Variance Distortionless Response Spectral Estimation Review and Refinements," IEEE Signal Processing Magazine, September 2005.
-
(2005)
IEEE Signal Processing Magazine
-
-
Wölfel, M.1
McDonough, J.2
-
5
-
-
44849122416
-
Cross-System Adaptation and Combination for Continuous Speech Recognition: The Influence of Phoneme Set and Acoustic Front-End
-
S. Stüker, C. Fügen, S. Burger, and M. Wölfel, "Cross-System Adaptation and Combination for Continuous Speech Recognition: The Influence of Phoneme Set and Acoustic Front-End," in INTERSPEECH, 2006.
-
(2006)
INTERSPEECH
-
-
Stüker, S.1
Fügen, C.2
Burger, S.3
Wölfel, M.4
-
6
-
-
85009080849
-
Speaker Segmentation and Clustering in Meetings
-
Q. Jin and T. Schultz, "Speaker Segmentation and Clustering in Meetings," in ICSLP, 2004.
-
(2004)
ICSLP
-
-
Jin, Q.1
Schultz, T.2
-
7
-
-
44849131194
-
The ISL TC-STAR Spring 2006 ASR Evaluation Systems
-
S. Stüker, C. Fügen, R. Hsiao, S. Ikbal, Q. Jin, F. Kraft, M. Paulik, and M. W. M. Raab, Y.-C. Tam, "The ISL TC-STAR Spring 2006 ASR Evaluation Systems," in TC-Star Workshop on Speech-to-Speech Translation, 2006.
-
(2006)
TC-Star Workshop on Speech-to-Speech Translation
-
-
Stüker, S.1
Fügen, C.2
Hsiao, R.3
Ikbal, S.4
Jin, Q.5
Kraft, F.6
Paulik, M.7
Raab, M.W.M.8
Tam, Y.-C.9
-
8
-
-
0016495091
-
Linear Prediction: A Tutorial Review
-
J. Makhoul, "Linear Prediction: A Tutorial Review," Proc. of the IEEE, vol. 63, no. 4, pp. 561-580, 1975.
-
(1975)
Proc. of the IEEE
, vol.63
, Issue.4
, pp. 561-580
-
-
Makhoul, J.1
-
9
-
-
44949181081
-
Advances in Lecture Recognition: The ISL RT-06S Evaluation System
-
C. Fügen, M. Wölfel, J. W. McDonough, S. Ikbal, F. Kraft, K. Laskowski, M. Ostendorf, S. Stüker, and K. Kumatani, "Advances in Lecture Recognition: The ISL RT-06S Evaluation System," in INTERSPEECH, 2006.
-
(2006)
INTERSPEECH
-
-
Fügen, C.1
Wölfel, M.2
McDonough, J.W.3
Ikbal, S.4
Kraft, F.5
Laskowski, K.6
Ostendorf, M.7
Stüker, S.8
Kumatani, K.9
-
10
-
-
0141469852
-
Multispeaker Speech Activity Detection for the ICSI Meeting Recorder
-
T. Pfau, D. P. W. Ellis, and A. Stolcke, "Multispeaker Speech Activity Detection for the ICSI Meeting Recorder," in Proc. ASRU, 2001.
-
(2001)
Proc. ASRU
-
-
Pfau, T.1
Ellis, D.P.W.2
Stolcke, A.3
-
11
-
-
11144232847
-
Speech and Crosstalk Detection in Multichannel Audio
-
S. N. Wrigley, G. J. Brown, V. Wan, and S. Renals, "Speech and Crosstalk Detection in Multichannel Audio," IEEE Trans on Speech and Audio Processing, vol. 13, pp. 84-91, 2005.
-
(2005)
IEEE Trans on Speech and Audio Processing
, vol.13
, pp. 84-91
-
-
Wrigley, S.N.1
Brown, G.J.2
Wan, V.3
Renals, S.4
-
12
-
-
33947615205
-
Unsupervised Learning of Overlapped Speech Model Parameters for Multichannel Speech Activity Detection in Meetings
-
K. Laskowski and T. Schultz, "Unsupervised Learning of Overlapped Speech Model Parameters for Multichannel Speech Activity Detection in Meetings," in Proc. ICASSP, 2006.
-
(2006)
Proc. ICASSP
-
-
Laskowski, K.1
Schultz, T.2
-
13
-
-
33947640630
-
Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap
-
Ö. Çetin and E. Shriberg, "Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap," in Proc. ICASSP, 2006.
-
(2006)
Proc. ICASSP
-
-
Çetin, O.1
Shriberg, E.2
-
14
-
-
84962868641
-
A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment
-
H. Soltau, F. Metze, C. Fügen, and A. Waibel, "A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment," in ASRU, 2001.
-
(2001)
ASRU
-
-
Soltau, H.1
Metze, F.2
Fügen, C.3
Waibel, A.4
-
15
-
-
4243460174
-
Semi-tied covariance matrices
-
M. J. F. Gales, "Semi-tied covariance matrices," in ICASSP, 1998.
-
(1998)
ICASSP
-
-
Gales, M.J.F.1
-
16
-
-
0036294871
-
On Maximum Mutual Information Speaker-Adapted Training
-
J. McDonough, T. Schaaf, and A. Waibel, "On Maximum Mutual Information Speaker-Adapted Training," in ICASSP, 2002.
-
(2002)
ICASSP
-
-
McDonough, J.1
Schaaf, T.2
Waibel, A.3
-
17
-
-
0032639647
-
A Statistical Text-to-Phone Function Using Ngrams and Rules
-
W. M. Fisher, "A Statistical Text-to-Phone Function Using Ngrams and Rules," in ICASSP, 1999.
-
(1999)
ICASSP
-
-
Fisher, W.M.1
-
18
-
-
84891308106
-
SRILM - An Extensible Language Modeling Toolkit
-
A. Stolcke, "SRILM - An Extensible Language Modeling Toolkit," in ICSLP, 2002.
-
(2002)
ICSLP
-
-
Stolcke, A.1
-
19
-
-
0003396042
-
An Empirical Study of Smoothing Techniques for Language Modeling
-
Computer Science Group, Harvard University, Tech. Rep. TR-10-98
-
S. F. Chen and J. Goodman, "An Empirical Study of Smoothing Techniques for Language Modeling," Computer Science Group, Harvard University, Tech. Rep. TR-10-98, 1998.
-
(1998)
-
-
Chen, S.F.1
Goodman, J.2
-
20
-
-
44949090835
-
Getting more Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures
-
I. Bulyko, M. Ostendorf, and A. Stolcke, "Getting more Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures," in Proc. HLT-NAACL, 2003.
-
(2003)
Proc. HLT-NAACL
-
-
Bulyko, I.1
Ostendorf, M.2
Stolcke, A.3
-
21
-
-
33745563257
-
-
International Computer Science Institute, Berkeley, CA, USA, Tech. Rep. TR-05-006
-
Ö. Çetin and A. Stolcke, "Language Modeling in the ICSI-SRI Spring 2005 Meeting Speech Recognition Evaluation System," International Computer Science Institute, Berkeley, CA, USA, Tech. Rep. TR-05-006, 2005.
-
(2005)
Language Modeling in the ICSI-SRI Spring 2005 Meeting Speech Recognition Evaluation System
-
-
Çetin, O.1
Stolcke, A.2
-
22
-
-
85009223249
-
Techniques for Effective Vocabulary Selection
-
A. Venkataraman and W. Wang, "Techniques for Effective Vocabulary Selection," in Proc. Eurospeech, 2003.
-
(2003)
Proc. Eurospeech
-
-
Venkataraman, A.1
Wang, W.2
-
23
-
-
0003571407
-
-
Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom, Tech. Rep. HCRC/TR-83
-
A. W. Black and P. A. Taylor, "The Festival Speech Synthesis System: System documentation," Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom, Tech. Rep. HCRC/TR-83, 1997.
-
(1997)
The Festival Speech Synthesis System: System documentation
-
-
Black, A.W.1
Taylor, P.A.2
-
24
-
-
0030705337
-
Speaker Normalization Based on Frequency Warping
-
P. Zhan and M. Westphal, "Speaker Normalization Based on Frequency Warping," in ICASSP, 1997.
-
(1997)
ICASSP
-
-
Zhan, P.1
Westphal, M.2
-
25
-
-
0003454539
-
-
Cambridge University, Cambridge, United Kingdom, Tech. Rep
-
M. J. F. Gales, "Maximum Likelihood Linear Transformations for HMM-based Speech Recognition," Cambridge University, Cambridge, United Kingdom, Tech. Rep., 1997.
-
(1997)
Maximum Likelihood Linear Transformations for HMM-based Speech Recognition
-
-
Gales, M.J.F.1
-
26
-
-
0029288633
-
Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models
-
C. J. Leggetter and P. C. Woodland, "Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models," Computer Speech and Language, vol. 9, pp. 171-185, 1995.
-
(1995)
Computer Speech and Language
, vol.9
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
27
-
-
33745225118
-
The ISL RT04 Mandarin Broadcast News Evaluation System
-
H. Yu, Y.-C. Tam, T. Schaaf, S. Stüker, Q. Jin, M. Noamany, and T. Schultz, "The ISL RT04 Mandarin Broadcast News Evaluation System," in EARS Rich Transcription Workshop, 2004.
-
(2004)
EARS Rich Transcription Workshop
-
-
Yu, H.1
Tam, Y.-C.2
Schaaf, T.3
Stüker, S.4
Jin, Q.5
Noamany, M.6
Schultz, T.7
-
28
-
-
33646805430
-
Alternate Phone Models for Conversational Speech
-
L. Lamel and J.-L. Gauvain, "Alternate Phone Models for Conversational Speech," in ICASSP, 2005.
-
(2005)
ICASSP
-
-
Lamel, L.1
Gauvain, J.-L.2
-
29
-
-
85135271674
-
Finding Consensus among Words: Lattice-based Word Error Minimization
-
L. Mangu, E. Brill, and A. Stolcke, "Finding Consensus among Words: Lattice-based Word Error Minimization," in EUROSPEECH, 1999.
-
(1999)
EUROSPEECH
-
-
Mangu, L.1
Brill, E.2
Stolcke, A.3
-
30
-
-
44949114262
-
Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures
-
M. Wölfel, C. Fügen, S. Ikbal, and J. W. McDonough, "Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures," in INTERSPEECH, 2006.
-
(2006)
INTERSPEECH
-
-
Wölfel, M.1
Fügen, C.2
Ikbal, S.3
McDonough, J.W.4
|