-
1
-
-
33947687235
-
Open Domain Speech Recognition & Translation: Lectures and Speeches
-
C. Fügen, M. Kolss, D. Bernreuther, M. Paulik, S. Stüker, S. Vogel, and A. Waibel, "Open Domain Speech Recognition & Translation: Lectures and Speeches," in ICASSP, 2006.
-
(2006)
ICASSP
-
-
Fügen, C.1
Kolss, M.2
Bernreuther, D.3
Paulik, M.4
Stüker, S.5
Vogel, S.6
Waibel, A.7
-
2
-
-
33947687166
-
Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination
-
M. Wölfel and J. McDonough, "Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination," in INTERSPEECH, 2005.
-
(2005)
INTERSPEECH
-
-
Wölfel, M.1
McDonough, J.2
-
3
-
-
84887145372
-
Issues in Meeting Transcription -The ISL Meeting Transcription System
-
F. Metze, Q. Jin, C. Fügen, K. Laskowski, Y. Pan, and T. Schultz, "Issues in Meeting Transcription -The ISL Meeting Transcription System," in ICSLP, 2004.
-
(2004)
ICSLP
-
-
Metze, F.1
Jin, Q.2
Fügen, C.3
Laskowski, K.4
Pan, Y.5
Schultz, T.6
-
5
-
-
44949186919
-
Minimum Variance Distortionless Response Spectral Estimation Review and Refinements
-
September
-
M. Wölfel and J. McDonough, "Minimum Variance Distortionless Response Spectral Estimation Review and Refinements," IEEE Signal Processing Magazine, September 2005.
-
(2005)
IEEE Signal Processing Magazine
-
-
Wölfel, M.1
McDonough, J.2
-
6
-
-
33745225118
-
The ISL RT04 Mandarin Broadcast News Evaluation System
-
November
-
H. Yu, Y.-C. Tam, T. Schaaf, S. Stüker, Q. Jin, M. Noamany, and T. Schultz, "The ISL RT04 Mandarin Broadcast News Evaluation System," in EARS Rich Transcription Workshop, November 2004.
-
(2004)
EARS Rich Transcription Workshop
-
-
Yu, H.1
Tam, Y.-C.2
Schaaf, T.3
Stüker, S.4
Jin, Q.5
Noamany, M.6
Schultz, T.7
-
7
-
-
33646805430
-
Alternate Phone Models for Conversational Speech
-
L. Lamel and J.-L. Gauvain, "Alternate Phone Models for Conversational Speech," in ICASSP, 2005.
-
(2005)
ICASSP
-
-
Lamel, L.1
Gauvain, J.-L.2
-
8
-
-
85009080849
-
Speaker Segmentation and Clustering in Meetings
-
Q. Jin and T. Schultz, "Speaker Segmentation and Clustering in Meetings," in ICSLP, 2004.
-
(2004)
ICSLP
-
-
Jin, Q.1
Schultz, T.2
-
9
-
-
85022115603
-
-
Linguistic data consortium
-
"Linguistic data consortium," http://www.ldc.upenn.edu.
-
-
-
-
10
-
-
0016495091
-
Linear prediction: A tutorial review
-
J. Makhoul, "Linear prediction: A tutorial review," in Proc. of the IEEE, 1975, vol. 63(4), pp. 561-580.
-
(1975)
Proc. of the IEEE
, vol.63
, Issue.4
, pp. 561-580
-
-
Makhoul, J.1
-
11
-
-
84962868641
-
A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment
-
H. Soltau, F. Metze, C. Fügen, and A. Waibel, "A One Pass-Decoder Based on Polymorphic Linguistic Context Assignment," in ASRU, 2001.
-
(2001)
ASRU
-
-
Soltau, H.1
Metze, F.2
Fügen, C.3
Waibel, A.4
-
12
-
-
4243460174
-
Semi-tied covariance matrices
-
M. J. F. Gales, "Semi-tied covariance matrices," in ICASSP, 1998.
-
(1998)
ICASSP
-
-
Gales, M.J.F.1
-
13
-
-
0036294871
-
On Maximum Mutual Information Speaker-Adapted Training
-
J. McDonough, T. Schaaf, and A. Waibel, "On Maximum Mutual Information Speaker-Adapted Training," in ICASSP, 2002.
-
(2002)
ICASSP
-
-
McDonough, J.1
Schaaf, T.2
Waibel, A.3
-
14
-
-
0032639647
-
A Statistical Text-to-Phone Function Using Ngrams and Rules
-
W. M. Fisher, "A Statistical Text-to-Phone Function Using Ngrams and Rules," in ICASSP, 1999.
-
(1999)
ICASSP
-
-
Fisher, W.M.1
-
15
-
-
85022109131
-
-
I. Bulyko, M. Ostendorf, and A. Stolcke, Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures, in Proc. HLT-NAACL, 2003, Comp., pp. 7-9.
-
I. Bulyko, M. Ostendorf, and A. Stolcke, "Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures," in Proc. HLT-NAACL, 2003, vol. Comp., pp. 7-9.
-
-
-
-
16
-
-
84891308106
-
SRILM - An Extensible Language Modeling Toolkit
-
A. Stoicke, "SRILM - An Extensible Language Modeling Toolkit," in ICSLP, 2002.
-
(2002)
ICSLP
-
-
Stoicke, A.1
-
17
-
-
0003396042
-
An Empirical Study of Smoothing Techniques for Language Modeling,
-
Tech. Rep. TR-10-98, Computer Science Group, Harvard University
-
S. F Chen and J. Goodman, "An Empirical Study of Smoothing Techniques for Language Modeling," Tech. Rep. TR-10-98, Computer Science Group, Harvard University, 1998.
-
(1998)
-
-
Chen, S.F.1
Goodman, J.2
-
18
-
-
0003571407
-
The Festival Speech Synthesis System: System documentation,
-
Tech. Rep. HCRC/TR-83, Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom
-
A. W. Black and P. A. Taylor, "The Festival Speech Synthesis System: System documentation," Tech. Rep. HCRC/TR-83, Human Communciation Research Centre, University of Edinburgh, Edinburgh, Scotland, United Kongdom, 1997.
-
(1997)
-
-
Black, A.W.1
Taylor, P.A.2
-
19
-
-
0030705337
-
Speaker Normalization Based on Frequency Warping
-
P. Zhan and M. Westphal, "Speaker Normalization Based on Frequency Warping," in ICASSP, 1997.
-
(1997)
ICASSP
-
-
Zhan, P.1
Westphal, M.2
-
20
-
-
0003454539
-
Maximum Likelihood Linear Transformations for HMM-based Speech Recognition,
-
Tech. Rep, Cambridge University, Cambridge, United Kingdom
-
M. J. F. Gales, "Maximum Likelihood Linear Transformations for HMM-based Speech Recognition," Tech. Rep., Cambridge University, Cambridge, United Kingdom, 1997.
-
(1997)
-
-
Gales, M.J.F.1
-
21
-
-
0029288633
-
Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models
-
C. J. Leggetter and P. C. Woodland, "Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models," Computer Speech and Language, vol. 9, pp. 171-185, 1995.
-
(1995)
Computer Speech and Language
, vol.9
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
22
-
-
85135271674
-
Finding Consensus among Words: Lattice-based Word Error Minimization
-
L. Mangu, E. Brill, and A. Stolcke, "Finding Consensus among Words: Lattice-based Word Error Minimization," in EUROSPEECH, 1999.
-
(1999)
EUROSPEECH
-
-
Mangu, L.1
Brill, E.2
Stolcke, A.3
-
23
-
-
44949114262
-
Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures
-
M. Wölfel, C. Fügen, S. Ikbal, and J. W. McDonough, "Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures," in INTERSPEECH, 2006.
-
(2006)
INTERSPEECH
-
-
Wölfel, M.1
Fügen, C.2
Ikbal, S.3
McDonough, J.W.4
|