-
1
-
-
85055771355
-
-
http://coretex.itc.it
-
-
-
-
2
-
-
0342321463
-
The THISL Broadcast News Retrieval System
-
14-19, Cambridge, U.K
-
D. Abberley, D. Kirby, S. Renais and T. Robinson, The THISL Broadcast News Retrieval System, Proc. ESCA ETRW on Accessing Information in Spoken Audio, 14-19, Cambridge, U.K., April 1999.
-
(1999)
Proc. ESCA ETRW on Accessing Information in Spoken Audio
-
-
Abberley, D.1
Kirby, D.2
Renais, S.3
Robinson, T.4
-
4
-
-
0030362995
-
A Compact Model for Speaker Adaptation Training
-
Philadelphia, PA
-
T. Anastasakos, J. McDonough, R. Schwartz and J. Makhoul, A Compact Model for Speaker Adaptation Training, Proc. ICSLP’96, 1137-1140, Philadelphia, PA, October 1996.
-
(1996)
Proc. ICSLP’96
, pp. 1137-1140
-
-
Anastasakos, T.1
McDonough, J.2
Schwartz, R.3
Makhoul, J.4
-
5
-
-
85009118347
-
One Pass Cross Word Decoding for Large Vocabularies Based on a Lexical Tree Search Organization
-
Budapest, Hungary
-
X. Aubert, One Pass Cross Word Decoding for Large Vocabularies Based on a Lexical Tree Search Organization, Proc. ESCA Eurospeech’99, 4:1559-1562, Budapest, Hungary, September 1999.
-
(1999)
Proc. ESCA Eurospeech’99
, vol.4
, pp. 1559-1562
-
-
Aubert, X.1
-
6
-
-
0026382117
-
The Forward-Backward Search Strategy for Real-Time Speech Recognition
-
Toronto, Canada
-
S. Austin, R. Schwartz and P. Placeway, The Forward-Backward Search Strategy for Real-Time Speech Recognition, Proc. IEEE ICASSP-91, 697-700, Toronto, Canada, May 1991.
-
(1991)
Proc. IEEE ICASSP-91
, pp. 697-700
-
-
Austin, S.1
Schwartz, R.2
Placeway, P.3
-
7
-
-
85006228776
-
Preliminary results on the performance of a system for the automatic recognition of continuous speech
-
Philadelphia, PA
-
L.R. Bahl, J.K. Baker, P.S. Cohen, N.R. Dixon, F. Jelinek, R.L. Mercer and H.F. Silverman, Preliminary results on the performance of a system for the automatic recognition of continuous speech, Proc. IEEE ICASSP-76, Philadelphia, PA, April 1976.
-
(1976)
Proc. IEEE ICASSP-76
-
-
Bahl, L.R.1
Baker, J.K.2
Cohen, P.S.3
Dixon, N.R.4
Jelinek, F.5
Mercer, R.L.6
Silverman, H.F.7
-
8
-
-
0023725866
-
Acoustic Markov Models used in the Tangora Speech Recognition System
-
New York, NY
-
L.R. Bahl, P. Brown, P. de Souza, R.L. Mercer and M. Picheny, Acoustic Markov Models used in the Tangora Speech Recognition System, Proc. IEEE ICASSP-88 1:497-500, New York, NY, April 1988.
-
(1988)
Proc. IEEE ICASSP-88 1
, pp. 497-500
-
-
Bahl, L.R.1
Brown, P.2
De Souza, P.3
Mercer, R.L.4
Picheny, M.5
-
9
-
-
0020719320
-
A Maximum Likelihood Approach to Continuous Speech Recognition
-
L.R. Bahl, F. Jelinek and R.L. Mercer, A Maximum Likelihood Approach to Continuous Speech Recognition, IEEE Trans. Pattern Analysis &Machine Intelligence, PAMI-5(2):179-190, March 1983.
-
(1983)
IEEE Trans. Pattern Analysis &Machine Intelligence
, vol.PAMI-5
, Issue.2
, pp. 179-190
-
-
Bahl, L.R.1
Jelinek, F.2
Mercer, R.L.3
-
10
-
-
77949374939
-
A Fast Match for Continuous Speech Recognition Using Allophonic Models
-
San Francisco, CA
-
L.R. Bahl, P.V. de Souza, P.S. Gopalakrishnan, D. Nahamoo and M. Picheny, A Fast Match for Continuous Speech Recognition Using Allophonic Models, Proc. IEEE ICASSP-92, CA, 1:17-21, San Francisco, CA, March 1992.
-
(1992)
Proc. IEEE ICASSP-92
, vol.1
, pp. 17-21
-
-
Bahl, L.R.1
De Souza, P.V.2
Gopalakrishnan, P.S.3
Nahamoo, D.4
Picheny, M.5
-
11
-
-
33646933249
-
Large Vocabulary Recognition ofWall Street Journal Sentences at Dragon Systems
-
Harriman, NY
-
J. Baker, J. Baker, P. Bamberg, K. Bishop, L. Gillick, V. Helman, Z. Huang, Y. Ito, S. Lowe, B. Peskin, R. Roth and F. Scattone, Large Vocabulary Recognition ofWall Street Journal Sentences at Dragon Systems, Proc. DARPA Speech &Natural Language Workshop, 387-392, Harriman, NY, February 1992.
-
(1992)
Proc. DARPA Speech &Natural Language Workshop
, pp. 387-392
-
-
Baker, J.1
Baker, J.2
Bamberg, P.3
Bishop, K.4
Gillick, L.5
Helman, V.6
Huang, Z.7
Ito, Y.8
Lowe, S.9
Peskin, B.10
Roth, R.11
Scattone, F.12
-
12
-
-
0000353178
-
A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
-
Baum, L.E., T. Petrie, G. Soules, and N. Weiss, A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains, Ann. Math. Stat. 41:164-171,1970.
-
(1970)
Ann. Math. Stat.
, vol.41
, pp. 164-171
-
-
Baum, L.E.1
Petrie, T.2
Soules, G.3
Weiss, N.4
-
13
-
-
0027297381
-
Vector quantization for efficient computation of continuous density likelihoods
-
Minneapolis, MN
-
E. Bocchieri, Vector quantization for efficient computation of continuous density likelihoods, Proc. IEEE ICASSP-93, 2:692-695, Minneapolis, MN, May 1993.
-
(1993)
Proc. IEEE ICASSP-93
, vol.2
, pp. 692-695
-
-
Bocchieri, E.1
-
14
-
-
0033693013
-
A Baseline for the Transcription ofItalian Broadcast News
-
Istanbul, Turkey
-
F. Brugnara, M. Cettolo, M. Federico and D. Giuliani, A Baseline for the Transcription ofItalian Broadcast News, Proc. IEEE ICASSP-00, Istanbul, Turkey, June 2000.
-
(2000)
Proc. IEEE ICASSP-00
-
-
Brugnara, F.1
Cettolo, M.2
Federico, M.3
Giuliani, D.4
-
15
-
-
85135168075
-
Word and acoustic confidence annotation for large vocabulary speech recognition
-
Rhodes, Greece
-
L. Chase, Word and acoustic confidence annotation for large vocabulary speech recognition, Proc. ESCA Eurospeech’97, 815-818, Rhodes, Greece, September 1997.
-
(1997)
Proc. ESCA Eurospeech’97
, pp. 815-818
-
-
Chase, L.1
-
16
-
-
33646916912
-
Improvements in Language, Lexical and Phonetic Modeling in Sphinx-II
-
Austin, TX
-
L. Chase, R. Rosenberg, A. Hauptmann, M. Ravishankar, E. Thayer, P. Placeway, R. Weide and C. Lu, Improvements in Language, Lexical and Phonetic Modeling in Sphinx-II, Proc. ARPA Spoken Language Systems Technology Workshop, 60-65, Austin, TX, January 1995.
-
(1995)
Proc. ARPA Spoken Language Systems Technology Workshop
, pp. 60-65
-
-
Chase, L.1
Rosenberg, R.2
Hauptmann, A.3
Ravishankar, M.4
Thayer, E.5
Placeway, P.6
Weide, R.7
Lu, C.8
-
17
-
-
0033329799
-
An empirical study of smoothing techniques for language modeling
-
S.F. Chen and J. Goodman, An empirical study of smoothing techniques for language modeling, Computer, Speech &Language, 13(4):359-394, October 1999.
-
(1999)
Computer, Speech &Language
, vol.13
, Issue.4
, pp. 359-394
-
-
Chen, S.F.1
Goodman, J.2
-
18
-
-
0002595416
-
Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion
-
Landsdowne, VA
-
S.S. Chen and P.S. Gopalakrishnan, Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion, Proc. Darpa Broadcast News Transcription &Understanding Workshop, 127-132, Landsdowne, VA, February 1998.
-
(1998)
Proc. D
, pp. 127-132
-
-
Chen, S.S.1
Gopalakrishnan, P.S.2
-
19
-
-
0022859679
-
The Role of Word-Dependent Coartic-ulatory Effects in a Phoneme-Based Speech Recognition System
-
Tokyo, Japan
-
Y.L. Chow, R. Schwartz, S. Roukos, O. Kimball, P Price, F. Kubala, M. O. Dunham, M Krasner and J Makhoul, The Role of Word-Dependent Coartic-ulatory Effects in a Phoneme-Based Speech Recognition System, Proc. IEEE ICASSP-86, 3:1593-1596, Tokyo, Japan, April 1986.
-
(1986)
Proc. IEEE ICASSP-86
, vol.3
, pp. 1593-1596
-
-
Chow, Y.L.1
Schwartz, R.2
Roukos, S.3
Kimball, O.4
Price, P.5
Kubala, F.6
Dunham, M.O.7
Krasner, M.8
Makhoul, J.9
-
20
-
-
85118743743
-
Statistical Language Modelling using CMU-Cambridge Toolkit
-
Rhodes, Greece
-
P Clarkson and R. Rosenfeld, Statistical Language Modelling using CMU-Cambridge Toolkit, Proc. ESCA EuroSpeech’97, 2707-2710, Rhodes, Greece, September 1997
-
(1997)
Proc. ESCA EuroSpeech’97
, pp. 2707-2710
-
-
Clarkson, P.1
Rosenfeld, R.2
-
21
-
-
0019053271
-
Comparison of Parametric Representations of Monosyllabic Word Recognition in Continuously Spoken Sentences
-
S. Davis and P Mermelstein, Comparison of Parametric Representations of Monosyllabic Word Recognition in Continuously Spoken Sentences, IEEE Trans. Acoustics, Speech, &Signal Processing, 28(4):357-366,1980.
-
(1980)
IEEE Trans. Acoustics, Speech, &Signal Processing
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.1
Mermelstein, P.2
-
22
-
-
0002629270
-
Maximum Likelihood from Incomplete Data via the EM Algorithm
-
Dempster, A.P., M.M. Laird and D.B. Rubin, Maximum Likelihood from Incomplete Data via the EM Algorithm, Journal of the Royal Statistical Society Series B (methodological), 39:1-38,1977.
-
(1977)
Journal of the Royal Statistical Society Series B (Methodological)
, vol.39
, pp. 1-38
-
-
Dempster, A.P.1
Laird, M.M.2
Rubin, D.B.3
-
23
-
-
30244578066
-
Human Speech Recognition Performance on the 1995 CSR Hub-3 Corpus
-
Harriman, NY
-
N. Deshmukh, A. Ganapathiraju, R.J. Duncan and J. Picone, Human Speech Recognition Performance on the 1995 CSR Hub-3 Corpus Proc. ARPA Speech Recognition Workshop, 129-134, Harriman, NY, February 1996.
-
(1996)
Proc. ARPA Speech Recognition Workshop
, pp. 129-134
-
-
Deshmukh, N.1
Ganapathiraju, A.2
Duncan, R.J.3
Picone, J.4
-
24
-
-
33646934294
-
Genones: Optimization the Degree of Tying in a Large Vocabulary HMM-based Speech Recognizer
-
Adelaide, Australia
-
V. Digalakis and H. Murveit, Genones: Optimization the Degree of Tying in a Large Vocabulary HMM-based Speech Recognizer, Proc. IEEE ICASSP-94, 1:537-540, Adelaide, Australia, April 1994.
-
(1994)
Proc. IEEE ICASSP-94
, vol.1
, pp. 537-540
-
-
Digalakis, V.1
Murveit, H.2
-
25
-
-
0029375590
-
Speaker adaptation using constrained estimation of Gaussian mixtures
-
V. Digalakis, D. Rtichev and L.G. Neumeyer, Speaker adaptation using constrained estimation of Gaussian mixtures, IEEE Trans. On Speech &Audio, 3(5):357-366, September 1995.
-
(1995)
IEEE Trans. On Speech &Audio
, vol.3
, Issue.5
, pp. 357-366
-
-
Digalakis, V.1
Rtichev, D.2
Neumeyer, L.G.3
-
27
-
-
0141515360
-
Automatic Recognition of Phonetic Patterns in Speech
-
H. Dudley and S. Balashek, Automatic Recognition of Phonetic Patterns in Speech, J. Acoust. Soc. America, 30:721, 1958.
-
(1958)
J. Acoust. Soc. America
, vol.30
, pp. 721
-
-
Dudley, H.1
Balashek, S.2
-
28
-
-
15844378911
-
Human Speech Recognition Performance on the 1994 CSR Spoke 10 Corpus
-
Austin, TX
-
W.J. Ebel and J. Picone, Human Speech Recognition Performance on the 1994 CSR Spoke 10 Corpus, Proc. ARPA Spoken Language Systems Technology Workshop, 53-59, Austin, TX, January 1995.
-
(1995)
Proc. ARPA Spoken Language Systems Technology Workshop
, pp. 53-59
-
-
Ebel, W.J.1
Picone, J.2
-
29
-
-
0019583902
-
Comparison of speaker recognition methods using statistical features and dynamic features
-
S. Furui, Comparison of speaker recognition methods using statistical features and dynamic features, IEEE Trans. On Acoustics, Speech &Signal Processing, ASSP-29, 342-350, 1981.
-
(1981)
IEEE Trans. On Acoustics, Speech &Signal Processing
, vol.ASSP-29
, pp. 342-350
-
-
Furui, S.1
-
30
-
-
85017310148
-
An improved approach to hidden Markov model decomposition of speech and noise
-
San Francisco, CA
-
M.J.F. Gales and S.J. Young, An improved approach to hidden Markov model decomposition of speech and noise, Proc. IEEE ICASSP-92, 233-236, San Francisco, CA, March 1992.
-
(1992)
Proc. IEEE ICASSP-92
, pp. 233-236
-
-
Gales, M.J.F.1
Young, S.J.2
-
31
-
-
0029390135
-
Robust Continuous Speech Recognition using Parallel Model Combination
-
M.J.F. Gales and S.J. Young, Robust Continuous Speech Recognition using Parallel Model Combination, Computer Speech &Language, 9(4):289-307, October 1995.
-
(1995)
Computer Speech &Language
, vol.9
, Issue.4
, pp. 289-307
-
-
Gales, M.J.F.1
Young, S.J.2
-
32
-
-
85128364359
-
Cluster Adaptive Training for Speech Recognition
-
Sydney, Australia
-
M.J.F. Gales, Cluster Adaptive Training for Speech Recognition, Proc. IC-SLP’98, 1783-1786, Sydney, Australia, November 1998.
-
(1998)
Proc. IC-SLP’98
, pp. 1783-1786
-
-
Gales, M.J.F.1
-
33
-
-
0032638856
-
Semi-Tied Covariance Matrices for Hidden Markov Models
-
M.J.F. Gales, Semi-Tied Covariance Matrices for Hidden Markov Models, IEEE Trans. On Speech and Audio, 7(3):273-281, May 1999.
-
(1999)
IEEE Trans. On Speech and Audio
, vol.7
, Issue.3
, pp. 273-281
-
-
Gales, M.J.F.1
-
34
-
-
0001893347
-
Transcribing Broadcast News: The LIMSI Nov96 Hub4 System
-
Chantilly, VA
-
J.L. Gauvain, G. Adda, L. Lamel and M. Adda-Decker, Transcribing Broadcast News: The LIMSI Nov96 Hub4 System, Proc. ARPA Speech Recognition Workshop, 56-63, Chantilly, VA, February 1997.
-
(1997)
Proc. ARPA Speech Recognition Workshop
, pp. 56-63
-
-
Gauvain, J.L.1
Adda, G.2
Lamel, L.3
Adda-Decker, M.4
-
35
-
-
0001790691
-
Spoken Language component of the MASK Kiosk
-
K. Varghese, S. Pfleger(Eds.), Springer-Verlag, 1997. Also in Proc. Human Comfort and Security Workshop, Brussels, Belguim
-
J.L. Gauvain, S. Bennacef, L. Devillers, L. Lamel and R. Rosset, Spoken Language component of the MASK Kiosk in K. Varghese, S. Pfleger(Eds.) Human Comfort and security of information systems, Springer-Verlag, 1997. Also in Proc. Human Comfort and Security Workshop, Brussels, Belguim, October 1995.
-
(1995)
Human Comfort and Security of Information Systems
-
-
Gauvain, J.L.1
Bennacef, S.2
Devillers, L.3
Lamel, L.4
Rosset, R.5
-
36
-
-
0030374902
-
Speech Recognition for an Information Kiosk
-
Philadelphia, PA
-
J.L. Gauvain, J.J. Gangolf, and L. Lamel, Speech Recognition for an Information Kiosk, Proc. ICSLP’96, 849-852, Philadelphia, PA, October 1996.
-
(1996)
Proc. ICSLP’96
, pp. 849-852
-
-
Gauvain, J.L.1
Gangolf, J.J.2
Lamel, L.3
-
37
-
-
85128356454
-
Partitioning and Transcription of Broadcast News Data
-
Sydney, Australia
-
J.L. Gauvain, L. Lamel and G. Adda, Partitioning and Transcription of Broadcast News Data, Proc. ICSLP’98, 5:1335-1338, Sydney, Australia, December 1998.
-
(1998)
Proc. ICSLP’98
, vol.5
, pp. 1335-1338
-
-
Gauvain, J.L.1
Lamel, L.2
Adda, G.3
-
38
-
-
0028996849
-
Developments in Continuous Speech Dictation using the ARPA WSJ Task
-
Detroit, MI
-
J.L. Gauvain, L.F. Lamel and M. Adda-Decker, Developments in Continuous Speech Dictation using the ARPA WSJ Task, Proc. IEEE ICASSP-95, 65-68, Detroit, MI, May 1995.
-
(1995)
Proc. IEEE ICASSP-95
, pp. 65-68
-
-
Gauvain, J.L.1
Lamel, L.F.2
Adda-Decker, M.3
-
39
-
-
0028419019
-
Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains
-
J.L. Gauvain and C.H. Lee, Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains, IEEE Trans. Speech &Audio Processing, 2(2):291-298, April 1994.
-
(1994)
IEEE Trans. Speech &Audio Processing
, vol.2
, Issue.2
, pp. 291-298
-
-
Gauvain, J.L.1
Lee, C.H.2
-
40
-
-
0036567851
-
The LIMSI Broadcast News Transcription System
-
J.L. Gauvain, L. Lamel and G. Adda, The LIMSI Broadcast News Transcription System, Speech Communication, 37(1-2):89-108, May 2002.
-
(2002)
Speech Communication
, vol.37
, Issue.1-2
, pp. 89-108
-
-
Gauvain, J.L.1
Lamel, L.2
Adda, G.3
-
41
-
-
3643049373
-
A Rapid Match Algorithm for Continuous Speech Recognition
-
Hidden Valley, PA
-
L. Gillick and R. Roth, A Rapid Match Algorithm for Continuous Speech Recognition, Proc. DARPA Speech &Natural Language Workshop, 170-172, Hidden Valley, PA, June 1990.
-
(1990)
Proc. DARPA Speech &Natural Language Workshop
, pp. 170-172
-
-
Gillick, L.1
Roth, R.2
-
42
-
-
0030648371
-
A Probabilistic Approach to Confidence Measure Estimation and Evaluation
-
Munich, Germany
-
L. Gillick, Y. Ito and J. Young, A Probabilistic Approach to Confidence Measure Estimation and Evaluation, Proc. IEEE ICASSP-97, 879-882, Munich, Germany, April 1997.
-
(1997)
Proc. IEEE ICASSP-97
, pp. 879-882
-
-
Gillick, L.1
Ito, Y.2
Young, J.3
-
43
-
-
0032665631
-
Real-time Telephone-based Speech Recognition in the Jupiter Domain
-
Phoenix, AZ
-
J.R. Glass, T.J. Hazen and I. L. Hetherington, Real-time Telephone-based Speech Recognition in the Jupiter Domain, Proc. IEEE ICASSP-99, 1:61-64, Phoenix, AZ, March 1999.
-
(1999)
Proc. IEEE ICASSP-99
, vol.1
, pp. 61-64
-
-
Glass, J.R.1
Hazen, T.J.2
Hetherington, I.L.3
-
44
-
-
85016587886
-
SWITCHBOARD: Telephone Speech Corpus for Research and Development
-
San Francisco, CA
-
J. Godfrey, E. Holliman and J. McDaniel, SWITCHBOARD: Telephone Speech Corpus for Research and Development, Proc. IEEE ICASSP-92, 517-520, San Francisco, CA, March 1992.
-
(1992)
Proc. IEEE ICASSP-92
, pp. 517-520
-
-
Godfrey, J.1
Holliman, E.2
McDaniel, J.3
-
45
-
-
0000803388
-
The Population Frequencies of Species and the Estimation of Population Parameters
-
I.J. Good, The Population Frequencies of Species and the Estimation of Population Parameters, Biomterika, 40(3/4):237-264,1953.
-
(1953)
Biomterika
, vol.40
, Issue.3-4
, pp. 237-264
-
-
Good, I.J.1
-
46
-
-
0028996969
-
A tree search strategy for large-vocabulary continuous speech recognition
-
Detroit, MI
-
P.S. Gopalakrishnan, L.R. Bahl and R.L. Mercer, A tree search strategy for large-vocabulary continuous speech recognition, Proc. IEEE ICASSP-95, 1:572-575, Detroit, MI, May 1995.
-
(1995)
Proc. IEEE ICASSP-95
, vol.1
, pp. 572-575
-
-
Gopalakrishnan, P.S.1
Bahl, L.R.2
Mercer, R.L.3
-
47
-
-
85017287487
-
Linear Discriminant Analysis for Improved Large Vocabulary Continuous Speech Recognition
-
R. Haeb-Umbach and H. Ney, Linear Discriminant Analysis for Improved Large Vocabulary Continuous Speech Recognition, Proc. ICASSP-92, 1:1316, March 1992.
-
(1992)
Proc. ICASSP-92
, vol.1
, pp. 1316
-
-
Haeb-Umbach, R.1
Ney, H.2
-
48
-
-
0002751623
-
Segment Generation and Clustering in the HTK Broadcast News Transcription System
-
Landsdowne, VA
-
T. Hain, S.E. Johnson, A. Tuerk, P.C. Woodland and S.J. Young, Segment Generation and Clustering in the HTK Broadcast News Transcription System, Proc. Darpa Broadcast News Transcription &Understanding Workshop, 133-137, Landsdowne, VA, February 1998.
-
(1998)
Proc. Darpa Broadcast News Transcription &Understanding Workshop
, pp. 133-137
-
-
Hain, T.1
Johnson, S.E.2
Tuerk, A.3
Woodland, P.C.4
Young, S.J.5
-
50
-
-
0002384527
-
The ATIS Spoken Language Systems Pilot Corpus
-
Pittsburgh, PA
-
C.T. Hemphill, J.J. Godfrey, and G.R. Doddington, The ATIS Spoken Language Systems Pilot Corpus, Proc. Darpa Speech &Natural Language Workshop, Pittsburgh, PA, June 1990.
-
(1990)
Proc. Darpa Speech &Natural Language Workshop
-
-
Hemphill, C.T.1
Godfrey, J.J.2
Doddington, G.R.3
-
51
-
-
0025041264
-
Perceptual linear predictive (PLP) analysis of speech
-
H. Hermansky, Perceptual linear predictive (PLP) analysis of speech, J. Acoust. Soc. America, 87(4):1738-1752,1990.
-
(1990)
J. Acoust. Soc. America
, vol.87
, Issue.4
, pp. 1738-1752
-
-
Hermansky, H.1
-
52
-
-
0002384092
-
Large vocabulary continuous speech recognition using a hybrid connectionist-HMM system
-
Yokohama, Japan
-
M.M. Hochberg, S.J. Renals, A.J. Robinson and D. Kershaw, Large vocabulary continuous speech recognition using a hybrid connectionist-HMM system, Proc. ICSLP’94, 1499-1502, Yokohama, Japan, September 1994.
-
(1994)
Proc. ICSLP’94
, pp. 1499-1502
-
-
Hochberg, M.M.1
Renals, S.J.2
Robinson, A.J.3
Kershaw, D.4
-
53
-
-
85055768776
-
-
Chapter 1.3 of the State of the Art in Human Language Technology, (Cole et al, eds.)
-
M.J. Hunt, Signal Representation, Chapter 1.3 of the State of the Art in Human Language Technology, (Cole et al, eds.), 1996. (http://www.cse.ogi.edu/CSLU/HLTsurvey/ch1node2.html)
-
(1996)
Signal Representation
-
-
Hunt, M.J.1
-
54
-
-
85015539783
-
Subphonetic Modeling with Markov States - Senone
-
San Francisco, CA
-
M. Hwang and X. Huang, Subphonetic Modeling with Markov States - Senone, Proc. IEEE ICASSP-92,1:33-36, San Francisco, CA, March 1992.
-
(1992)
Proc. IEEE ICASSP-92
, vol.1
, pp. 33-36
-
-
Hwang, M.1
Huang, X.2
-
55
-
-
0027153655
-
Predicting Unseen Triphones with Senones
-
Minneapolis, MN
-
M.Y. Hwang, X. Huang and F. Alleva, Predicting Unseen Triphones with Senones, Proc. IEEE ICASSP-93, II:311-314, Minneapolis, MN, April 1993.
-
(1993)
Proc. IEEE ICASSP-93
, vol.2
, pp. 311-314
-
-
Hwang, M.Y.1
Huang, X.2
Alleva, F.3
-
56
-
-
0016939124
-
Continuous Speech Recognition by Statistical Methods
-
F. Jelinek, Continuous Speech Recognition by Statistical Methods, Proc. Of the IEEE, 64(4): 532-556, April 1976.
-
(1976)
Proc. Of the IEEE
, vol.64
, Issue.4
, pp. 532-556
-
-
Jelinek, F.1
-
58
-
-
0012357341
-
A Dynamic Language Model for Speech Recognition
-
Pacific Grove, CA
-
F. Jelinek, B. Merialdo, S. Roukos and M. Strauss, A Dynamic Language Model for Speech Recognition, Proc. DARPA Speech &Natural Language Workshop, 293-295, Pacific Grove, CA, February 1991.
-
(1991)
Proc. DARPA Speech &Natural Language Workshop
, pp. 293-295
-
-
Jelinek, F.1
Merialdo, B.2
Roukos, S.3
Strauss, M.4
-
59
-
-
85055786864
-
-
Toulouse, France
-
F. deJong, J.L. Gauvain, J. deb Hartog and K. Netter, Olive: Speech Based Video Retrieval, Proc. CBMI’99, Toulouse, France, October 1999.
-
(1999)
Olive: Speech Based Video Retrieval, Proc. CBMI’99
-
-
Dejong, F.1
Gauvain, J.L.2
Deb Hartog, J.3
Netter, K.4
-
61
-
-
0023312404
-
Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer
-
S.M. Katz, Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer, IEEE Trans. Acoustics, Speech &Signal Processing, ASSP-35(3):400-401, March 1987.
-
(1987)
IEEE Trans. Acoustics, Speech &Signal Processing
, vol.ASSP-35
, Issue.3
, pp. 400-401
-
-
Katz, S.M.1
-
62
-
-
85135261720
-
Unsupervised Training of a Speech Recognizer: Recent Experiments
-
Budapest, Hungary
-
T. Kemp and A. Waibel, Unsupervised Training of a Speech Recognizer: Recent Experiments, Proc. ESCA Eurospeech’99, 6:2725-2728, Budapest, Hungary, September 1999.
-
(1999)
Proc. ESCA Eurospeech’99
, vol.6
, pp. 2725-2728
-
-
Kemp, T.1
Waibel, A.2
-
63
-
-
33646908801
-
The 1995 Abbot hybrid connectionist-HMM large-vocabulary recognition system
-
Harriman, NY
-
D. Kershaw, A.J. Robinson and S.J. Renals, The 1995 Abbot hybrid connectionist-HMM large-vocabulary recognition system, Proc. ARPA Speech Recognition Workshop, 93-98, Harriman, NY, February 1996.
-
(1996)
Proc. ARPA Speech Recognition Workshop
, pp. 93-98
-
-
Kershaw, D.1
Robinson, A.J.2
Renals, S.J.3
-
64
-
-
85123963268
-
Improved Clustering Techniques for Class-Based Statistical Language Modelling
-
Berlin, September
-
R. Kneser and H. Ney, Improved Clustering Techniques for Class-Based Statistical Language Modelling, Proc. Eurospeech’93, 973-976, Berlin, September 1993.
-
(1993)
Proc. Eurospeech’93
, pp. 973-976
-
-
Kneser, R.1
Ney, H.2
-
65
-
-
0028996876
-
Improved backing-off for n-gram language modeling
-
Detroit, MI, May
-
R. Kneser and H. Ney, Improved backing-off for n-gram language modeling, Proc. IEEEICASSP-95,1:181-184, Detroit, MI, May 1995.
-
(1995)
Proc. IEEEICASSP-95
, vol.1
, pp. 181-184
-
-
Kneser, R.1
Ney, H.2
-
66
-
-
0343125611
-
Design of the 1994 CSR Benchmark Tests
-
Austin, TX, January
-
F. Kubala, Design of the 1994 CSR Benchmark Tests, Proc. ARPA Spoken Language Systems Technology Workshop, 41-46, Austin, TX, January 1995.
-
(1995)
Proc. ARPA Spoken Language Systems Technology Workshop
, pp. 41-46
-
-
Kubala, F.1
-
67
-
-
0002519514
-
Toward Automatic Recognition of Broadcast News
-
Harriman, NY, February
-
F. Kubala, T. Anastasakos, H. Jin, J. Makhoul, L. Nguyen, R. Schwartz and N. Yuan, Toward Automatic Recognition of Broadcast News, Proc. Darpa Speech Recognition Workshop, 55-60, Harriman, NY, February 1996.
-
(1996)
Proc. Darpa Speech Recognition Workshop
, pp. 55-60
-
-
Kubala, F.1
Anastasakos, T.2
Jin, H.3
Makhoul, J.4
Nguyen, L.5
Schwartz, R.6
Yuan, N.7
-
68
-
-
0032289099
-
Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition
-
December
-
N. Kumar and A.G. Andreou, Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition, Speech Communication, 26(4):283-297, December 1998.
-
(1998)
Speech Communication
, vol.26
, Issue.4
, pp. 283-297
-
-
Kumar, N.1
Andreou, A.G.2
-
69
-
-
84871609195
-
Eigenvoices for Speaker Adaptation
-
Sydney, November
-
R. Kuhn, P. Nguyen, J.C. Junqua, L. Goldwasser, N. Niedzielski, S. Fincke, and K. Field, M. Contolini, Eigenvoices for Speaker Adaptation, Proc. IC-SLP’98, 177l-1774, Sydney, November 1998.
-
(1998)
Proc. IC-SLP’98
, pp. 1771-1774
-
-
Kuhn, R.1
Nguyen, P.2
Junqua, J.C.3
Goldwasser, L.4
Niedzielski, N.5
Fincke, S.6
Field, K.7
Contolini, M.8
-
70
-
-
0030351374
-
On Designing Pronunciation Lexicons for Large Vocabulary, Continuous Speech Recognition
-
Philadelphia, PA, October
-
L.F. Lamel and G. Adda, On Designing Pronunciation Lexicons for Large Vocabulary, Continuous Speech Recognition, Proc. ICSLP’96, 1:6-9, Philadelphia, PA, October 1996.
-
(1996)
Proc. ICSLP’96
, vol.1
, pp. 6-9
-
-
Lamel, L.F.1
Adda, G.2
-
71
-
-
0011830592
-
Speech Recognition of European Languages
-
Snowbird, Utah, December
-
L.F. Lamel and R. DeMori, Speech Recognition of European Languages, Proc. IEEE Automatic Speech Recognition Workshop, 51-54, Snowbird, Utah, December 1995.
-
(1995)
Proc. IEEE Automatic Speech Recognition Workshop
, pp. 51-54
-
-
Lamel, L.F.1
Demori, R.2
-
72
-
-
0010536626
-
Continuous Speech Recognition at LIMSI
-
Stanford, CA, September
-
L.F. Lamel and J.L. Gauvain, Continuous Speech Recognition at LIMSI, Proc. ARPA Workshop on Continuous Speech Recognition, 59-64, Stanford, CA, September 1992.
-
(1992)
Proc. ARPA Workshop on Continuous Speech Recognition
, pp. 59-64
-
-
Lamel, L.F.1
Gauvain, J.L.2
-
73
-
-
0029219785
-
A Phone-based Approach to Non-Linguistic Speech Feature Identification
-
January
-
L.F. Lamel and J.L. Gauvain, A Phone-based Approach to Non-Linguistic Speech Feature Identification, Computer Speech &Language, 9(1):87-103, January 1995.
-
(1995)
Computer Speech &Language
, vol.9
, Issue.1
, pp. 87-103
-
-
Lamel, L.F.1
Gauvain, J.L.2
-
74
-
-
0036460908
-
Lightly Supervised and Unsupervised Acoustic Model Training
-
L. Lamel, J.L. Gauvain, and G. Adda, Lightly Supervised and Unsupervised Acoustic Model Training, Computer, Speech &Language, 16(1):115-229, January 2002.
-
(2002)
Computer, Speech &Language
, vol.16
, Issue.1
, pp. 115-229
-
-
Lamel, L.1
Gauvain, J.L.2
Adda, G.3
-
75
-
-
85124760678
-
Development of Spoken Language Corpora for Travel Information
-
Madrid, Spain
-
L.F. Lamel, S. Rosset, S.K. Bennacef, H. Bonneau-Maynard, L. Devillers and J.L. Gauvain, Development of Spoken Language Corpora for Travel Information, Proc. ESCA Eurospeech’95, 3:1961-1964, Madrid, Spain, September 1995.
-
(1995)
Proc. ESCA Eurospeech’95
, vol.3
, pp. 1961-1964
-
-
Lamel, L.F.1
Rosset, S.2
Bennacef, S.K.3
Bonneau-Maynard, H.4
Devillers, L.5
Gauvain, J.L.6
-
77
-
-
0029747183
-
Speaker Normalization Using Efficient Frequency Warping Procedures
-
Atlanta, GA, May
-
L. Lee and R.C. Rose, Speaker Normalization Using Efficient Frequency Warping Procedures, Proc. IEEE ICASSP-96,1:353-356, Atlanta, GA, May 1996.
-
(1996)
Proc. IEEE ICASSP-96
, vol.1
, pp. 353-356
-
-
Lee, L.1
Rose, R.C.2
-
78
-
-
0029288633
-
Maximum Likelihood Linear Regressionfor Speaker Adaptation of Continuous Density Hidden Markov Models
-
C. J. Leggetter and P. C. Woodland, Maximum Likelihood Linear Regressionfor Speaker Adaptation of Continuous Density Hidden Markov Models, Computer Speech &Language, 9(2):171-185, April 1995.
-
(1995)
Computer Speech &Language
, vol.9
, Issue.2
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
79
-
-
0020180460
-
Maximum Likelihood Estimation for Multivariate Observations of Markov Sources
-
Liporace, L. R., Maximum Likelihood Estimation for Multivariate Observations of Markov Sources, IEEE Transactions on Information Theory, IT-28(5):729-734,1982.
-
(1982)
IEEE Transactions on Information Theory
, vol.IT-28
, Issue.5
, pp. 729-734
-
-
Liporace, L.R.1
-
80
-
-
0031187171
-
Speech recognition by machines and humans
-
R. P Lippmann, Speech recognition by machines and humans, Speech Communication, 22(1):1-15, July 1997.
-
(1997)
Speech Communication
, vol.22
, Issue.1
, pp. 1-15
-
-
Lippmann, R.P.1
-
81
-
-
85119434191
-
Fast Speaker Change Detection for Broadcast News Transcription and Indexing
-
Budapest, Hungary
-
D. Liu and F. Kubala, Fast Speaker Change Detection for Broadcast News Transcription and Indexing, Proc. ESCA EuroSpeech’99, 3:1031-1034, Budapest, Hungary, September 1999.
-
(1999)
Proc. ESCA EuroSpeech’99
, vol.3
, pp. 1031-1034
-
-
Liu, D.1
Kubala, F.2
-
82
-
-
0345098384
-
Multi-site Data Collection for a Spoken Language Corpus
-
Harriman, NY, February
-
Madcow, M., Multi-site Data Collection for a Spoken Language Corpus, Proc. Darpa Speech &Natural Language Workshop, 7-14, Harriman, NY, February 1992
-
(1992)
Proc. Darpa Speech &Natural Language Workshop
, pp. 7-14
-
-
Madcow, M.1
-
83
-
-
0034296009
-
A Stolcke, Finding Consensus in Speech Recognition: Word Error Minimization and Other Applications of Confusion Networks
-
L Mangu, E Brill, A Stolcke, Finding Consensus in Speech Recognition: Word Error Minimization and Other Applications of Confusion Networks, Computer, Speech and Language, 14(4):373-400, October 2000.
-
(2000)
Computer, Speech and Language
, vol.14
, Issue.4
, pp. 373-400
-
-
Mangu, L.1
Brill, E.2
-
84
-
-
85135158363
-
Subspace distribution clustering for continuous observation density hidden Markov models
-
Rhodes, Greece
-
B Mak and E Bocchieri, Subspace distribution clustering for continuous observation density hidden Markov models, Proc. Eurospeech’97, 107-110, Rhodes, Greece, September 1997.
-
(1997)
Proc. Eurospeech’97
, pp. 107-110
-
-
Mak, B.1
Bocchieri, E.2
-
85
-
-
33646936293
-
Spoken Language Processing and Human-Machine Communication in the European Union Programs
-
G. Varile, ed, Rhodes, Greece, September
-
J J Mariani Spoken Language Processing and Human-Machine Communication in the European Union Programs, in G. Varile, ed., Eurospeech’97 EU Speech Projects Day report, Rhodes, Greece, September 1997.
-
(1997)
Eurospeech’97 EU Speech Projects Day Report
-
-
Mariani, J.J.1
-
86
-
-
0004657714
-
An overview of EU programs related to conversational/interactive systems
-
Landsdowne, VA
-
J. J. Mariani and L. F. Lamel, An overview of EU programs related to conversational/interactive systems, Proc DARPA Broadcast News Transcription &Understanding Workshop, 247-253, Landsdowne, VA, February 1998.
-
(1998)
Proc DARPA Broadcast News Transcription &Understanding Workshop
, pp. 247-253
-
-
Mariani, J.J.1
Lamel, L.F.2
-
87
-
-
85135152717
-
Algorithms for Bigram and Trigram Clustering
-
Madrid, Spain
-
S. Martin, J. Liermann and H. Ney, Algorithms for Bigram and Trigram Clustering, Proc. Eurospeech’95, 1253-1256, Madrid, Spain, September 1995.
-
(1995)
Proc. Eurospeech’95
, pp. 1253-1256
-
-
Martin, S.1
Liermann, J.2
Ney, H.3
-
89
-
-
0009588713
-
Named Entity Extraction from Broadcast News
-
Herndon, VA
-
D. Miller, R. Schwartz, R. Weischedel and R. Stone, Named Entity Extraction from Broadcast News, Proc DARPA Broadcast News Workshop, 37-40, Herndon, VA, February 1999
-
(1999)
Proc DARPA Broadcast News Workshop
, pp. 37-40
-
-
Miller, D.1
Schwartz, R.2
Weischedel, R.3
Stone, R.4
-
90
-
-
84892168937
-
Full Expansion of Context-Dependent Networks in Large Vocabulary Speech Recognition
-
Seattle, WA
-
M. Mohri, M. Riley, D. Hindle, A. Ljolie and F. Pereira, Full Expansion of Context-Dependent Networks in Large Vocabulary Speech Recognition, Proc IEEE ICASSP-98, 665-668, Seattle, WA, May 1998.
-
(1998)
Proc IEEE ICASSP-98
, pp. 665-668
-
-
Mohri, M.1
Riley, M.2
Hindle, D.3
Ljolie, A.4
Pereira, F.5
-
91
-
-
0027192626
-
Large-Vocabulary Dictation using SRI’s Decipher Speech Recognition System: Progressive Search Techniques
-
Minneapolis, MN
-
H. Murveit, J. Butzberger, V. Digalakis and M. Weintraub, Large-Vocabulary Dictation using SRI’s Decipher Speech Recognition System: Progressive Search Techniques, Proc. IEEE ICASSP-93, II:319-322, Minneapolis, MN, April 1993.
-
(1993)
Proc. IEEE ICASSP-93
, vol.2
, pp. 319-322
-
-
Murveit, H.1
Butzberger, J.2
Digalakis, V.3
Weintraub, M.4
-
92
-
-
0021406359
-
The Use of a One-Stage Dynamic Programming Algorithm for Connected Word Recognition
-
H. Ney, The Use of a One-Stage Dynamic Programming Algorithm for Connected Word Recognition, IEEE Trans. Acoustics, Speech and Signal Processing, ASSP-32(2):263-271, April 1984.
-
(1984)
IEEE Trans. Acoustics, Speech and Signal Processing
, vol.ASSP-32
, Issue.2
, pp. 263-271
-
-
Ney, H.1
-
93
-
-
85017308347
-
Improvements in Beam Search for 10000-Word Continuous Speech Recognition
-
San Francisco, CA
-
H. Ney, R. Haeb-Umbach, B.H. Tran and M. Oerder, Improvements in Beam Search for 10000-Word Continuous Speech Recognition, Proc. IEEE ICASSP-92, I:9-12, San Francisco, CA, March 1992.
-
(1992)
Proc. IEEE ICASSP-92
, vol.1
, pp. 9-12
-
-
Ney, H.1
Haeb-Umbach, R.2
Tran, B.H.3
Oerder, M.4
-
94
-
-
0032689227
-
Single-Tree Method for Grammar-Directed Search
-
Phoenix, AZ
-
L. Nguyen and R. Schwartz, Single-Tree Method for Grammar-Directed Search, Proc. IEEE ICASSP-99,2:613-616, Phoenix, AZ, March 1999.
-
(1999)
Proc. IEEE ICASSP-99
, vol.2
, pp. 613-616
-
-
Nguyen, L.1
Schwartz, R.2
-
96
-
-
0001889147
-
A One Pass Decoder Design for Large Vocabulary Recognition
-
Princeton, NJ
-
J.J. Odell, V. Valtchev, P.C. Woodland and S.J. Young, A One Pass Decoder Design for Large Vocabulary Recognition, Proc. ARPA Human Language Technology Workshop, 405-410, Princeton, NJ, March 1994.
-
(1994)
Proc. ARPA Human Language Technology Workshop
, pp. 405-410
-
-
Odell, J.J.1
Valtchev, V.2
Woodland, P.C.3
Young, S.J.4
-
97
-
-
0002110654
-
Recent Advances in Japanese Broadcast News Transcription
-
Budapest, Hungary
-
K. Ohtsuki, S. Furui, N. Sakurai, A. Iwasaki and Z. P Zeang, Recent Advances in Japanese Broadcast News Transcription, Proc. ESCA Eurospeech’99, 2:671-674, Budapest, Hungary, September 1999.
-
(1999)
Proc. ESCA Eurospeech’99
, vol.2
, pp. 671-674
-
-
Ohtsuki, K.1
Furui, S.2
Sakurai, N.3
Iwasaki, A.4
Zeang, Z.P.5
-
98
-
-
0036295941
-
Modeling Inverse Covariance Matrices by Basis Expansion
-
Orlando, FL
-
P A. Olsen and R. A. Gopinath, Modeling Inverse Covariance Matrices by Basis Expansion, Proc. IEEE ICASSP-02, 945-948, Orlando, FL, 2002.
-
(2002)
Proc. IEEE ICASSP-02
, pp. 945-948
-
-
Olsen, P.A.1
Gopinath, R.A.2
-
99
-
-
0030366694
-
Language-model look-ahead for large vocabulary speech recognition
-
Philadelphia, PA
-
S. Ortmanns, H. Ney, and A. Eiden, Language-model look-ahead for large vocabulary speech recognition, Proc. ICSLP’96, 2095-2098, Philadelphia, PA, October 1996
-
(1996)
Proc. ICSLP’96
, pp. 2095-2098
-
-
Ortmanns, S.1
Ney, H.2
Eiden, A.3
-
100
-
-
0030719155
-
A Word Graph Algorithm for Large Vocabulary Continuous Speech Recognition
-
S. Ortmanns, H. Ney, and X. Aubert, A Word Graph Algorithm for Large Vocabulary Continuous Speech Recognition, Computer, Speech and Language, 11(1):43-72, January 1997.
-
(1997)
Computer, Speech and Language
, vol.11
, Issue.1
, pp. 43-72
-
-
Ortmanns, S.1
Ney, H.2
Aubert, X.3
-
101
-
-
0016467605
-
The Role of Phonological Rules in Speech Understanding Research
-
B. T Oshika, V.W. Zue, R. V. Weeks, H. Neu and J. Aurbach, The Role of Phonological Rules in Speech Understanding Research, IEEE Trans. Acoustics, Speech, Signal Processing, ASSP-23, 104-112,1975.
-
(1975)
IEEE Trans. Acoustics, Speech, Signal Processing
, vol.ASSP-23
, pp. 104-112
-
-
Oshika, B.T.1
Zue, V.W.2
Weeks, R.V.3
Neu, H.4
Aurbach, J.5
-
102
-
-
33645771960
-
Continuous Word Recognition Based on the Stochastic Segment Model
-
Stanford, CA
-
M. Ostendorf, A. Kannan, O. Kimball and J. R. Rohlicek, Continuous Word Recognition Based on the Stochastic Segment Model, Proc ARPA Workshop on Continuous Speech Recognition, 53-58, Stanford, CA, September 1992.
-
(1992)
Proc ARPA Workshop on Continuous Speech Recognition
, pp. 53-58
-
-
Ostendorf, M.1
Kannan, A.2
Kimball, O.3
Rohlicek, J.R.4
-
103
-
-
0141760645
-
1993 Benchmark Tests for the ARPA Spoken Language Program
-
Princeton, NJ
-
D. S. Pallett, J. G. Fiscus, W. M. Fisher, J. S. Garofolo, B. A. Lund and M.A. Pryzbocki, 1993 Benchmark Tests for the ARPA Spoken Language Program, Proc. ARPA Human Language Technology Workshop, 49-74, Princeton, NJ, March 1994
-
(1994)
Proc. ARPA Human Language Technology Workshop
, pp. 49-74
-
-
Pallett, D.S.1
Fiscus, J.G.2
Fisher, W.M.3
Garofolo, J.S.4
Lund, B.A.5
Pryzbocki, M.A.6
-
104
-
-
0012316245
-
1994 Benchmark Tests for the ARPA Spoken Language Program
-
Austin, TX
-
D. S. Pallett, J. G. Fiscus, W. M. Fisher, J. S. Garofolo, B. A. Lund, A.F. Martin and M. A. Przybocki, 1994 Benchmark Tests for the ARPA Spoken Language Program, Proc. ARPA Spoken Language Systems Technology Workshop, 536, Austin, TX, January 1995.
-
(1995)
Proc. ARPA Spoken Language Systems Technology Workshop
, pp. 536
-
-
Pallett, D.S.1
Fiscus, J.G.2
Fisher, W.M.3
Garofolo, J.S.4
Lund, B.A.5
Martin, A.F.6
Przybocki, M.A.7
-
105
-
-
0344230603
-
1995 Hub-3 Multiple Microphone Corpus Benchmark Tests
-
Harriman, NY
-
D. S. Pallett, J. G. Fiscus, W. M. Fisher, J. S. Garofolo, A.F. Martin and M.A. Przybocki, 1995 Hub-3 Multiple Microphone Corpus Benchmark Tests, Proc. ARPA Speech Recognition Workshop, 27-46, Harriman, NY, February 1996.
-
(1996)
Proc. ARPA Speech Recognition Workshop
, pp. 27-46
-
-
Pallett, D.S.1
Fiscus, J.G.2
Fisher, W.M.3
Garofolo, J.S.4
Martin, A.F.5
Przybocki, M.A.6
-
106
-
-
0001895107
-
1998 Broadcast News Benchmark Test Results: English and Non-English Word Error Rate Performance Measures
-
Herndon, VA
-
D. S. Pallett, J. G. Fiscus, J. S. Garofolo, A.F. Martin and M. A. Przybocki, 1998 Broadcast News Benchmark Test Results: English and Non-English Word Error Rate Performance Measures, Proc. Darpa Broadcast News Workshop, 5-12, Herndon, VA, February 1999.
-
(1999)
Proc. Darpa Broadcast News Workshop
, pp. 5-12
-
-
Pallett, D.S.1
Fiscus, J.G.2
Garofolo, J.S.3
Martin, A.F.4
Przybocki, M.A.5
-
107
-
-
85017287102
-
An efficient A stack decoder algorithm for continuous speech recognition with a stochastic language model
-
San Francisco, CA
-
D. B. Paul, An efficient A stack decoder algorithm for continuous speech recognition with a stochastic language model, Proc. IEEE ICASSP-92, 405-409, San Francisco, CA, March 1992.
-
(1992)
Proc. IEEE ICASSP-92
, pp. 405-409
-
-
Paul, D.B.1
-
108
-
-
0034849080
-
Improved Discriminative Training Techniques For Large Vocabulary Continuous Speech Recognition
-
Salt Lake City, May
-
D. Povey and P Woodland, Improved Discriminative Training Techniques For Large Vocabulary Continuous Speech Recognition, Proc. IEEE ICASSP-01, Salt Lake City, May 2001.
-
(2001)
Proc. IEEE ICASSP-01
-
-
Povey, D.1
Woodland, P.2
-
109
-
-
0002617904
-
Evaluation of Spoken Language Systems: The ATIS Domain
-
Hidden Valley, PA, June
-
P Price, Evaluation of Spoken Language Systems: The ATIS Domain, Proc. Darpa Speech and Natural Language Workshop, 91-95, Hidden Valley, PA, June, 1990
-
(1990)
Proc. Darpa Speech and Natural Language Workshop
, pp. 91-95
-
-
Price, P.1
-
112
-
-
0033353288
-
Stochastic pronunciation modelling from hand-labelled phonetic corpora
-
M. D. Riley, W. Byrne, M. Finke, S. Khudanpu, A. Ljojle, J. McDonough, H. Nock, M. Saraclar, C. Wooters and G. Zavaliagkos, Stochastic pronunciation modelling from hand-labelled phonetic corpora, Speech Communication, 29(2-4):209-224, November 1999.
-
(1999)
Speech Communication
, vol.29
, Issue.2-4
, pp. 209-224
-
-
Riley, M.D.1
Byrne, W.2
Finke, M.3
Khudanpu, S.4
Ljojle, A.5
McDonough, J.6
Nock, H.7
Saraclar, M.8
Wooters, C.9
Zavaliagkos, G.10
-
113
-
-
30244503648
-
Improvements in Stochastic Language Modeling
-
Harriman, NY
-
R. Rosenfeld and X. Huang, Improvements in Stochastic Language Modeling, Proc. Darpa Workshop on Speech &Natural Language, 107-111, Harriman, NY, February 1992
-
(1992)
Proc. Darpa Workshop on Speech &Natural Language
, pp. 107-111
-
-
Rosenfeld, R.1
Huang, X.2
-
114
-
-
0003904645
-
-
Ph. D. Thesis, Carnegie Mellon University, (also Tech. rep. CMU-CS-94-138)
-
R. Rosenfeld, Adaptive Statistical Language Modeling, Ph. D. Thesis, Carnegie Mellon University, 1994. (also Tech. rep. CMU-CS-94-138).
-
(1994)
Adaptive Statistical Language Modeling
-
-
Rosenfeld, R.1
-
115
-
-
33646907991
-
Two Decades of Statistical Language Modeling: Where Do We Go From Here?
-
R. Rosenfeld, Two Decades of Statistical Language Modeling: Where Do We Go From Here?, Proceedings of the IEEE, Special issue on Spoken Language Processing, 88(8):1270-1278, August 2000.
-
(2000)
Proceedings of the IEEE, Special Issue on Spoken Language Processing
, vol.88
, Issue.8
, pp. 1270-1278
-
-
Rosenfeld, R.1
-
116
-
-
0035426931
-
Language-independent and langauge-adaptive acoustic modeling for speech recognition
-
T Schultza and A. Waibel, Language-independent and langauge-adaptive acoustic modeling for speech recognition, Speech Communication, 35(1-2):31-51, August 2001.
-
(2001)
Speech Communication
, vol.35
, Issue.1-2
, pp. 31-51
-
-
Schultza, T.1
Waibel, A.2
-
117
-
-
0033896970
-
Memory-efficient LVCSR search using a one-pass stack decoder
-
M. Schuster, Memory-efficient LVCSR search using a one-pass stack decoder, Computer Speech &Language, 14(1):47-77, January 2000.
-
(2000)
Computer Speech &Language
, vol.14
, Issue.1
, pp. 47-77
-
-
Schuster, M.1
-
118
-
-
85017310294
-
New uses for N-Best Sentence Hypothesis, within the BYBLOS Speech Recognition System
-
San Francisco, CA
-
R. Schwartz, S. Austin, F. Kubala and J. Makhoul, New uses for N-Best Sentence Hypothesis, within the BYBLOS Speech Recognition System, Proc. IEEE ICASSP-92,1:1-4, San Francisco, CA, March 1992.
-
(1992)
Proc. IEEE ICASSP-92
, vol.1
, pp. 1-4
-
-
Schwartz, R.1
Austin, S.2
Kubala, F.3
Makhoul, J.4
-
119
-
-
0021142214
-
Improved Hidden Markov Modeling of Phonemes for Continuous Speech Recognition
-
San Diego, CA
-
R. Schwartz, Y. Chow, S. Roucos, M. Krasner and J. Makhoul, Improved Hidden Markov Modeling of Phonemes for Continuous Speech Recognition, Proc. IEEE ICASSP-84, 3:35.6.1-35.6.4, San Diego, CA, March 1984.
-
(1984)
Proc. IEEE ICASSP-84
-
-
Schwartz, R.1
Chow, Y.2
Roucos, S.3
Krasner, M.4
Makhoul, J.5
-
120
-
-
33646939277
-
NYU Language Modeling Experiments for the 1995 CSR Evaluation
-
Harriman, NY
-
S. Sekine and R. Grishman, NYU Language Modeling Experiments for the 1995 CSR Evaluation, Proc. ARPA Speech Recognition Workshop, 123-128, Harriman, NY, February 1996.
-
(1996)
Proc. ARPA Speech Recognition Workshop
, pp. 123-128
-
-
Sekine, S.1
Grishman, R.2
-
121
-
-
0029726011
-
A Markov Random Field Approach to Bayesian Speaker Adaptation
-
Detroit, MI
-
B. Shahshahani, A Markov Random Field Approach to Bayesian Speaker Adaptation, Proc. IEEE ICASSP-95, 697-700, Detroit, MI, May 1995.
-
(1995)
Proc. IEEE ICASSP-95
, pp. 697-700
-
-
Shahshahani, B.1
-
122
-
-
0001405849
-
Modeling Those F-Conditions - Or Not
-
Chantilly, VA
-
R. Schwartz, H. Jin, F. Kubala and S. Matsoukas, Modeling Those F-Conditions - Or Not, Proc. Darpa Speech Recognition Workshop, 115-118, Chantilly, VA, February 1997.
-
(1997)
Proc. Darpa Speech Recognition Workshop
, pp. 115-118
-
-
Schwartz, R.1
Jin, H.2
Kubala, F.3
Matsoukas, S.4
-
123
-
-
0030361237
-
Scalable backoff language models
-
Philadelphia, PA
-
K. Seymore and R. Rosenfeld, Scalable backoff language models, Proc. ICSLP’96, 1:232-235, Philadelphia, PA, October 1996.
-
(1996)
Proc. ICSLP’96
, vol.1
, pp. 232-235
-
-
Seymore, K.1
Rosenfeld, R.2
-
124
-
-
0002782496
-
Automatic Segmentation, Classification and Clustering of Broadcast News Audio
-
Chantilly, VA
-
M. Siegler, U. Jain, B. Raj and R. Stern, Automatic Segmentation, Classification and Clustering of Broadcast News Audio, Proc DARPA Speech Recognition Workshop, 97-99, Chantilly, VA, February 1997
-
(1997)
Proc DARPA Speech Recognition Workshop
, pp. 97-99
-
-
Siegler, M.1
Jain, U.2
Raj, B.3
Stern, R.4
-
125
-
-
0033344871
-
Evaluation of word confidence for speech recognition systems
-
M. Siu and H. Gish, Evaluation of word confidence for speech recognition systems, Computer Speech &Language, 13(4):299-318, October 1999.
-
(1999)
Computer Speech &Language
, vol.13
, Issue.4
, pp. 299-318
-
-
Siu, M.1
Gish, H.2
-
127
-
-
0028996958
-
Four-level Tied Structure for Efficient Representation of Acoustic Modeling
-
Detroit, MI
-
S. Takahashi and S. Sagayama, Four-level Tied Structure for Efficient Representation of Acoustic Modeling, Proc. IEEE ICASSP-95, 520-523, Detroit, MI, May 1995.
-
(1995)
Proc. IEEE ICASSP-95
, pp. 520-523
-
-
Takahashi, S.1
Sagayama, S.2
-
128
-
-
85135261079
-
An Investigation into Vocal Tract Length Normalization
-
Budapest, Hungary
-
L. F. Uebel and P C. Woodland, An Investigation into Vocal Tract Length Normalization, Proc. ESCA Eurospeech’99, 2527-2530, Budapest, Hungary, September 1999.
-
(1999)
Proc. ESCA Eurospeech’99
, pp. 2527-2530
-
-
Uebel, L.F.1
Woodland, P.C.2
-
129
-
-
0040262071
-
Human Benchmarks for Speaker Independent Large Vocabulary Recognition Performance
-
Madrid, Spain
-
D.A. van Leeuwen, L. G. van den Berg and H.J. M. Steeneken, Human Benchmarks for Speaker Independent Large Vocabulary Recognition Performance, Proc. ESCA Eurospeech’95, 1461-1464, Madrid, Spain, September 1995.
-
(1995)
Proc. ESCA Eurospeech’95
, pp. 1461-1464
-
-
Van Leeuwen, D.A.1
Van Den Berg, L.G.2
Steeneken, H.J.M.3
-
130
-
-
0010727514
-
Speech discrimination by dynamic programming
-
T K. Vintsyuk, Speech discrimination by dynamic programming, Kibnernetika, 4:81, 1968.
-
(1968)
Kibnernetika
, vol.4
, pp. 81
-
-
Vintsyuk, T.K.1
-
131
-
-
34250411858
-
Elements-wise recognition of continuous speech composed of words from a specified dictionary
-
March-April
-
T. K. Vintsyuk, Elements-wise recognition of continuous speech composed of words from a specified dictionary, Cybernetics, 7:133-143, March-April 1971.
-
(1971)
Cybernetics
, vol.7
, pp. 133-143
-
-
Vintsyuk, T.K.1
-
132
-
-
0001891171
-
Verbmobil: Translation of Face-to-Face Dialogs
-
Berlin, Germany, Plenary
-
W. Wahlster, Verbmobil: Translation of Face-to-Face Dialogs, Proc. ESCA Eurospeech’93, Berlin, Germany, Plenary, 29-38, September 1993.
-
(1993)
Proc. ESCA Eurospeech’93
, pp. 29-38
-
-
Wahlster, W.1
-
133
-
-
0012327341
-
Multilinguality in Speech and Spoken Language Systems
-
A. Waibel, P Geutner, L. Mayfield Tomokiyo, T. Schultz and M. Woszczyna, Multilinguality in Speech and Spoken Language Systems, Proceedings of the IEEE, Special Issue on Spoken Language Processing, 88(8):1297-1313, August 2000.
-
(2000)
Proceedings of the IEEE, Special Issue on Spoken Language Processing
, vol.88
, Issue.8
, pp. 1297-1313
-
-
Waibel, A.1
Geutner, P.2
Mayfield Tomokiyo, L.3
Schultz, T.4
Woszczyna, M.5
-
134
-
-
0032678104
-
Probabilistic Models for Topic Detection and Tracking
-
Phoenix, AZ
-
F. Walls, H. Jin, S. Sista and R. Schwartz, Probabilistic Models for Topic Detection and Tracking, Proc. IEEE ICASSP-99,1:521-524, Phoenix, AZ, March 1999
-
(1999)
Proc. IEEE ICASSP-99
, vol.1
, pp. 521-524
-
-
Walls, F.1
Jin, H.2
Sista, S.3
Schwartz, R.4
-
135
-
-
0002707166
-
Dragon Systems’ 1997 Broadcast News Transcription System
-
Landsdowne, VA
-
S. Wegmann, F. Scattone, I. Carp, L. Gillick, R. Roth and J. Yamron, Dragon Systems’ 1997 Broadcast News Transcription System, Proc. Darpa Broadcast News Transcription &Understanding Workshop, 60-65, Landsdowne, VA, February 1998
-
(1998)
Proc. Darpa Broadcast News Transcription &Understanding Workshop
, pp. 60-65
-
-
Wegmann, S.1
Scattone, F.2
Carp, I.3
Gillick, L.4
Roth, R.5
Yamron, J.6
-
136
-
-
0032657771
-
Progress in Broadcast News Transcription at Dragon Systems
-
Phoenix, AZ
-
S. Wegmann, P Zhan, and L. Gillick, Progress in Broadcast News Transcription at Dragon Systems, Proc IEEE ICASSP-99, 33-36, Phoenix, AZ, March 1999
-
(1999)
Proc IEEE ICASSP-99
, pp. 33-36
-
-
Wegmann, S.1
Zhan, P.2
Gillick, L.3
-
137
-
-
0030706666
-
NeuralNetwork based Measures of Confidence for Word Recognition
-
Munich, Germany
-
M. Weintraub, F. Beaufays, Z. Rivlin, Y. Konig and A. Stolcke, NeuralNetwork based Measures of Confidence for Word Recognition, Proc. IEEE ICASSP-97, 887-890, Munich, Germany, April 1997.
-
(1997)
Proc. IEEE ICASSP-97
, pp. 887-890
-
-
Weintraub, M.1
Beaufays, F.2
Rivlin, Z.3
Konig, Y.4
Stolcke, A.5
-
138
-
-
0031630644
-
Using word probabilities as confidence measures
-
Seattle, WA
-
F. Wessel, K. Macherey and R. Schlüter, Using word probabilities as confidence measures, Proc. IEEE ICASSP-98, 225-228, Seattle, WA, May 1998.
-
(1998)
Proc. IEEE ICASSP-98
, pp. 225-228
-
-
Wessel, F.1
Macherey, K.2
Schlüter, R.3
-
139
-
-
84962920544
-
Unsupervised training of acoustic models for large vocabulary continuous speech recognition
-
Madonna di Campiglio, Italy
-
F. Wessel and H. Ney, Unsupervised training of acoustic models for large vocabulary continuous speech recognition, Proc. IEEE ASRU’01, Madonna di Campiglio, Italy, December 2001.
-
(2001)
Proc. IEEE ASRU’01
-
-
Wessel, F.1
Ney, H.2
-
140
-
-
0026187945
-
The Zero Frequency problem: Estimating the problems of Novel Events in Adaptive tex Compression
-
I.H. Witten and T. C. Bell, The Zero Frequency problem: Estimating the problems of Novel Events in Adaptive tex Compression, Proc. IEEE Trans. On Information Theory, 37(1):1085-1094, July 1991.
-
(1991)
Proc. IEEE Trans. On Information Theory
, vol.37
, Issue.1
, pp. 1085-1094
-
-
Witten, I.H.1
Bell, T.C.2
-
141
-
-
0036461035
-
Large scale discriminative training of hidden Markov models for speech recognition
-
P C. Woodland and D. Povey, Large scale discriminative training of hidden Markov models for speech recognition, Computer, Speech and Language, 16(1):25-47, January 2002.
-
(2002)
Computer, Speech and Language
, vol.16
, Issue.1
, pp. 25-47
-
-
Woodland, P.C.1
Povey, D.2
-
142
-
-
0001393274
-
The development of the 1994 HTK large vocabulary speech recognition system
-
Austin, TX
-
P C. Woodland, C. J. Leggetter, J.J. Odell, V. Valtchev and S. J. Young, The development of the 1994 HTK large vocabulary speech recognition system, Proc. ARPA Spoken Language Systems Technology Workshop, 104-109, Austin, TX, January 1995.
-
(1995)
Proc. ARPA Spoken Language Systems Technology Workshop
, pp. 104-109
-
-
Woodland, P.C.1
Leggetter, C.J.2
Odell, J.J.3
Valtchev, V.4
Young, S.J.5
-
143
-
-
0002452931
-
The HTK large vocabulary recognition system for the 1995 ARPA H3 task
-
Harriman, NY
-
P C. Woodland, M. J.F. Gales, D. Pye and V. Valtchev, The HTK large vocabulary recognition system for the 1995 ARPA H3 task, Proc. ARPA Speech Recognition Workshop, 99-104, Harriman, NY, February 1996.
-
(1996)
Proc. ARPA Speech Recognition Workshop
, pp. 99-104
-
-
Woodland, P.C.1
Gales, M.J.F.2
Pye, D.3
Valtchev, V.4
-
144
-
-
0031624946
-
A Hidden Markov Approach to Text Segmentation and Event Tracking
-
Seattle, WA
-
J.P. Yamron, I. Carp, L. Gillick, S. Lowe and P. van Mulbregt, A Hidden Markov Approach to Text Segmentation and Event Tracking, Proc IEEE ICASSP-98,1:333-336, Seattle, WA, May 1998.
-
(1998)
Proc IEEE ICASSP-98
, vol.1
, pp. 333-336
-
-
Yamron, J.P.1
Carp, I.2
Gillick, L.3
Lowe, S.4
Van Mulbregt, P.5
-
145
-
-
0030244826
-
A Review of Large-Vocabulary Continuous Speech Recognition
-
S. J. Young, A Review of Large-Vocabulary Continuous Speech Recognition, IEEE Signal Processing Magazine, 13(5):45-57, September 1996.
-
(1996)
IEEE Signal Processing Magazine
, vol.13
, Issue.5
, pp. 45-57
-
-
Young, S.J.1
-
146
-
-
0030718943
-
Multilingual large vocabulary speech recognition: The European SQALE project
-
S.J. Young, M. Adda-Decker, X. Aubert, C. Dugast, J. L. Gauvain, D.J. Kershaw, L Lamel, D A Leeuwen, D Pye, H J M Steeneken, A J Robinson and P C. Woodland, Multilingual large vocabulary speech recognition: the European SQALE project, Computer Speech &Language, 11(1):73-89, January 1997
-
(1997)
Computer Speech &Language
, vol.11
, Issue.1
, pp. 73-89
-
-
Young, S.J.1
Adda-Decker, M.2
Aubert, X.3
Dugast, C.4
Gauvain, J.L.5
Kershaw, D.J.6
Lamel, L.7
Leeuwen, D.A.8
Pye, D.9
Steeneken, H.J.M.10
Robinson, A.J.11
Woodland, P.C.12
-
147
-
-
0032181247
-
Speech recognition evaluation: A review of the U.S. CSR andLVCSR programmes
-
S.J. Young and L. Chase, Speech recognition evaluation: a review of the U.S. CSR andLVCSR programmes, Computer Speech &Language, 12(4):263-279, October 1998
-
(1998)
Computer Speech &Language
, vol.12
, Issue.4
, pp. 263-279
-
-
Young, S.J.1
Chase, L.2
-
148
-
-
0002144369
-
Tree-Based State Tying for High Accuracy Acoustic Modeling
-
Princeton, NJ
-
S. J. Young, J. J. Odell and P C. Woodland, Tree-Based State Tying for High Accuracy Acoustic Modeling, Proc. ARPA Human Language Technology Workshop, 307-312, Princeton, NJ, March 1994.
-
(1994)
Proc. ARPA Human Language Technology Workshop
, pp. 307-312
-
-
Young, S.J.1
Odell, J.J.2
Woodland, P.C.3
-
149
-
-
85135369802
-
The Use of State Tying in Continuous Speech Recognition
-
Berlin, Germany
-
S.J. Young and P C. Woodland, The Use of State Tying in Continuous Speech Recognition, Proc. ESCA Eurospeech’93, 3:2203-2206, Berlin, Germany, September1993
-
(1993)
Proc. ESCA Eurospeech’93
, vol.3
, pp. 2203-2206
-
-
Young, S.J.1
Woodland, P.C.2
-
150
-
-
0002162027
-
-
DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, VA
-
G. Zavaliagkos and T. Colthurst, Utilizing Untranscribed Training Data to Improve Performance, DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, VA, 301-305, February 1998.
-
(1998)
Utilizing Untranscribed Training Data to Improve Performance
, pp. 301-305
-
-
Zavaliagkos, G.1
Colthurst, T.2
-
151
-
-
0029745232
-
Maximum a Posteriori Adaptation for Large Scale HMM Recognizers
-
Detroit, MI
-
G. Zavaliagkos, R. Schwartz and J. McDonough, Maximum a Posteriori Adaptation for Large Scale HMM Recognizers, Proc IEEE ICASSP-95, 725-728, Detroit, MI, May 1995
-
(1995)
Proc IEEE ICASSP-95
, pp. 725-728
-
-
Zavaliagkos, G.1
Schwartz, R.2
McDonough, J.3
-
152
-
-
85121123643
-
The MITSummit Speech Recognition System: A Progress Report
-
Philadelphia, PA
-
V. Zue, J. Glass, M. Phillips and S. Seneff, The MITSummit Speech Recognition System: A Progress Report, Proc DARPA Speech &Natural Language Workshop, 179-189, Philadelphia, PA, February 1989
-
(1989)
Proc DARPA Speech &Natural Language Workshop
, pp. 179-189
-
-
Zue, V.1
Glass, J.2
Phillips, M.3
Seneff, S.4
|