SCOPUS 정보 검색 플랫폼

Pattern Recognition in Speech and Language Processing

Volumn , Issue , 2003, Pages 149-190

Large vocabulary speech recognition based on statistical methods

(2) Gauvain, Jean Luc a Lamel, Lori a

a CNRS (France)

Author keywords

[No Author keywords available]

Indexed keywords

EID: 33645586060 PISSN: None EISSN: None Source Type: Book
DOI: 10.1201/9780203010525 Document Type: Chapter

Times cited : (10)

References (152)

1
- 85055771355
- http://coretex.itc.it

2
- 0342321463
- The THISL Broadcast News Retrieval System
- 14-19, Cambridge, U.K
- D. Abberley, D. Kirby, S. Renais and T. Robinson, The THISL Broadcast News Retrieval System, Proc. ESCA ETRW on Accessing Information in Spoken Audio, 14-19, Cambridge, U.K., April 1999.
- (1999) Proc. ESCA ETRW on Accessing Information in Spoken Audio
- Abberley, D.¹ Kirby, D.² Renais, S.³ Robinson, T.⁴

3
- 0141629128
- Experiments in Vocal Tract Normalization
- A. Andreoum, T. Kamm and J. Cohen, Experiments in Vocal Tract Normalization, Proc. CAIP Workshop: Frontiers in Speech Recognition II, 1994.
- (1994) Proc. CAIP Workshop: Frontiers in Speech Recognition II
- Andreoum, A.¹ Kamm, T.² Cohen, J.³

4
- 0030362995
- A Compact Model for Speaker Adaptation Training
- Philadelphia, PA
- T. Anastasakos, J. McDonough, R. Schwartz and J. Makhoul, A Compact Model for Speaker Adaptation Training, Proc. ICSLP’96, 1137-1140, Philadelphia, PA, October 1996.
- (1996) Proc. ICSLP’96 , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

5
- 85009118347
- One Pass Cross Word Decoding for Large Vocabularies Based on a Lexical Tree Search Organization
- Budapest, Hungary
- X. Aubert, One Pass Cross Word Decoding for Large Vocabularies Based on a Lexical Tree Search Organization, Proc. ESCA Eurospeech’99, 4:1559-1562, Budapest, Hungary, September 1999.
- (1999) Proc. ESCA Eurospeech’99 , vol.4 , pp. 1559-1562
- Aubert, X.¹

6
- 0026382117
- The Forward-Backward Search Strategy for Real-Time Speech Recognition
- Toronto, Canada
- S. Austin, R. Schwartz and P. Placeway, The Forward-Backward Search Strategy for Real-Time Speech Recognition, Proc. IEEE ICASSP-91, 697-700, Toronto, Canada, May 1991.
- (1991) Proc. IEEE ICASSP-91 , pp. 697-700
- Austin, S.¹ Schwartz, R.² Placeway, P.³

7
- 85006228776
- Preliminary results on the performance of a system for the automatic recognition of continuous speech
- Philadelphia, PA
- L.R. Bahl, J.K. Baker, P.S. Cohen, N.R. Dixon, F. Jelinek, R.L. Mercer and H.F. Silverman, Preliminary results on the performance of a system for the automatic recognition of continuous speech, Proc. IEEE ICASSP-76, Philadelphia, PA, April 1976.
- (1976) Proc. IEEE ICASSP-76
- Bahl, L.R.¹ Baker, J.K.² Cohen, P.S.³ Dixon, N.R.⁴ Jelinek, F.⁵ Mercer, R.L.⁶ Silverman, H.F.⁷

8
- 0023725866
- Acoustic Markov Models used in the Tangora Speech Recognition System
- New York, NY
- L.R. Bahl, P. Brown, P. de Souza, R.L. Mercer and M. Picheny, Acoustic Markov Models used in the Tangora Speech Recognition System, Proc. IEEE ICASSP-88 1:497-500, New York, NY, April 1988.
- (1988) Proc. IEEE ICASSP-88 1 , pp. 497-500
- Bahl, L.R.¹ Brown, P.² De Souza, P.³ Mercer, R.L.⁴ Picheny, M.⁵

9
- 0020719320
- A Maximum Likelihood Approach to Continuous Speech Recognition
- L.R. Bahl, F. Jelinek and R.L. Mercer, A Maximum Likelihood Approach to Continuous Speech Recognition, IEEE Trans. Pattern Analysis &Machine Intelligence, PAMI-5(2):179-190, March 1983.
- (1983) IEEE Trans. Pattern Analysis &Machine Intelligence , vol.PAMI-5 , Issue.2 , pp. 179-190
- Bahl, L.R.¹ Jelinek, F.² Mercer, R.L.³

10
- 77949374939
- A Fast Match for Continuous Speech Recognition Using Allophonic Models
- San Francisco, CA
- L.R. Bahl, P.V. de Souza, P.S. Gopalakrishnan, D. Nahamoo and M. Picheny, A Fast Match for Continuous Speech Recognition Using Allophonic Models, Proc. IEEE ICASSP-92, CA, 1:17-21, San Francisco, CA, March 1992.
- (1992) Proc. IEEE ICASSP-92 , vol.1 , pp. 17-21
- Bahl, L.R.¹ De Souza, P.V.² Gopalakrishnan, P.S.³ Nahamoo, D.⁴ Picheny, M.⁵

11
- 33646933249
- Large Vocabulary Recognition ofWall Street Journal Sentences at Dragon Systems
- Harriman, NY
- J. Baker, J. Baker, P. Bamberg, K. Bishop, L. Gillick, V. Helman, Z. Huang, Y. Ito, S. Lowe, B. Peskin, R. Roth and F. Scattone, Large Vocabulary Recognition ofWall Street Journal Sentences at Dragon Systems, Proc. DARPA Speech &Natural Language Workshop, 387-392, Harriman, NY, February 1992.
- (1992) Proc. DARPA Speech &Natural Language Workshop , pp. 387-392
- Baker, J.¹ Baker, J.² Bamberg, P.³ Bishop, K.⁴ Gillick, L.⁵ Helman, V.⁶ Huang, Z.⁷ Ito, Y.⁸ Lowe, S.⁹ Peskin, B.¹⁰ Roth, R.¹¹ Scattone, F.¹²

12
- 0000353178
- A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
- Baum, L.E., T. Petrie, G. Soules, and N. Weiss, A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains, Ann. Math. Stat. 41:164-171,1970.
- (1970) Ann. Math. Stat. , vol.41 , pp. 164-171
- Baum, L.E.¹ Petrie, T.² Soules, G.³ Weiss, N.⁴

13
- 0027297381
- Vector quantization for efficient computation of continuous density likelihoods
- Minneapolis, MN
- E. Bocchieri, Vector quantization for efficient computation of continuous density likelihoods, Proc. IEEE ICASSP-93, 2:692-695, Minneapolis, MN, May 1993.
- (1993) Proc. IEEE ICASSP-93 , vol.2 , pp. 692-695
- Bocchieri, E.¹

14
- 0033693013
- A Baseline for the Transcription ofItalian Broadcast News
- Istanbul, Turkey
- F. Brugnara, M. Cettolo, M. Federico and D. Giuliani, A Baseline for the Transcription ofItalian Broadcast News, Proc. IEEE ICASSP-00, Istanbul, Turkey, June 2000.
- (2000) Proc. IEEE ICASSP-00
- Brugnara, F.¹ Cettolo, M.² Federico, M.³ Giuliani, D.⁴

15
- 85135168075
- Word and acoustic confidence annotation for large vocabulary speech recognition
- Rhodes, Greece
- L. Chase, Word and acoustic confidence annotation for large vocabulary speech recognition, Proc. ESCA Eurospeech’97, 815-818, Rhodes, Greece, September 1997.
- (1997) Proc. ESCA Eurospeech’97 , pp. 815-818
- Chase, L.¹

16
- 33646916912
- Improvements in Language, Lexical and Phonetic Modeling in Sphinx-II
- Austin, TX
- L. Chase, R. Rosenberg, A. Hauptmann, M. Ravishankar, E. Thayer, P. Placeway, R. Weide and C. Lu, Improvements in Language, Lexical and Phonetic Modeling in Sphinx-II, Proc. ARPA Spoken Language Systems Technology Workshop, 60-65, Austin, TX, January 1995.
- (1995) Proc. ARPA Spoken Language Systems Technology Workshop , pp. 60-65
- Chase, L.¹ Rosenberg, R.² Hauptmann, A.³ Ravishankar, M.⁴ Thayer, E.⁵ Placeway, P.⁶ Weide, R.⁷ Lu, C.⁸

17
- 0033329799
- An empirical study of smoothing techniques for language modeling
- S.F. Chen and J. Goodman, An empirical study of smoothing techniques for language modeling, Computer, Speech &Language, 13(4):359-394, October 1999.
- (1999) Computer, Speech &Language , vol.13 , Issue.4 , pp. 359-394
- Chen, S.F.¹ Goodman, J.²

18
- 0002595416
- Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion
- Landsdowne, VA
- S.S. Chen and P.S. Gopalakrishnan, Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion, Proc. Darpa Broadcast News Transcription &Understanding Workshop, 127-132, Landsdowne, VA, February 1998.
- (1998) Proc. D , pp. 127-132
- Chen, S.S.¹ Gopalakrishnan, P.S.²

19
- 0022859679
- The Role of Word-Dependent Coartic-ulatory Effects in a Phoneme-Based Speech Recognition System
- Tokyo, Japan
- Y.L. Chow, R. Schwartz, S. Roukos, O. Kimball, P Price, F. Kubala, M. O. Dunham, M Krasner and J Makhoul, The Role of Word-Dependent Coartic-ulatory Effects in a Phoneme-Based Speech Recognition System, Proc. IEEE ICASSP-86, 3:1593-1596, Tokyo, Japan, April 1986.
- (1986) Proc. IEEE ICASSP-86 , vol.3 , pp. 1593-1596
- Chow, Y.L.¹ Schwartz, R.² Roukos, S.³ Kimball, O.⁴ Price, P.⁵ Kubala, F.⁶ Dunham, M.O.⁷ Krasner, M.⁸ Makhoul, J.⁹

20
- 85118743743
- Statistical Language Modelling using CMU-Cambridge Toolkit
- Rhodes, Greece
- P Clarkson and R. Rosenfeld, Statistical Language Modelling using CMU-Cambridge Toolkit, Proc. ESCA EuroSpeech’97, 2707-2710, Rhodes, Greece, September 1997
- (1997) Proc. ESCA EuroSpeech’97 , pp. 2707-2710
- Clarkson, P.¹ Rosenfeld, R.²

21
- 0019053271
- Comparison of Parametric Representations of Monosyllabic Word Recognition in Continuously Spoken Sentences
- S. Davis and P Mermelstein, Comparison of Parametric Representations of Monosyllabic Word Recognition in Continuously Spoken Sentences, IEEE Trans. Acoustics, Speech, &Signal Processing, 28(4):357-366,1980.
- (1980) IEEE Trans. Acoustics, Speech, &Signal Processing , vol.28 , Issue.4 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

22
- 0002629270
- Maximum Likelihood from Incomplete Data via the EM Algorithm
- Dempster, A.P., M.M. Laird and D.B. Rubin, Maximum Likelihood from Incomplete Data via the EM Algorithm, Journal of the Royal Statistical Society Series B (methodological), 39:1-38,1977.
- (1977) Journal of the Royal Statistical Society Series B (Methodological) , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, M.M.² Rubin, D.B.³

23
- 30244578066
- Human Speech Recognition Performance on the 1995 CSR Hub-3 Corpus
- Harriman, NY
- N. Deshmukh, A. Ganapathiraju, R.J. Duncan and J. Picone, Human Speech Recognition Performance on the 1995 CSR Hub-3 Corpus Proc. ARPA Speech Recognition Workshop, 129-134, Harriman, NY, February 1996.
- (1996) Proc. ARPA Speech Recognition Workshop , pp. 129-134
- Deshmukh, N.¹ Ganapathiraju, A.² Duncan, R.J.³ Picone, J.⁴

24
- 33646934294
- Genones: Optimization the Degree of Tying in a Large Vocabulary HMM-based Speech Recognizer
- Adelaide, Australia
- V. Digalakis and H. Murveit, Genones: Optimization the Degree of Tying in a Large Vocabulary HMM-based Speech Recognizer, Proc. IEEE ICASSP-94, 1:537-540, Adelaide, Australia, April 1994.
- (1994) Proc. IEEE ICASSP-94 , vol.1 , pp. 537-540
- Digalakis, V.¹ Murveit, H.²

25
- 0029375590
- Speaker adaptation using constrained estimation of Gaussian mixtures
- V. Digalakis, D. Rtichev and L.G. Neumeyer, Speaker adaptation using constrained estimation of Gaussian mixtures, IEEE Trans. On Speech &Audio, 3(5):357-366, September 1995.
- (1995) IEEE Trans. On Speech &Audio , vol.3 , Issue.5 , pp. 357-366
- Digalakis, V.¹ Rtichev, D.² Neumeyer, L.G.³

26
- 84899708867
- Sonograph and Sound Mechanics
- J. Dreyfus-Graf, Sonograph and Sound Mechanics, J. Acoust. Soc. America, 22:731, 1949.
- (1949) J. Acoust. Soc. America , vol.22 , pp. 731
- Dreyfus-Graf, J.¹

27
- 0141515360
- Automatic Recognition of Phonetic Patterns in Speech
- H. Dudley and S. Balashek, Automatic Recognition of Phonetic Patterns in Speech, J. Acoust. Soc. America, 30:721, 1958.
- (1958) J. Acoust. Soc. America , vol.30 , pp. 721
- Dudley, H.¹ Balashek, S.²

28
- 15844378911
- Human Speech Recognition Performance on the 1994 CSR Spoke 10 Corpus
- Austin, TX
- W.J. Ebel and J. Picone, Human Speech Recognition Performance on the 1994 CSR Spoke 10 Corpus, Proc. ARPA Spoken Language Systems Technology Workshop, 53-59, Austin, TX, January 1995.
- (1995) Proc. ARPA Spoken Language Systems Technology Workshop , pp. 53-59
- Ebel, W.J.¹ Picone, J.²

29
- 0019583902
- Comparison of speaker recognition methods using statistical features and dynamic features
- S. Furui, Comparison of speaker recognition methods using statistical features and dynamic features, IEEE Trans. On Acoustics, Speech &Signal Processing, ASSP-29, 342-350, 1981.
- (1981) IEEE Trans. On Acoustics, Speech &Signal Processing , vol.ASSP-29 , pp. 342-350
- Furui, S.¹

30
- 85017310148
- An improved approach to hidden Markov model decomposition of speech and noise
- San Francisco, CA
- M.J.F. Gales and S.J. Young, An improved approach to hidden Markov model decomposition of speech and noise, Proc. IEEE ICASSP-92, 233-236, San Francisco, CA, March 1992.
- (1992) Proc. IEEE ICASSP-92 , pp. 233-236
- Gales, M.J.F.¹ Young, S.J.²

31
- 0029390135
- Robust Continuous Speech Recognition using Parallel Model Combination
- M.J.F. Gales and S.J. Young, Robust Continuous Speech Recognition using Parallel Model Combination, Computer Speech &Language, 9(4):289-307, October 1995.
- (1995) Computer Speech &Language , vol.9 , Issue.4 , pp. 289-307
- Gales, M.J.F.¹ Young, S.J.²

32
- 85128364359
- Cluster Adaptive Training for Speech Recognition
- Sydney, Australia
- M.J.F. Gales, Cluster Adaptive Training for Speech Recognition, Proc. IC-SLP’98, 1783-1786, Sydney, Australia, November 1998.
- (1998) Proc. IC-SLP’98 , pp. 1783-1786
- Gales, M.J.F.¹

33
- 0032638856
- Semi-Tied Covariance Matrices for Hidden Markov Models
- M.J.F. Gales, Semi-Tied Covariance Matrices for Hidden Markov Models, IEEE Trans. On Speech and Audio, 7(3):273-281, May 1999.
- (1999) IEEE Trans. On Speech and Audio , vol.7 , Issue.3 , pp. 273-281
- Gales, M.J.F.¹

34
- 0001893347
- Transcribing Broadcast News: The LIMSI Nov96 Hub4 System
- Chantilly, VA
- J.L. Gauvain, G. Adda, L. Lamel and M. Adda-Decker, Transcribing Broadcast News: The LIMSI Nov96 Hub4 System, Proc. ARPA Speech Recognition Workshop, 56-63, Chantilly, VA, February 1997.
- (1997) Proc. ARPA Speech Recognition Workshop , pp. 56-63
- Gauvain, J.L.¹ Adda, G.² Lamel, L.³ Adda-Decker, M.⁴

35
- 0001790691
- Spoken Language component of the MASK Kiosk
- K. Varghese, S. Pfleger(Eds.), Springer-Verlag, 1997. Also in Proc. Human Comfort and Security Workshop, Brussels, Belguim
- J.L. Gauvain, S. Bennacef, L. Devillers, L. Lamel and R. Rosset, Spoken Language component of the MASK Kiosk in K. Varghese, S. Pfleger(Eds.) Human Comfort and security of information systems, Springer-Verlag, 1997. Also in Proc. Human Comfort and Security Workshop, Brussels, Belguim, October 1995.
- (1995) Human Comfort and Security of Information Systems
- Gauvain, J.L.¹ Bennacef, S.² Devillers, L.³ Lamel, L.⁴ Rosset, R.⁵

36
- 0030374902
- Speech Recognition for an Information Kiosk
- Philadelphia, PA
- J.L. Gauvain, J.J. Gangolf, and L. Lamel, Speech Recognition for an Information Kiosk, Proc. ICSLP’96, 849-852, Philadelphia, PA, October 1996.
- (1996) Proc. ICSLP’96 , pp. 849-852
- Gauvain, J.L.¹ Gangolf, J.J.² Lamel, L.³

37
- 85128356454
- Partitioning and Transcription of Broadcast News Data
- Sydney, Australia
- J.L. Gauvain, L. Lamel and G. Adda, Partitioning and Transcription of Broadcast News Data, Proc. ICSLP’98, 5:1335-1338, Sydney, Australia, December 1998.
- (1998) Proc. ICSLP’98 , vol.5 , pp. 1335-1338
- Gauvain, J.L.¹ Lamel, L.² Adda, G.³

38
- 0028996849
- Developments in Continuous Speech Dictation using the ARPA WSJ Task
- Detroit, MI
- J.L. Gauvain, L.F. Lamel and M. Adda-Decker, Developments in Continuous Speech Dictation using the ARPA WSJ Task, Proc. IEEE ICASSP-95, 65-68, Detroit, MI, May 1995.
- (1995) Proc. IEEE ICASSP-95 , pp. 65-68
- Gauvain, J.L.¹ Lamel, L.F.² Adda-Decker, M.³

39
- 0028419019
- Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains
- J.L. Gauvain and C.H. Lee, Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains, IEEE Trans. Speech &Audio Processing, 2(2):291-298, April 1994.
- (1994) IEEE Trans. Speech &Audio Processing , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.L.¹ Lee, C.H.²

40
- 0036567851
- The LIMSI Broadcast News Transcription System
- J.L. Gauvain, L. Lamel and G. Adda, The LIMSI Broadcast News Transcription System, Speech Communication, 37(1-2):89-108, May 2002.
- (2002) Speech Communication , vol.37 , Issue.1-2 , pp. 89-108
- Gauvain, J.L.¹ Lamel, L.² Adda, G.³

41
- 3643049373
- A Rapid Match Algorithm for Continuous Speech Recognition
- Hidden Valley, PA
- L. Gillick and R. Roth, A Rapid Match Algorithm for Continuous Speech Recognition, Proc. DARPA Speech &Natural Language Workshop, 170-172, Hidden Valley, PA, June 1990.
- (1990) Proc. DARPA Speech &Natural Language Workshop , pp. 170-172
- Gillick, L.¹ Roth, R.²

42
- 0030648371
- A Probabilistic Approach to Confidence Measure Estimation and Evaluation
- Munich, Germany
- L. Gillick, Y. Ito and J. Young, A Probabilistic Approach to Confidence Measure Estimation and Evaluation, Proc. IEEE ICASSP-97, 879-882, Munich, Germany, April 1997.
- (1997) Proc. IEEE ICASSP-97 , pp. 879-882
- Gillick, L.¹ Ito, Y.² Young, J.³

43
- 0032665631
- Real-time Telephone-based Speech Recognition in the Jupiter Domain
- Phoenix, AZ
- J.R. Glass, T.J. Hazen and I. L. Hetherington, Real-time Telephone-based Speech Recognition in the Jupiter Domain, Proc. IEEE ICASSP-99, 1:61-64, Phoenix, AZ, March 1999.
- (1999) Proc. IEEE ICASSP-99 , vol.1 , pp. 61-64
- Glass, J.R.¹ Hazen, T.J.² Hetherington, I.L.³

44
- 85016587886
- SWITCHBOARD: Telephone Speech Corpus for Research and Development
- San Francisco, CA
- J. Godfrey, E. Holliman and J. McDaniel, SWITCHBOARD: Telephone Speech Corpus for Research and Development, Proc. IEEE ICASSP-92, 517-520, San Francisco, CA, March 1992.
- (1992) Proc. IEEE ICASSP-92 , pp. 517-520
- Godfrey, J.¹ Holliman, E.² McDaniel, J.³

45
- 0000803388
- The Population Frequencies of Species and the Estimation of Population Parameters
- I.J. Good, The Population Frequencies of Species and the Estimation of Population Parameters, Biomterika, 40(3/4):237-264,1953.
- (1953) Biomterika , vol.40 , Issue.3-4 , pp. 237-264
- Good, I.J.¹

46
- 0028996969
- A tree search strategy for large-vocabulary continuous speech recognition
- Detroit, MI
- P.S. Gopalakrishnan, L.R. Bahl and R.L. Mercer, A tree search strategy for large-vocabulary continuous speech recognition, Proc. IEEE ICASSP-95, 1:572-575, Detroit, MI, May 1995.
- (1995) Proc. IEEE ICASSP-95 , vol.1 , pp. 572-575
- Gopalakrishnan, P.S.¹ Bahl, L.R.² Mercer, R.L.³

47
- 85017287487
- Linear Discriminant Analysis for Improved Large Vocabulary Continuous Speech Recognition
- R. Haeb-Umbach and H. Ney, Linear Discriminant Analysis for Improved Large Vocabulary Continuous Speech Recognition, Proc. ICASSP-92, 1:1316, March 1992.
- (1992) Proc. ICASSP-92 , vol.1 , pp. 1316
- Haeb-Umbach, R.¹ Ney, H.²

48
- 0002751623
- Segment Generation and Clustering in the HTK Broadcast News Transcription System
- Landsdowne, VA
- T. Hain, S.E. Johnson, A. Tuerk, P.C. Woodland and S.J. Young, Segment Generation and Clustering in the HTK Broadcast News Transcription System, Proc. Darpa Broadcast News Transcription &Understanding Workshop, 133-137, Landsdowne, VA, February 1998.
- (1998) Proc. Darpa Broadcast News Transcription &Understanding Workshop , pp. 133-137
- Hain, T.¹ Johnson, S.E.² Tuerk, A.³ Woodland, P.C.⁴ Young, S.J.⁵

49
- 77952268007
- Digital Libraries Magazine, September
- A.G. Hauptmann, M. Witbrock and M. Christel, News-on-Demand-’An Application of Informedia Technology’, Digital Libraries Magazine, September 1995.
- (1995) News-on-Demand-’An Application of Informedia Technology’
- Hauptmann, A.G.¹ Witbrock, M.² Christel, M.³

50
- 0002384527
- The ATIS Spoken Language Systems Pilot Corpus
- Pittsburgh, PA
- C.T. Hemphill, J.J. Godfrey, and G.R. Doddington, The ATIS Spoken Language Systems Pilot Corpus, Proc. Darpa Speech &Natural Language Workshop, Pittsburgh, PA, June 1990.
- (1990) Proc. Darpa Speech &Natural Language Workshop
- Hemphill, C.T.¹ Godfrey, J.J.² Doddington, G.R.³

51
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- H. Hermansky, Perceptual linear predictive (PLP) analysis of speech, J. Acoust. Soc. America, 87(4):1738-1752,1990.
- (1990) J. Acoust. Soc. America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

52
- 0002384092
- Large vocabulary continuous speech recognition using a hybrid connectionist-HMM system
- Yokohama, Japan
- M.M. Hochberg, S.J. Renals, A.J. Robinson and D. Kershaw, Large vocabulary continuous speech recognition using a hybrid connectionist-HMM system, Proc. ICSLP’94, 1499-1502, Yokohama, Japan, September 1994.
- (1994) Proc. ICSLP’94 , pp. 1499-1502
- Hochberg, M.M.¹ Renals, S.J.² Robinson, A.J.³ Kershaw, D.⁴

53
- 85055768776
- Chapter 1.3 of the State of the Art in Human Language Technology, (Cole et al, eds.)
- M.J. Hunt, Signal Representation, Chapter 1.3 of the State of the Art in Human Language Technology, (Cole et al, eds.), 1996. (http://www.cse.ogi.edu/CSLU/HLTsurvey/ch1node2.html)
- (1996) Signal Representation
- Hunt, M.J.¹

54
- 85015539783
- Subphonetic Modeling with Markov States - Senone
- San Francisco, CA
- M. Hwang and X. Huang, Subphonetic Modeling with Markov States - Senone, Proc. IEEE ICASSP-92,1:33-36, San Francisco, CA, March 1992.
- (1992) Proc. IEEE ICASSP-92 , vol.1 , pp. 33-36
- Hwang, M.¹ Huang, X.²

55
- 0027153655
- Predicting Unseen Triphones with Senones
- Minneapolis, MN
- M.Y. Hwang, X. Huang and F. Alleva, Predicting Unseen Triphones with Senones, Proc. IEEE ICASSP-93, II:311-314, Minneapolis, MN, April 1993.
- (1993) Proc. IEEE ICASSP-93 , vol.2 , pp. 311-314
- Hwang, M.Y.¹ Huang, X.² Alleva, F.³

56
- 0016939124
- Continuous Speech Recognition by Statistical Methods
- F. Jelinek, Continuous Speech Recognition by Statistical Methods, Proc. Of the IEEE, 64(4): 532-556, April 1976.
- (1976) Proc. Of the IEEE , vol.64 , Issue.4 , pp. 532-556
- Jelinek, F.¹

57
- 0003786003
- Cambridge: MIT Press
- F. Jelinek, Statistical Methods for Speech Recognition, Cambridge: MIT Press, 1997.
- (1997) Statistical Methods for Speech Recognition
- Jelinek, F.¹

58
- 0012357341
- A Dynamic Language Model for Speech Recognition
- Pacific Grove, CA
- F. Jelinek, B. Merialdo, S. Roukos and M. Strauss, A Dynamic Language Model for Speech Recognition, Proc. DARPA Speech &Natural Language Workshop, 293-295, Pacific Grove, CA, February 1991.
- (1991) Proc. DARPA Speech &Natural Language Workshop , pp. 293-295
- Jelinek, F.¹ Merialdo, B.² Roukos, S.³ Strauss, M.⁴

59
- 85055786864
- Toulouse, France
- F. deJong, J.L. Gauvain, J. deb Hartog and K. Netter, Olive: Speech Based Video Retrieval, Proc. CBMI’99, Toulouse, France, October 1999.
- (1999) Olive: Speech Based Video Retrieval, Proc. CBMI’99
- Dejong, F.¹ Gauvain, J.L.² Deb Hartog, J.³ Netter, K.⁴

60
- 0022097649
- AT&T Technical Journal
- Juang, B.-H., Maximum-Likelihood Estimation for Mixture Multivariate Stochastic Observations of Markov Chains, AT&T Technical Journal, 64(6), 1985.
- (1985) Maximum-Likelihood Estimation for Mixture Multivariate Stochastic Observations of Markov Chains , vol.64 , Issue.6
- Juang, B.-H.¹

61
- 0023312404
- Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer
- S.M. Katz, Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer, IEEE Trans. Acoustics, Speech &Signal Processing, ASSP-35(3):400-401, March 1987.
- (1987) IEEE Trans. Acoustics, Speech &Signal Processing , vol.ASSP-35 , Issue.3 , pp. 400-401
- Katz, S.M.¹

62
- 85135261720
- Unsupervised Training of a Speech Recognizer: Recent Experiments
- Budapest, Hungary
- T. Kemp and A. Waibel, Unsupervised Training of a Speech Recognizer: Recent Experiments, Proc. ESCA Eurospeech’99, 6:2725-2728, Budapest, Hungary, September 1999.
- (1999) Proc. ESCA Eurospeech’99 , vol.6 , pp. 2725-2728
- Kemp, T.¹ Waibel, A.²

63
- 33646908801
- The 1995 Abbot hybrid connectionist-HMM large-vocabulary recognition system
- Harriman, NY
- D. Kershaw, A.J. Robinson and S.J. Renals, The 1995 Abbot hybrid connectionist-HMM large-vocabulary recognition system, Proc. ARPA Speech Recognition Workshop, 93-98, Harriman, NY, February 1996.
- (1996) Proc. ARPA Speech Recognition Workshop , pp. 93-98
- Kershaw, D.¹ Robinson, A.J.² Renals, S.J.³

64
- 85123963268
- Improved Clustering Techniques for Class-Based Statistical Language Modelling
- Berlin, September
- R. Kneser and H. Ney, Improved Clustering Techniques for Class-Based Statistical Language Modelling, Proc. Eurospeech’93, 973-976, Berlin, September 1993.
- (1993) Proc. Eurospeech’93 , pp. 973-976
- Kneser, R.¹ Ney, H.²

65
- 0028996876
- Improved backing-off for n-gram language modeling
- Detroit, MI, May
- R. Kneser and H. Ney, Improved backing-off for n-gram language modeling, Proc. IEEEICASSP-95,1:181-184, Detroit, MI, May 1995.
- (1995) Proc. IEEEICASSP-95 , vol.1 , pp. 181-184
- Kneser, R.¹ Ney, H.²

66
- 0343125611
- Design of the 1994 CSR Benchmark Tests
- Austin, TX, January
- F. Kubala, Design of the 1994 CSR Benchmark Tests, Proc. ARPA Spoken Language Systems Technology Workshop, 41-46, Austin, TX, January 1995.
- (1995) Proc. ARPA Spoken Language Systems Technology Workshop , pp. 41-46
- Kubala, F.¹

67
- 0002519514
- Toward Automatic Recognition of Broadcast News
- Harriman, NY, February
- F. Kubala, T. Anastasakos, H. Jin, J. Makhoul, L. Nguyen, R. Schwartz and N. Yuan, Toward Automatic Recognition of Broadcast News, Proc. Darpa Speech Recognition Workshop, 55-60, Harriman, NY, February 1996.
- (1996) Proc. Darpa Speech Recognition Workshop , pp. 55-60
- Kubala, F.¹ Anastasakos, T.² Jin, H.³ Makhoul, J.⁴ Nguyen, L.⁵ Schwartz, R.⁶ Yuan, N.⁷

68
- 0032289099
- Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition
- December
- N. Kumar and A.G. Andreou, Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition, Speech Communication, 26(4):283-297, December 1998.
- (1998) Speech Communication , vol.26 , Issue.4 , pp. 283-297
- Kumar, N.¹ Andreou, A.G.²

69
- 84871609195
- Eigenvoices for Speaker Adaptation
- Sydney, November
- R. Kuhn, P. Nguyen, J.C. Junqua, L. Goldwasser, N. Niedzielski, S. Fincke, and K. Field, M. Contolini, Eigenvoices for Speaker Adaptation, Proc. IC-SLP’98, 177l-1774, Sydney, November 1998.
- (1998) Proc. IC-SLP’98 , pp. 1771-1774
- Kuhn, R.¹ Nguyen, P.² Junqua, J.C.³ Goldwasser, L.⁴ Niedzielski, N.⁵ Fincke, S.⁶ Field, K.⁷ Contolini, M.⁸

70
- 0030351374
- On Designing Pronunciation Lexicons for Large Vocabulary, Continuous Speech Recognition
- Philadelphia, PA, October
- L.F. Lamel and G. Adda, On Designing Pronunciation Lexicons for Large Vocabulary, Continuous Speech Recognition, Proc. ICSLP’96, 1:6-9, Philadelphia, PA, October 1996.
- (1996) Proc. ICSLP’96 , vol.1 , pp. 6-9
- Lamel, L.F.¹ Adda, G.²

71
- 0011830592
- Speech Recognition of European Languages
- Snowbird, Utah, December
- L.F. Lamel and R. DeMori, Speech Recognition of European Languages, Proc. IEEE Automatic Speech Recognition Workshop, 51-54, Snowbird, Utah, December 1995.
- (1995) Proc. IEEE Automatic Speech Recognition Workshop , pp. 51-54
- Lamel, L.F.¹ Demori, R.²

72
- 0010536626
- Continuous Speech Recognition at LIMSI
- Stanford, CA, September
- L.F. Lamel and J.L. Gauvain, Continuous Speech Recognition at LIMSI, Proc. ARPA Workshop on Continuous Speech Recognition, 59-64, Stanford, CA, September 1992.
- (1992) Proc. ARPA Workshop on Continuous Speech Recognition , pp. 59-64
- Lamel, L.F.¹ Gauvain, J.L.²

73
- 0029219785
- A Phone-based Approach to Non-Linguistic Speech Feature Identification
- January
- L.F. Lamel and J.L. Gauvain, A Phone-based Approach to Non-Linguistic Speech Feature Identification, Computer Speech &Language, 9(1):87-103, January 1995.
- (1995) Computer Speech &Language , vol.9 , Issue.1 , pp. 87-103
- Lamel, L.F.¹ Gauvain, J.L.²

74
- 0036460908
- Lightly Supervised and Unsupervised Acoustic Model Training
- L. Lamel, J.L. Gauvain, and G. Adda, Lightly Supervised and Unsupervised Acoustic Model Training, Computer, Speech &Language, 16(1):115-229, January 2002.
- (2002) Computer, Speech &Language , vol.16 , Issue.1 , pp. 115-229
- Lamel, L.¹ Gauvain, J.L.² Adda, G.³

75
- 85124760678
- Development of Spoken Language Corpora for Travel Information
- Madrid, Spain
- L.F. Lamel, S. Rosset, S.K. Bennacef, H. Bonneau-Maynard, L. Devillers and J.L. Gauvain, Development of Spoken Language Corpora for Travel Information, Proc. ESCA Eurospeech’95, 3:1961-1964, Madrid, Spain, September 1995.
- (1995) Proc. ESCA Eurospeech’95 , vol.3 , pp. 1961-1964
- Lamel, L.F.¹ Rosset, S.² Bennacef, S.K.³ Bonneau-Maynard, H.⁴ Devillers, L.⁵ Gauvain, J.L.⁶

76
- 0003539541
- PhD Thesis, Carnegie Mellon University
- K.-F. Lee, Large-vocabulary speaker-independent continuous speech recognition: The SPHINX system, PhD Thesis, Carnegie Mellon University, 1988.
- (1988) Large-Vocabulary Speaker-Independent Continuous Speech Recognition: The SPHINX System
- Lee, K.-F.¹

77
- 0029747183
- Speaker Normalization Using Efficient Frequency Warping Procedures
- Atlanta, GA, May
- L. Lee and R.C. Rose, Speaker Normalization Using Efficient Frequency Warping Procedures, Proc. IEEE ICASSP-96,1:353-356, Atlanta, GA, May 1996.
- (1996) Proc. IEEE ICASSP-96 , vol.1 , pp. 353-356
- Lee, L.¹ Rose, R.C.²

78
- 0029288633
- Maximum Likelihood Linear Regressionfor Speaker Adaptation of Continuous Density Hidden Markov Models
- C. J. Leggetter and P. C. Woodland, Maximum Likelihood Linear Regressionfor Speaker Adaptation of Continuous Density Hidden Markov Models, Computer Speech &Language, 9(2):171-185, April 1995.
- (1995) Computer Speech &Language , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

79
- 0020180460
- Maximum Likelihood Estimation for Multivariate Observations of Markov Sources
- Liporace, L. R., Maximum Likelihood Estimation for Multivariate Observations of Markov Sources, IEEE Transactions on Information Theory, IT-28(5):729-734,1982.
- (1982) IEEE Transactions on Information Theory , vol.IT-28 , Issue.5 , pp. 729-734
- Liporace, L.R.¹

80
- 0031187171
- Speech recognition by machines and humans
- R. P Lippmann, Speech recognition by machines and humans, Speech Communication, 22(1):1-15, July 1997.
- (1997) Speech Communication , vol.22 , Issue.1 , pp. 1-15
- Lippmann, R.P.¹

81
- 85119434191
- Fast Speaker Change Detection for Broadcast News Transcription and Indexing
- Budapest, Hungary
- D. Liu and F. Kubala, Fast Speaker Change Detection for Broadcast News Transcription and Indexing, Proc. ESCA EuroSpeech’99, 3:1031-1034, Budapest, Hungary, September 1999.
- (1999) Proc. ESCA EuroSpeech’99 , vol.3 , pp. 1031-1034
- Liu, D.¹ Kubala, F.²

82
- 0345098384
- Multi-site Data Collection for a Spoken Language Corpus
- Harriman, NY, February
- Madcow, M., Multi-site Data Collection for a Spoken Language Corpus, Proc. Darpa Speech &Natural Language Workshop, 7-14, Harriman, NY, February 1992
- (1992) Proc. Darpa Speech &Natural Language Workshop , pp. 7-14
- Madcow, M.¹

83
- 0034296009
- A Stolcke, Finding Consensus in Speech Recognition: Word Error Minimization and Other Applications of Confusion Networks
- L Mangu, E Brill, A Stolcke, Finding Consensus in Speech Recognition: Word Error Minimization and Other Applications of Confusion Networks, Computer, Speech and Language, 14(4):373-400, October 2000.
- (2000) Computer, Speech and Language , vol.14 , Issue.4 , pp. 373-400
- Mangu, L.¹ Brill, E.²

84
- 85135158363
- Subspace distribution clustering for continuous observation density hidden Markov models
- Rhodes, Greece
- B Mak and E Bocchieri, Subspace distribution clustering for continuous observation density hidden Markov models, Proc. Eurospeech’97, 107-110, Rhodes, Greece, September 1997.
- (1997) Proc. Eurospeech’97 , pp. 107-110
- Mak, B.¹ Bocchieri, E.²

85
- 33646936293
- Spoken Language Processing and Human-Machine Communication in the European Union Programs
- G. Varile, ed, Rhodes, Greece, September
- J J Mariani Spoken Language Processing and Human-Machine Communication in the European Union Programs, in G. Varile, ed., Eurospeech’97 EU Speech Projects Day report, Rhodes, Greece, September 1997.
- (1997) Eurospeech’97 EU Speech Projects Day Report
- Mariani, J.J.¹

86
- 0004657714
- An overview of EU programs related to conversational/interactive systems
- Landsdowne, VA
- J. J. Mariani and L. F. Lamel, An overview of EU programs related to conversational/interactive systems, Proc DARPA Broadcast News Transcription &Understanding Workshop, 247-253, Landsdowne, VA, February 1998.
- (1998) Proc DARPA Broadcast News Transcription &Understanding Workshop , pp. 247-253
- Mariani, J.J.¹ Lamel, L.F.²

87
- 85135152717
- Algorithms for Bigram and Trigram Clustering
- Madrid, Spain
- S. Martin, J. Liermann and H. Ney, Algorithms for Bigram and Trigram Clustering, Proc. Eurospeech’95, 1253-1256, Madrid, Spain, September 1995.
- (1995) Proc. Eurospeech’95 , pp. 1253-1256
- Martin, S.¹ Liermann, J.² Ney, H.³

88
- 0043086482
- News on Demand
- M. Maybury (ed.), News on Demand, Special Section in the Communications of the ACM43(2), February 2000.
- (2000) Special Section in the Communications of the ACM , vol.43 , Issue.2
- Maybury, M.¹

89
- 0009588713
- Named Entity Extraction from Broadcast News
- Herndon, VA
- D. Miller, R. Schwartz, R. Weischedel and R. Stone, Named Entity Extraction from Broadcast News, Proc DARPA Broadcast News Workshop, 37-40, Herndon, VA, February 1999
- (1999) Proc DARPA Broadcast News Workshop , pp. 37-40
- Miller, D.¹ Schwartz, R.² Weischedel, R.³ Stone, R.⁴

90
- 84892168937
- Full Expansion of Context-Dependent Networks in Large Vocabulary Speech Recognition
- Seattle, WA
- M. Mohri, M. Riley, D. Hindle, A. Ljolie and F. Pereira, Full Expansion of Context-Dependent Networks in Large Vocabulary Speech Recognition, Proc IEEE ICASSP-98, 665-668, Seattle, WA, May 1998.
- (1998) Proc IEEE ICASSP-98 , pp. 665-668
- Mohri, M.¹ Riley, M.² Hindle, D.³ Ljolie, A.⁴ Pereira, F.⁵

91
- 0027192626
- Large-Vocabulary Dictation using SRI’s Decipher Speech Recognition System: Progressive Search Techniques
- Minneapolis, MN
- H. Murveit, J. Butzberger, V. Digalakis and M. Weintraub, Large-Vocabulary Dictation using SRI’s Decipher Speech Recognition System: Progressive Search Techniques, Proc. IEEE ICASSP-93, II:319-322, Minneapolis, MN, April 1993.
- (1993) Proc. IEEE ICASSP-93 , vol.2 , pp. 319-322
- Murveit, H.¹ Butzberger, J.² Digalakis, V.³ Weintraub, M.⁴

92
- 0021406359
- The Use of a One-Stage Dynamic Programming Algorithm for Connected Word Recognition
- H. Ney, The Use of a One-Stage Dynamic Programming Algorithm for Connected Word Recognition, IEEE Trans. Acoustics, Speech and Signal Processing, ASSP-32(2):263-271, April 1984.
- (1984) IEEE Trans. Acoustics, Speech and Signal Processing , vol.ASSP-32 , Issue.2 , pp. 263-271
- Ney, H.¹

93
- 85017308347
- Improvements in Beam Search for 10000-Word Continuous Speech Recognition
- San Francisco, CA
- H. Ney, R. Haeb-Umbach, B.H. Tran and M. Oerder, Improvements in Beam Search for 10000-Word Continuous Speech Recognition, Proc. IEEE ICASSP-92, I:9-12, San Francisco, CA, March 1992.
- (1992) Proc. IEEE ICASSP-92 , vol.1 , pp. 9-12
- Ney, H.¹ Haeb-Umbach, R.² Tran, B.H.³ Oerder, M.⁴

94
- 0032689227
- Single-Tree Method for Grammar-Directed Search
- Phoenix, AZ
- L. Nguyen and R. Schwartz, Single-Tree Method for Grammar-Directed Search, Proc. IEEE ICASSP-99,2:613-616, Phoenix, AZ, March 1999.
- (1999) Proc. IEEE ICASSP-99 , vol.2 , pp. 613-616
- Nguyen, L.¹ Schwartz, R.²

95
- 23144446072
- MPhil Thesis, Cambridge University Engineering Dept
- J.J. Odell, The Use of Decision Trees with Context Sensitive Phoneme Modelling, MPhil Thesis, Cambridge University Engineering Dept, 1992.
- (1992) The Use of Decision Trees with Context Sensitive Phoneme Modelling
- Odell, J.J.¹

96
- 0001889147
- A One Pass Decoder Design for Large Vocabulary Recognition
- Princeton, NJ
- J.J. Odell, V. Valtchev, P.C. Woodland and S.J. Young, A One Pass Decoder Design for Large Vocabulary Recognition, Proc. ARPA Human Language Technology Workshop, 405-410, Princeton, NJ, March 1994.
- (1994) Proc. ARPA Human Language Technology Workshop , pp. 405-410
- Odell, J.J.¹ Valtchev, V.² Woodland, P.C.³ Young, S.J.⁴

97
- 0002110654
- Recent Advances in Japanese Broadcast News Transcription
- Budapest, Hungary
- K. Ohtsuki, S. Furui, N. Sakurai, A. Iwasaki and Z. P Zeang, Recent Advances in Japanese Broadcast News Transcription, Proc. ESCA Eurospeech’99, 2:671-674, Budapest, Hungary, September 1999.
- (1999) Proc. ESCA Eurospeech’99 , vol.2 , pp. 671-674
- Ohtsuki, K.¹ Furui, S.² Sakurai, N.³ Iwasaki, A.⁴ Zeang, Z.P.⁵

98
- 0036295941
- Modeling Inverse Covariance Matrices by Basis Expansion
- Orlando, FL
- P A. Olsen and R. A. Gopinath, Modeling Inverse Covariance Matrices by Basis Expansion, Proc. IEEE ICASSP-02, 945-948, Orlando, FL, 2002.
- (2002) Proc. IEEE ICASSP-02 , pp. 945-948
- Olsen, P.A.¹ Gopinath, R.A.²

99
- 0030366694
- Language-model look-ahead for large vocabulary speech recognition
- Philadelphia, PA
- S. Ortmanns, H. Ney, and A. Eiden, Language-model look-ahead for large vocabulary speech recognition, Proc. ICSLP’96, 2095-2098, Philadelphia, PA, October 1996
- (1996) Proc. ICSLP’96 , pp. 2095-2098
- Ortmanns, S.¹ Ney, H.² Eiden, A.³

100
- 0030719155
- A Word Graph Algorithm for Large Vocabulary Continuous Speech Recognition
- S. Ortmanns, H. Ney, and X. Aubert, A Word Graph Algorithm for Large Vocabulary Continuous Speech Recognition, Computer, Speech and Language, 11(1):43-72, January 1997.
- (1997) Computer, Speech and Language , vol.11 , Issue.1 , pp. 43-72
- Ortmanns, S.¹ Ney, H.² Aubert, X.³

101
- 0016467605
- The Role of Phonological Rules in Speech Understanding Research
- B. T Oshika, V.W. Zue, R. V. Weeks, H. Neu and J. Aurbach, The Role of Phonological Rules in Speech Understanding Research, IEEE Trans. Acoustics, Speech, Signal Processing, ASSP-23, 104-112,1975.
- (1975) IEEE Trans. Acoustics, Speech, Signal Processing , vol.ASSP-23 , pp. 104-112
- Oshika, B.T.¹ Zue, V.W.² Weeks, R.V.³ Neu, H.⁴ Aurbach, J.⁵

102
- 33645771960
- Continuous Word Recognition Based on the Stochastic Segment Model
- Stanford, CA
- M. Ostendorf, A. Kannan, O. Kimball and J. R. Rohlicek, Continuous Word Recognition Based on the Stochastic Segment Model, Proc ARPA Workshop on Continuous Speech Recognition, 53-58, Stanford, CA, September 1992.
- (1992) Proc ARPA Workshop on Continuous Speech Recognition , pp. 53-58
- Ostendorf, M.¹ Kannan, A.² Kimball, O.³ Rohlicek, J.R.⁴

103
- 0141760645
- 1993 Benchmark Tests for the ARPA Spoken Language Program
- Princeton, NJ
- D. S. Pallett, J. G. Fiscus, W. M. Fisher, J. S. Garofolo, B. A. Lund and M.A. Pryzbocki, 1993 Benchmark Tests for the ARPA Spoken Language Program, Proc. ARPA Human Language Technology Workshop, 49-74, Princeton, NJ, March 1994
- (1994) Proc. ARPA Human Language Technology Workshop , pp. 49-74
- Pallett, D.S.¹ Fiscus, J.G.² Fisher, W.M.³ Garofolo, J.S.⁴ Lund, B.A.⁵ Pryzbocki, M.A.⁶

104
- 0012316245
- 1994 Benchmark Tests for the ARPA Spoken Language Program
- Austin, TX
- D. S. Pallett, J. G. Fiscus, W. M. Fisher, J. S. Garofolo, B. A. Lund, A.F. Martin and M. A. Przybocki, 1994 Benchmark Tests for the ARPA Spoken Language Program, Proc. ARPA Spoken Language Systems Technology Workshop, 536, Austin, TX, January 1995.
- (1995) Proc. ARPA Spoken Language Systems Technology Workshop , pp. 536
- Pallett, D.S.¹ Fiscus, J.G.² Fisher, W.M.³ Garofolo, J.S.⁴ Lund, B.A.⁵ Martin, A.F.⁶ Przybocki, M.A.⁷

105
- 0344230603
- 1995 Hub-3 Multiple Microphone Corpus Benchmark Tests
- Harriman, NY
- D. S. Pallett, J. G. Fiscus, W. M. Fisher, J. S. Garofolo, A.F. Martin and M.A. Przybocki, 1995 Hub-3 Multiple Microphone Corpus Benchmark Tests, Proc. ARPA Speech Recognition Workshop, 27-46, Harriman, NY, February 1996.
- (1996) Proc. ARPA Speech Recognition Workshop , pp. 27-46
- Pallett, D.S.¹ Fiscus, J.G.² Fisher, W.M.³ Garofolo, J.S.⁴ Martin, A.F.⁵ Przybocki, M.A.⁶

106
- 0001895107
- 1998 Broadcast News Benchmark Test Results: English and Non-English Word Error Rate Performance Measures
- Herndon, VA
- D. S. Pallett, J. G. Fiscus, J. S. Garofolo, A.F. Martin and M. A. Przybocki, 1998 Broadcast News Benchmark Test Results: English and Non-English Word Error Rate Performance Measures, Proc. Darpa Broadcast News Workshop, 5-12, Herndon, VA, February 1999.
- (1999) Proc. Darpa Broadcast News Workshop , pp. 5-12
- Pallett, D.S.¹ Fiscus, J.G.² Garofolo, J.S.³ Martin, A.F.⁴ Przybocki, M.A.⁵

107
- 85017287102
- An efficient A stack decoder algorithm for continuous speech recognition with a stochastic language model
- San Francisco, CA
- D. B. Paul, An efficient A stack decoder algorithm for continuous speech recognition with a stochastic language model, Proc. IEEE ICASSP-92, 405-409, San Francisco, CA, March 1992.
- (1992) Proc. IEEE ICASSP-92 , pp. 405-409
- Paul, D.B.¹

108
- 0034849080
- Improved Discriminative Training Techniques For Large Vocabulary Continuous Speech Recognition
- Salt Lake City, May
- D. Povey and P Woodland, Improved Discriminative Training Techniques For Large Vocabulary Continuous Speech Recognition, Proc. IEEE ICASSP-01, Salt Lake City, May 2001.
- (2001) Proc. IEEE ICASSP-01
- Povey, D.¹ Woodland, P.²

109
- 0002617904
- Evaluation of Spoken Language Systems: The ATIS Domain
- Hidden Valley, PA, June
- P Price, Evaluation of Spoken Language Systems: The ATIS Domain, Proc. Darpa Speech and Natural Language Workshop, 91-95, Hidden Valley, PA, June, 1990
- (1990) Proc. Darpa Speech and Natural Language Workshop , pp. 91-95
- Price, P.¹

110
- 0022594196
- An Introduction to Hidden Markov Models
- L. R. Rabiner and B. H. Juang, An Introduction to Hidden Markov Models, IEEE Acoustics Speech and Signal Processing Magazine, ASSP-3(1):4-16, January 1986
- (1986) IEEE Acoustics Speech and Signal Processing Magazine , vol.ASSP-3 , Issue.1 , pp. 4-16
- Rabiner, L.R.¹ Juang, B.H.²

111
- 0004021226
- PhD Thesis, Carnegie Mellon University
- M. K. Ravishankar, Efficient Algorithms for Speech Recognition, PhD Thesis, Carnegie Mellon University, 1996.
- (1996) Efficient Algorithms for Speech Recognition
- Ravishankar, M.K.¹

112
- 0033353288
- Stochastic pronunciation modelling from hand-labelled phonetic corpora
- M. D. Riley, W. Byrne, M. Finke, S. Khudanpu, A. Ljojle, J. McDonough, H. Nock, M. Saraclar, C. Wooters and G. Zavaliagkos, Stochastic pronunciation modelling from hand-labelled phonetic corpora, Speech Communication, 29(2-4):209-224, November 1999.
- (1999) Speech Communication , vol.29 , Issue.2-4 , pp. 209-224
- Riley, M.D.¹ Byrne, W.² Finke, M.³ Khudanpu, S.⁴ Ljojle, A.⁵ McDonough, J.⁶ Nock, H.⁷ Saraclar, M.⁸ Wooters, C.⁹ Zavaliagkos, G.¹⁰

113
- 30244503648
- Improvements in Stochastic Language Modeling
- Harriman, NY
- R. Rosenfeld and X. Huang, Improvements in Stochastic Language Modeling, Proc. Darpa Workshop on Speech &Natural Language, 107-111, Harriman, NY, February 1992
- (1992) Proc. Darpa Workshop on Speech &Natural Language , pp. 107-111
- Rosenfeld, R.¹ Huang, X.²

114
- 0003904645
- Ph. D. Thesis, Carnegie Mellon University, (also Tech. rep. CMU-CS-94-138)
- R. Rosenfeld, Adaptive Statistical Language Modeling, Ph. D. Thesis, Carnegie Mellon University, 1994. (also Tech. rep. CMU-CS-94-138).
- (1994) Adaptive Statistical Language Modeling
- Rosenfeld, R.¹

115
- 33646907991
- Two Decades of Statistical Language Modeling: Where Do We Go From Here?
- R. Rosenfeld, Two Decades of Statistical Language Modeling: Where Do We Go From Here?, Proceedings of the IEEE, Special issue on Spoken Language Processing, 88(8):1270-1278, August 2000.
- (2000) Proceedings of the IEEE, Special Issue on Spoken Language Processing , vol.88 , Issue.8 , pp. 1270-1278
- Rosenfeld, R.¹

116
- 0035426931
- Language-independent and langauge-adaptive acoustic modeling for speech recognition
- T Schultza and A. Waibel, Language-independent and langauge-adaptive acoustic modeling for speech recognition, Speech Communication, 35(1-2):31-51, August 2001.
- (2001) Speech Communication , vol.35 , Issue.1-2 , pp. 31-51
- Schultza, T.¹ Waibel, A.²

117
- 0033896970
- Memory-efficient LVCSR search using a one-pass stack decoder
- M. Schuster, Memory-efficient LVCSR search using a one-pass stack decoder, Computer Speech &Language, 14(1):47-77, January 2000.
- (2000) Computer Speech &Language , vol.14 , Issue.1 , pp. 47-77
- Schuster, M.¹

118
- 85017310294
- New uses for N-Best Sentence Hypothesis, within the BYBLOS Speech Recognition System
- San Francisco, CA
- R. Schwartz, S. Austin, F. Kubala and J. Makhoul, New uses for N-Best Sentence Hypothesis, within the BYBLOS Speech Recognition System, Proc. IEEE ICASSP-92,1:1-4, San Francisco, CA, March 1992.
- (1992) Proc. IEEE ICASSP-92 , vol.1 , pp. 1-4
- Schwartz, R.¹ Austin, S.² Kubala, F.³ Makhoul, J.⁴

119
- 0021142214
- Improved Hidden Markov Modeling of Phonemes for Continuous Speech Recognition
- San Diego, CA
- R. Schwartz, Y. Chow, S. Roucos, M. Krasner and J. Makhoul, Improved Hidden Markov Modeling of Phonemes for Continuous Speech Recognition, Proc. IEEE ICASSP-84, 3:35.6.1-35.6.4, San Diego, CA, March 1984.
- (1984) Proc. IEEE ICASSP-84
- Schwartz, R.¹ Chow, Y.² Roucos, S.³ Krasner, M.⁴ Makhoul, J.⁵

120
- 33646939277
- NYU Language Modeling Experiments for the 1995 CSR Evaluation
- Harriman, NY
- S. Sekine and R. Grishman, NYU Language Modeling Experiments for the 1995 CSR Evaluation, Proc. ARPA Speech Recognition Workshop, 123-128, Harriman, NY, February 1996.
- (1996) Proc. ARPA Speech Recognition Workshop , pp. 123-128
- Sekine, S.¹ Grishman, R.²

121
- 0029726011
- A Markov Random Field Approach to Bayesian Speaker Adaptation
- Detroit, MI
- B. Shahshahani, A Markov Random Field Approach to Bayesian Speaker Adaptation, Proc. IEEE ICASSP-95, 697-700, Detroit, MI, May 1995.
- (1995) Proc. IEEE ICASSP-95 , pp. 697-700
- Shahshahani, B.¹

122
- 0001405849
- Modeling Those F-Conditions - Or Not
- Chantilly, VA
- R. Schwartz, H. Jin, F. Kubala and S. Matsoukas, Modeling Those F-Conditions - Or Not, Proc. Darpa Speech Recognition Workshop, 115-118, Chantilly, VA, February 1997.
- (1997) Proc. Darpa Speech Recognition Workshop , pp. 115-118
- Schwartz, R.¹ Jin, H.² Kubala, F.³ Matsoukas, S.⁴

123
- 0030361237
- Scalable backoff language models
- Philadelphia, PA
- K. Seymore and R. Rosenfeld, Scalable backoff language models, Proc. ICSLP’96, 1:232-235, Philadelphia, PA, October 1996.
- (1996) Proc. ICSLP’96 , vol.1 , pp. 232-235
- Seymore, K.¹ Rosenfeld, R.²

124
- 0002782496
- Automatic Segmentation, Classification and Clustering of Broadcast News Audio
- Chantilly, VA
- M. Siegler, U. Jain, B. Raj and R. Stern, Automatic Segmentation, Classification and Clustering of Broadcast News Audio, Proc DARPA Speech Recognition Workshop, 97-99, Chantilly, VA, February 1997
- (1997) Proc DARPA Speech Recognition Workshop , pp. 97-99
- Siegler, M.¹ Jain, U.² Raj, B.³ Stern, R.⁴

125
- 0033344871
- Evaluation of word confidence for speech recognition systems
- M. Siu and H. Gish, Evaluation of word confidence for speech recognition systems, Computer Speech &Language, 13(4):299-318, October 1999.
- (1999) Computer Speech &Language , vol.13 , Issue.4 , pp. 299-318
- Siu, M.¹ Gish, H.²

126
- 0012611072
- Entropy-based Pruning of Backoff Language Models
- Lands-downe, VA
- A. Stolcke, Entropy-based Pruning of Backoff Language Models, Proc. Darpa Broadcast News Transcription &Understanding Workshop, 270-274, Lands-downe, VA, February 1998
- (1998) Proc. Darpa Broadcast News Transcription &Understanding Workshop , pp. 270-274
- Stolcke, A.¹

127
- 0028996958
- Four-level Tied Structure for Efficient Representation of Acoustic Modeling
- Detroit, MI
- S. Takahashi and S. Sagayama, Four-level Tied Structure for Efficient Representation of Acoustic Modeling, Proc. IEEE ICASSP-95, 520-523, Detroit, MI, May 1995.
- (1995) Proc. IEEE ICASSP-95 , pp. 520-523
- Takahashi, S.¹ Sagayama, S.²

128
- 85135261079
- An Investigation into Vocal Tract Length Normalization
- Budapest, Hungary
- L. F. Uebel and P C. Woodland, An Investigation into Vocal Tract Length Normalization, Proc. ESCA Eurospeech’99, 2527-2530, Budapest, Hungary, September 1999.
- (1999) Proc. ESCA Eurospeech’99 , pp. 2527-2530
- Uebel, L.F.¹ Woodland, P.C.²

129
- 0040262071
- Human Benchmarks for Speaker Independent Large Vocabulary Recognition Performance
- Madrid, Spain
- D.A. van Leeuwen, L. G. van den Berg and H.J. M. Steeneken, Human Benchmarks for Speaker Independent Large Vocabulary Recognition Performance, Proc. ESCA Eurospeech’95, 1461-1464, Madrid, Spain, September 1995.
- (1995) Proc. ESCA Eurospeech’95 , pp. 1461-1464
- Van Leeuwen, D.A.¹ Van Den Berg, L.G.² Steeneken, H.J.M.³

130
- 0010727514
- Speech discrimination by dynamic programming
- T K. Vintsyuk, Speech discrimination by dynamic programming, Kibnernetika, 4:81, 1968.
- (1968) Kibnernetika , vol.4 , pp. 81
- Vintsyuk, T.K.¹

131
- 34250411858
- Elements-wise recognition of continuous speech composed of words from a specified dictionary
- March-April
- T. K. Vintsyuk, Elements-wise recognition of continuous speech composed of words from a specified dictionary, Cybernetics, 7:133-143, March-April 1971.
- (1971) Cybernetics , vol.7 , pp. 133-143
- Vintsyuk, T.K.¹

132
- 0001891171
- Verbmobil: Translation of Face-to-Face Dialogs
- Berlin, Germany, Plenary
- W. Wahlster, Verbmobil: Translation of Face-to-Face Dialogs, Proc. ESCA Eurospeech’93, Berlin, Germany, Plenary, 29-38, September 1993.
- (1993) Proc. ESCA Eurospeech’93 , pp. 29-38
- Wahlster, W.¹

133
- 0012327341
- Multilinguality in Speech and Spoken Language Systems
- A. Waibel, P Geutner, L. Mayfield Tomokiyo, T. Schultz and M. Woszczyna, Multilinguality in Speech and Spoken Language Systems, Proceedings of the IEEE, Special Issue on Spoken Language Processing, 88(8):1297-1313, August 2000.
- (2000) Proceedings of the IEEE, Special Issue on Spoken Language Processing , vol.88 , Issue.8 , pp. 1297-1313
- Waibel, A.¹ Geutner, P.² Mayfield Tomokiyo, L.³ Schultz, T.⁴ Woszczyna, M.⁵

134
- 0032678104
- Probabilistic Models for Topic Detection and Tracking
- Phoenix, AZ
- F. Walls, H. Jin, S. Sista and R. Schwartz, Probabilistic Models for Topic Detection and Tracking, Proc. IEEE ICASSP-99,1:521-524, Phoenix, AZ, March 1999
- (1999) Proc. IEEE ICASSP-99 , vol.1 , pp. 521-524
- Walls, F.¹ Jin, H.² Sista, S.³ Schwartz, R.⁴

135
- 0002707166
- Dragon Systems’ 1997 Broadcast News Transcription System
- Landsdowne, VA
- S. Wegmann, F. Scattone, I. Carp, L. Gillick, R. Roth and J. Yamron, Dragon Systems’ 1997 Broadcast News Transcription System, Proc. Darpa Broadcast News Transcription &Understanding Workshop, 60-65, Landsdowne, VA, February 1998
- (1998) Proc. Darpa Broadcast News Transcription &Understanding Workshop , pp. 60-65
- Wegmann, S.¹ Scattone, F.² Carp, I.³ Gillick, L.⁴ Roth, R.⁵ Yamron, J.⁶

136
- 0032657771
- Progress in Broadcast News Transcription at Dragon Systems
- Phoenix, AZ
- S. Wegmann, P Zhan, and L. Gillick, Progress in Broadcast News Transcription at Dragon Systems, Proc IEEE ICASSP-99, 33-36, Phoenix, AZ, March 1999
- (1999) Proc IEEE ICASSP-99 , pp. 33-36
- Wegmann, S.¹ Zhan, P.² Gillick, L.³

137
- 0030706666
- NeuralNetwork based Measures of Confidence for Word Recognition
- Munich, Germany
- M. Weintraub, F. Beaufays, Z. Rivlin, Y. Konig and A. Stolcke, NeuralNetwork based Measures of Confidence for Word Recognition, Proc. IEEE ICASSP-97, 887-890, Munich, Germany, April 1997.
- (1997) Proc. IEEE ICASSP-97 , pp. 887-890
- Weintraub, M.¹ Beaufays, F.² Rivlin, Z.³ Konig, Y.⁴ Stolcke, A.⁵

138
- 0031630644
- Using word probabilities as confidence measures
- Seattle, WA
- F. Wessel, K. Macherey and R. Schlüter, Using word probabilities as confidence measures, Proc. IEEE ICASSP-98, 225-228, Seattle, WA, May 1998.
- (1998) Proc. IEEE ICASSP-98 , pp. 225-228
- Wessel, F.¹ Macherey, K.² Schlüter, R.³

139
- 84962920544
- Unsupervised training of acoustic models for large vocabulary continuous speech recognition
- Madonna di Campiglio, Italy
- F. Wessel and H. Ney, Unsupervised training of acoustic models for large vocabulary continuous speech recognition, Proc. IEEE ASRU’01, Madonna di Campiglio, Italy, December 2001.
- (2001) Proc. IEEE ASRU’01
- Wessel, F.¹ Ney, H.²

140
- 0026187945
- The Zero Frequency problem: Estimating the problems of Novel Events in Adaptive tex Compression
- I.H. Witten and T. C. Bell, The Zero Frequency problem: Estimating the problems of Novel Events in Adaptive tex Compression, Proc. IEEE Trans. On Information Theory, 37(1):1085-1094, July 1991.
- (1991) Proc. IEEE Trans. On Information Theory , vol.37 , Issue.1 , pp. 1085-1094
- Witten, I.H.¹ Bell, T.C.²

141
- 0036461035
- Large scale discriminative training of hidden Markov models for speech recognition
- P C. Woodland and D. Povey, Large scale discriminative training of hidden Markov models for speech recognition, Computer, Speech and Language, 16(1):25-47, January 2002.
- (2002) Computer, Speech and Language , vol.16 , Issue.1 , pp. 25-47
- Woodland, P.C.¹ Povey, D.²

142
- 0001393274
- The development of the 1994 HTK large vocabulary speech recognition system
- Austin, TX
- P C. Woodland, C. J. Leggetter, J.J. Odell, V. Valtchev and S. J. Young, The development of the 1994 HTK large vocabulary speech recognition system, Proc. ARPA Spoken Language Systems Technology Workshop, 104-109, Austin, TX, January 1995.
- (1995) Proc. ARPA Spoken Language Systems Technology Workshop , pp. 104-109
- Woodland, P.C.¹ Leggetter, C.J.² Odell, J.J.³ Valtchev, V.⁴ Young, S.J.⁵

143
- 0002452931
- The HTK large vocabulary recognition system for the 1995 ARPA H3 task
- Harriman, NY
- P C. Woodland, M. J.F. Gales, D. Pye and V. Valtchev, The HTK large vocabulary recognition system for the 1995 ARPA H3 task, Proc. ARPA Speech Recognition Workshop, 99-104, Harriman, NY, February 1996.
- (1996) Proc. ARPA Speech Recognition Workshop , pp. 99-104
- Woodland, P.C.¹ Gales, M.J.F.² Pye, D.³ Valtchev, V.⁴

144
- 0031624946
- A Hidden Markov Approach to Text Segmentation and Event Tracking
- Seattle, WA
- J.P. Yamron, I. Carp, L. Gillick, S. Lowe and P. van Mulbregt, A Hidden Markov Approach to Text Segmentation and Event Tracking, Proc IEEE ICASSP-98,1:333-336, Seattle, WA, May 1998.
- (1998) Proc IEEE ICASSP-98 , vol.1 , pp. 333-336
- Yamron, J.P.¹ Carp, I.² Gillick, L.³ Lowe, S.⁴ Van Mulbregt, P.⁵

145
- 0030244826
- A Review of Large-Vocabulary Continuous Speech Recognition
- S. J. Young, A Review of Large-Vocabulary Continuous Speech Recognition, IEEE Signal Processing Magazine, 13(5):45-57, September 1996.
- (1996) IEEE Signal Processing Magazine , vol.13 , Issue.5 , pp. 45-57
- Young, S.J.¹

146
- 0030718943
- Multilingual large vocabulary speech recognition: The European SQALE project
- S.J. Young, M. Adda-Decker, X. Aubert, C. Dugast, J. L. Gauvain, D.J. Kershaw, L Lamel, D A Leeuwen, D Pye, H J M Steeneken, A J Robinson and P C. Woodland, Multilingual large vocabulary speech recognition: the European SQALE project, Computer Speech &Language, 11(1):73-89, January 1997
- (1997) Computer Speech &Language , vol.11 , Issue.1 , pp. 73-89
- Young, S.J.¹ Adda-Decker, M.² Aubert, X.³ Dugast, C.⁴ Gauvain, J.L.⁵ Kershaw, D.J.⁶ Lamel, L.⁷ Leeuwen, D.A.⁸ Pye, D.⁹ Steeneken, H.J.M.¹⁰ Robinson, A.J.¹¹ Woodland, P.C.¹²

147
- 0032181247
- Speech recognition evaluation: A review of the U.S. CSR andLVCSR programmes
- S.J. Young and L. Chase, Speech recognition evaluation: a review of the U.S. CSR andLVCSR programmes, Computer Speech &Language, 12(4):263-279, October 1998
- (1998) Computer Speech &Language , vol.12 , Issue.4 , pp. 263-279
- Young, S.J.¹ Chase, L.²

148
- 0002144369
- Tree-Based State Tying for High Accuracy Acoustic Modeling
- Princeton, NJ
- S. J. Young, J. J. Odell and P C. Woodland, Tree-Based State Tying for High Accuracy Acoustic Modeling, Proc. ARPA Human Language Technology Workshop, 307-312, Princeton, NJ, March 1994.
- (1994) Proc. ARPA Human Language Technology Workshop , pp. 307-312
- Young, S.J.¹ Odell, J.J.² Woodland, P.C.³

149
- 85135369802
- The Use of State Tying in Continuous Speech Recognition
- Berlin, Germany
- S.J. Young and P C. Woodland, The Use of State Tying in Continuous Speech Recognition, Proc. ESCA Eurospeech’93, 3:2203-2206, Berlin, Germany, September1993
- (1993) Proc. ESCA Eurospeech’93 , vol.3 , pp. 2203-2206
- Young, S.J.¹ Woodland, P.C.²

150
- 0002162027
- DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, VA
- G. Zavaliagkos and T. Colthurst, Utilizing Untranscribed Training Data to Improve Performance, DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, VA, 301-305, February 1998.
- (1998) Utilizing Untranscribed Training Data to Improve Performance , pp. 301-305
- Zavaliagkos, G.¹ Colthurst, T.²

151
- 0029745232
- Maximum a Posteriori Adaptation for Large Scale HMM Recognizers
- Detroit, MI
- G. Zavaliagkos, R. Schwartz and J. McDonough, Maximum a Posteriori Adaptation for Large Scale HMM Recognizers, Proc IEEE ICASSP-95, 725-728, Detroit, MI, May 1995
- (1995) Proc IEEE ICASSP-95 , pp. 725-728
- Zavaliagkos, G.¹ Schwartz, R.² McDonough, J.³

152
- 85121123643
- The MITSummit Speech Recognition System: A Progress Report
- Philadelphia, PA
- V. Zue, J. Glass, M. Phillips and S. Seneff, The MITSummit Speech Recognition System: A Progress Report, Proc DARPA Speech &Natural Language Workshop, 179-189, Philadelphia, PA, February 1989
- (1989) Proc DARPA Speech &Natural Language Workshop , pp. 179-189
- Zue, V.¹ Glass, J.² Phillips, M.³ Seneff, S.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.