-
2
-
-
64149096060
-
What HMMs can do Dept. of Elect. Eng., Univ.Washington, Seattle, WA
-
Tech. Rep. UWEETR-2002-003, Online, Available
-
J. Bilmes, What HMMs can do Dept. of Elect. Eng., Univ.Washington, Seattle, WA, Tech. Rep. UWEETR-2002-003, 2002 [Online]. Available: www.ee.washington.edu/techsite/papers26
-
(2002)
-
-
Bilmes, J.1
-
3
-
-
0030245363
-
From HMMs to segment models: A unified view of stochastic modeling for speech recognition
-
Sep
-
M. Ostendorf, V. Digalakis, and O. Kimball, "From HMMs to segment models: a unified view of stochastic modeling for speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 360-378, Sep. 1996.
-
(1996)
IEEE Trans. Speech Audio Process
, vol.4
, Issue.5
, pp. 360-378
-
-
Ostendorf, M.1
Digalakis, V.2
Kimball, O.3
-
4
-
-
0003805597
-
The use of context in large vocabulary speech recognition,
-
Ph.D. dissertation, Univ. Cambridge, Cambridge, U.K
-
J. J. Odell, "The use of context in large vocabulary speech recognition," Ph.D. dissertation, Univ. Cambridge, Cambridge, U.K., 1995.
-
(1995)
-
-
Odell, J.J.1
-
5
-
-
64149101768
-
Cepstral mean compensation for HMM recognition in noise
-
Cannes-Mandelieu, France
-
S. Young, "Cepstral mean compensation for HMM recognition in noise," in Proc. ESCA Workshop on Speech Processing in Adverse Conditions, Cannes-Mandelieu, France, 1992, pp. 123-126.
-
(1992)
Proc. ESCA Workshop on Speech Processing in Adverse Conditions
, pp. 123-126
-
-
Young, S.1
-
6
-
-
0023206946
-
A speech enhancement method based on Kalman filtering
-
K. K. Paliwal and A. Basu, "A speech enhancement method based on Kalman filtering," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1987, pp. 177-180.
-
(1987)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
, pp. 177-180
-
-
Paliwal, K.K.1
Basu, A.2
-
7
-
-
0009578471
-
Multi-Microphone Correlation-Based Processing for Robust Automatic Speech Recognition,
-
Ph.D. dissertation, Carnegie Mellon Univ, Pittsburgh, PA
-
T. M. Sullivan, "Multi-Microphone Correlation-Based Processing for Robust Automatic Speech Recognition," Ph.D. dissertation, Carnegie Mellon Univ., Pittsburgh, PA, 1996.
-
(1996)
-
-
Sullivan, T.M.1
-
8
-
-
0028517164
-
RASTA processing of speech
-
Oct
-
H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578-589, Oct. 1994.
-
(1994)
IEEE Trans. Speech Audio Process
, vol.2
, Issue.4
, pp. 578-589
-
-
Hermansky, H.1
Morgan, N.2
-
9
-
-
85009074661
-
Large- vocabulary audio-visual speech recognition by machines and humans
-
G. Potamianos, C. Neti, G. Iyengar, and E. Helmuth, "Large- vocabulary audio-visual speech recognition by machines and humans," in European Conf. Speech Communication and Technology (EuroSpeech), 2001, pp. 1027-1030.
-
(2001)
European Conf. Speech Communication and Technology (EuroSpeech)
, pp. 1027-1030
-
-
Potamianos, G.1
Neti, C.2
Iyengar, G.3
Helmuth, E.4
-
10
-
-
0030635418
-
Joint distributional modeling with cross-correlation based features
-
J. A. Bilmes, "Joint distributional modeling with cross-correlation based features," in Proc. IEEE ASRU Workshop, 1997, pp. 148-155.
-
(1997)
Proc. IEEE ASRU Workshop
, pp. 148-155
-
-
Bilmes, J.A.1
-
11
-
-
0018455310
-
Supression of acoustic noise in speech using spectral subtraction
-
S. Boll, "Supression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, 1979.
-
(1979)
IEEE Trans. Acoust., Speech, Signal Process
, vol.ASSP-27
, Issue.2
, pp. 113-120
-
-
Boll, S.1
-
12
-
-
0016067897
-
Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
-
B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, 1974.
-
(1974)
J. Acoust. Soc. Amer
, vol.55
, pp. 1304-1312
-
-
Atal, B.S.1
-
13
-
-
0019555090
-
Cepstral analysis technique for automatic speaker verification
-
Apr
-
S. Furui, "Cepstral analysis technique for automatic speaker verification," IEEE Trans. Acoust., Speech, Signal Process., vol. 29, no. 2, pp. 254-272, Apr. 1981.
-
(1981)
IEEE Trans. Acoust., Speech, Signal Process
, vol.29
, Issue.2
, pp. 254-272
-
-
Furui, S.1
-
14
-
-
0030711157
-
Transcription of broadcast television and radio news: The 1996 abbot system
-
Munich, Germany
-
G. D. Cook, D. J. Kershaw, J. D. M. Christie, C. W. Seymour, and S. R. Waterhouse, "Transcription of broadcast television and radio news: the 1996 abbot system," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Munich, Germany, 1997.
-
(1997)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
-
-
Cook, G.D.1
Kershaw, D.J.2
Christie, J.D.M.3
Seymour, C.W.4
Waterhouse, S.R.5
-
15
-
-
0026925484
-
Hidden Markov models with firstorder equalization for noisy speech recognition
-
Sep
-
B.-H. Juang and K. K. Paliwal, "Hidden Markov models with firstorder equalization for noisy speech recognition," IEEE Trans. Signal Process., vol. 40, no. 9, pp. 2136-2143, Sep. 1992.
-
(1992)
IEEE Trans. Signal Process
, vol.40
, Issue.9
, pp. 2136-2143
-
-
Juang, B.-H.1
Paliwal, K.K.2
-
16
-
-
0030149866
-
A maximum-likelihood approach to stochastic matching for robust speech recognition
-
May
-
A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 3, pp. 190-202, May 1996.
-
(1996)
IEEE Trans. Speech Audio Process
, vol.4
, Issue.3
, pp. 190-202
-
-
Sankar, A.1
Lee, C.-H.2
-
17
-
-
0025681008
-
Hidden Markov model decomposition of speech and noise
-
A. P. Varga and R. K. Moore, "Hidden Markov model decomposition of speech and noise," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1990, pp. 845-848.
-
(1990)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
, pp. 845-848
-
-
Varga, A.P.1
Moore, R.K.2
-
18
-
-
85017310148
-
An improved approach to the hidden Markov model decomposition of speech and noise
-
M. J. F. Gales and S. Young, "An improved approach to the hidden Markov model decomposition of speech and noise," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1992, pp. 233-236.
-
(1992)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
, pp. 233-236
-
-
Gales, M.J.F.1
Young, S.2
-
19
-
-
0003940203
-
The Generation and Use of Regression Class Trees for MLLR Adaptation Dept. Eng., Univ. Cambridge
-
Tech. Rep. CUED/FINFENG/ TR263
-
M. J. F. Gales, The Generation and Use of Regression Class Trees for MLLR Adaptation Dept. Eng., Univ. Cambridge, Tech. Rep. CUED/FINFENG/ TR263, 1996.
-
(1996)
-
-
Gales, M.J.F.1
-
20
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Aug
-
S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. 28, no. 4, pp. 357-366, Aug. 1980.
-
(1980)
IEEE Trans. Acoust., Speech, Signal Process
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.B.1
Mermelstein, P.2
-
23
-
-
0033676943
-
Large vocabulary decoding and confidence estimation using word posterior probabilities
-
Istanbul, Turkey
-
G. Evermann and P. C. Woodland, "Large vocabulary decoding and confidence estimation using word posterior probabilities," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Istanbul, Turkey, 2000.
-
(2000)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
-
-
Evermann, G.1
Woodland, P.C.2
-
24
-
-
0030638031
-
A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER)
-
Santa Barbara, CA
-
J. G. Fiscus, "A post-processing system to yield reduced word error rates: recognizer Output Voting Error Reduction (ROVER)," in Proceedings of IEEEWorkshop on Automatic Speech Recognition and Understanding, Santa Barbara, CA, 1997.
-
(1997)
Proceedings of IEEEWorkshop on Automatic Speech Recognition and Understanding
-
-
Fiscus, J.G.1
-
25
-
-
0342759089
-
Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
-
L. Mangu, E. Brill, and A. Stolcke, "Finding consensus in speech recognition: Word error minimization and other applications of confusion networks," in Proc. Third World Multiconf. Systemics, Cybernetics and Informatics, joint with the Fifth Int. Conf. Information Syst. Analysis and Synthesis, 1999, vol. 5, pp. 246-252.
-
(1999)
Proc. Third World Multiconf. Systemics, Cybernetics and Informatics, joint with the Fifth Int. Conf. Information Syst. Analysis and Synthesis
, vol.5
, pp. 246-252
-
-
Mangu, L.1
Brill, E.2
Stolcke, A.3
-
27
-
-
84946807902
-
Building an ASR system for noisy environments: SRI's 2001 SPINE evaluation system
-
V. Gadde, A. Stolcke, D. Vergyri, J. Zheng, K. Sonmez, and A. Venkataraman, "Building an ASR system for noisy environments: SRI's 2001 SPINE evaluation system," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), 2002, pp. 1577-1580.
-
(2002)
Proc. Int. Conf. Spoken Lang. Process. (ICSLP)
, pp. 1577-1580
-
-
Gadde, V.1
Stolcke, A.2
Vergyri, D.3
Zheng, J.4
Sonmez, K.5
Venkataraman, A.6
-
28
-
-
0034853045
-
Speech in noisy environments: Robust automatic segmentation, feature extraction and hypothesis combination
-
R. Singh, M. Seltzer, B. Raj, and R. Stern, "Speech in noisy environments: robust automatic segmentation, feature extraction and hypothesis combination," in Proc. IEEE Int. Conf. Acoust., Speech, and Signal Process. (ICASSP), 2001, pp. 273-276.
-
(2001)
Proc. IEEE Int. Conf. Acoust., Speech, and Signal Process. (ICASSP)
, pp. 273-276
-
-
Singh, R.1
Seltzer, M.2
Raj, B.3
Stern, R.4
-
29
-
-
17344389852
-
Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system
-
B. Kingsbury, G. Saon, L. Mangu, M. Padmanabhan, and R. Sarikaya, "Robust speech recognition in noisy environments: the 2001 IBM SPINE evaluation system," in Proc. IEEE Int. Conf. Acoust., Speech, and Signal Process. (ICASSP), 2002, pp. 53-56.
-
(2002)
Proc. IEEE Int. Conf. Acoust., Speech, and Signal Process. (ICASSP)
, pp. 53-56
-
-
Kingsbury, B.1
Saon, G.2
Mangu, L.3
Padmanabhan, M.4
Sarikaya, R.5
-
30
-
-
85009106519
-
Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise
-
J. Barker,M. Cooke, and P. Green, "Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise," in European Conf. Speech Communication and Technology (EuroSpeech), 2001, pp. 213-216.
-
(2001)
European Conf. Speech Communication and Technology (EuroSpeech)
, pp. 213-216
-
-
Barker, J.1
Cooke, M.2
Green, P.3
-
31
-
-
84938647987
-
Feature vector selection to improve ASR robustness in noisy conditions
-
J. de Veth, L. Mauuary, B. Noe, F. de Wet, J. Sienel, L. Boves, and D. Jouver, "Feature vector selection to improve ASR robustness in noisy conditions," in European Conf. Speech Communication and Technology (EuroSpeech), 2001, pp. 201-204.
-
(2001)
European Conf. Speech Communication and Technology (EuroSpeech)
, pp. 201-204
-
-
de Veth, J.1
Mauuary, L.2
Noe, B.3
de Wet, F.4
Sienel, J.5
Boves, L.6
Jouver, D.7
-
33
-
-
84867608170
-
Low-resource noise-robust feature post-processing on Aurora 2.0
-
C.-P. Chen, J. Bilmes, and K. Kirchhoff, "Low-resource noise-robust feature post-processing on Aurora 2.0," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), 2002, pp. 2445-2448.
-
(2002)
Proc. Int. Conf. Spoken Lang. Process. (ICSLP)
, pp. 2445-2448
-
-
Chen, C.-P.1
Bilmes, J.2
Kirchhoff, K.3
-
34
-
-
85009265586
-
Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases
-
C.-P. Chen, K. Filali, and J. Bilmes, "Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), 2002, pp. 241-244.
-
(2002)
Proc. Int. Conf. Spoken Lang. Process. (ICSLP)
, pp. 241-244
-
-
Chen, C.-P.1
Filali, K.2
Bilmes, J.3
-
35
-
-
33646780873
-
Speech feature smoothing for robust ASR
-
Mar
-
C.-P. Chen, J. Bilmes, and D. Ellis, "Speech feature smoothing for robust ASR," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Mar. 2005.
-
(2005)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
-
-
Chen, C.-P.1
Bilmes, J.2
Ellis, D.3
-
36
-
-
0027465491
-
The Lombard reflex and its role on human listeners and automatic speech recognizers
-
January
-
J. C. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizers," J. Acoust. Soc. Amer. (JASA), vol. 91, no. 1, pp. 510-524, January 1993.
-
(1993)
J. Acoust. Soc. Amer. (JASA)
, vol.91
, Issue.1
, pp. 510-524
-
-
Junqua, J.C.1
-
37
-
-
0032676337
-
On the relative importance of various components of the modulation spectrum for automatic speech recognition
-
N. Kanedera, T. Arai, H. Hermansky, and M. Pavel, "On the relative importance of various components of the modulation spectrum for automatic speech recognition," Speech Commun., vol. 28, no. 1, pp. 43-55, 1999.
-
(1999)
Speech Commun
, vol.28
, Issue.1
, pp. 43-55
-
-
Kanedera, N.1
Arai, T.2
Hermansky, H.3
Pavel, M.4
-
38
-
-
0012236195
-
The CU-HTK march 2000 HUB5E transcription system
-
T. Hain, P. Woodland, G. Evermann, and D. Povey, "The CU-HTK march 2000 HUB5E transcription system," in Proc. Speech Transcription Workshop, 2000.
-
(2000)
Proc. Speech Transcription Workshop
-
-
Hain, T.1
Woodland, P.2
Evermann, G.3
Povey, D.4
-
39
-
-
85009231870
-
Qualcomm-icsi-ogi features for asr
-
Denver, CO
-
A. Adam, L. Burget, S. Dupont, H. Garudadri, F. Grezl, H. Hermansky, S. K. P. Jain, N. Morgan, and S. Sivadas, "Qualcomm-icsi-ogi features for asr," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), Denver, CO, 2002.
-
(2002)
Proc. Int. Conf. Spoken Lang. Process. (ICSLP)
-
-
Adam, A.1
Burget, L.2
Dupont, S.3
Garudadri, H.4
Grezl, F.5
Hermansky, H.6
Jain, S.K.P.7
Morgan, N.8
Sivadas, S.9
-
40
-
-
0027957839
-
Effect of temporal envelope smearing on speech reception
-
Feb
-
R. Drullman, J. M. Festen, and R. Plomp, "Effect of temporal envelope smearing on speech reception," in J. Acoust. Soc. Amer. (JASA), Feb. 1994, vol. 95, no. 2, pp. 1053-1064.
-
(1994)
J. Acoust. Soc. Amer. (JASA)
, vol.95
, Issue.2
, pp. 1053-1064
-
-
Drullman, R.1
Festen, J.M.2
Plomp, R.3
-
41
-
-
85009252470
-
The 2001 GMTK-based SPINE ASR system
-
O. Çetin, H. Nock, K. Kirchhoff, J. Bilmes, and M. Ostendorf, "The 2001 GMTK-based SPINE ASR system," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), 2002.
-
(2002)
Proc. Int. Conf. Spoken Lang. Process. (ICSLP)
-
-
Çetin, O.1
Nock, H.2
Kirchhoff, K.3
Bilmes, J.4
Ostendorf, M.5
-
42
-
-
70249086510
-
Robust ASR front-end using spectral-based and discriminant features: Experiments on the Aurora tasks
-
C. Benitez, L. Burget, B. Chen, S. Dupont, H. Garudadri, H. Hermansky, P. Jain, S. Kajarekar,N.Morgan, and S. Sivadas, "Robust ASR front-end using spectral-based and discriminant features: experiments on the Aurora tasks," in European Conf. Speech Communication and Technology (EuroSpeech), 2001, pp. 429-432.
-
(2001)
European Conf. Speech Communication and Technology (EuroSpeech)
, pp. 429-432
-
-
Benitez, C.1
Burget, L.2
Chen, B.3
Dupont, S.4
Garudadri, H.5
Hermansky, H.6
Jain, P.7
Kajarekar, S.8
Morgan, N.9
Sivadas, S.10
-
43
-
-
0002788784
-
Signal processing for robust speech recognition
-
C.-H. Lee and F. Soong, Eds. Boston, MA: Kluwer
-
R. M. Stern, A. Acero, F.-H. Liu, and Y. Ohshima, "Signal processing for robust speech recognition," in Speech Recognit., C.-H. Lee and F. Soong, Eds. Boston, MA: Kluwer, 1996, pp. 351-378.
-
(1996)
Speech Recognit
, pp. 351-378
-
-
Stern, R.M.1
Acero, A.2
Liu, F.-H.3
Ohshima, Y.4
-
44
-
-
0003434858
-
Perceptually inspired signal-processing strategies fro robust speech recognition in reverberant environments,
-
Ph.D. dissertation, Univ. California, Berkeley
-
B. E. D. Kingsbury, "Perceptually inspired signal-processing strategies fro robust speech recognition in reverberant environments," Ph.D. dissertation, Univ. California, Berkeley, 1998.
-
(1998)
-
-
Kingsbury, B.E.D.1
-
45
-
-
0030711174
-
The modulation spectrogram: In pursuit of an invariant representation of speech
-
S. Greenberg and B. Kingsbury, "The modulation spectrogram: in pursuit of an invariant representation of speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1997, pp. 1647-1650.
-
(1997)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
, pp. 1647-1650
-
-
Greenberg, S.1
Kingsbury, B.2
-
46
-
-
84873312246
-
A review of the MTF concept in room acoustics and its use for estimating speech intelligibility
-
March
-
T. Houtgast and H. J. M. Steeneken, "A review of the MTF concept in room acoustics and its use for estimating speech intelligibility," J. Acoust. Soc. Amer. (JASA), vol. 77, no. 3, pp. 1069-1077, March 1985.
-
(1985)
J. Acoust. Soc. Amer. (JASA)
, vol.77
, Issue.3
, pp. 1069-1077
-
-
Houtgast, T.1
Steeneken, H.J.M.2
-
47
-
-
0038669544
-
The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions
-
Sep
-
H. G. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions," in ICSA ITRW ASR 2000, Sep. 2000.
-
(2000)
ICSA ITRW ASR 2000
-
-
Hirsch, H.G.1
Pearce, D.2
-
48
-
-
64149119352
-
-
Motorola Au/374/01, Small Vocabulary Evaluation: Baseline mel-cepstrum Performances With Speech Endpoints Oct. 2001.
-
Motorola Au/374/01, Small Vocabulary Evaluation: Baseline mel-cepstrum Performances With Speech Endpoints Oct. 2001.
-
-
-
-
49
-
-
64149116336
-
Blind MVA speech feature processing on Aurora 2.0 Dept. Elect. Eng., Univ. Washington, Seattle, WA
-
Tech. Rep. UWEETR-2004-0017, Online, Available
-
C.-P. Chen, J. Bilmes, and D. Ellis, Blind MVA speech feature processing on Aurora 2.0 Dept. Elect. Eng., Univ. Washington, Seattle, WA, Tech. Rep. UWEETR-2004-0017, 2004 [Online]. Available: http://www.ee.washington.edu/ techsite/papers
-
(2004)
-
-
Chen, C.-P.1
Bilmes, J.2
Ellis, D.3
-
50
-
-
64149109407
-
MVA processing of speech features Dept. Elect. Eng., Univ. Washington, Seattle, WA
-
Tech. Rep. UWEETR- 2003-0024, Online, Available
-
C.-P. Chen and J. Bilmes, MVA processing of speech features Dept. Elect. Eng., Univ. Washington, Seattle, WA, Tech. Rep. UWEETR- 2003-0024, 2003 [Online]. Available: http://www.ee.washington.edu/ techsite/papers
-
(2003)
-
-
Chen, C.-P.1
Bilmes, J.2
|