-
1
-
-
0032716023
-
An acoustic-phonetic feature-based system for automatic phoneme recognition in continuous speech
-
Hong Kong, May 1999, IEEE
-
Ali, A.M.A., Van der Spiegel, J., Muller, P., Haentjens, G. & Berman, J. (1999). An acoustic-phonetic feature-based system for automatic phoneme recognition in continuous speech. In Proceedings of the International Symposium on Circuits and Systems, Hong Kong, May 1999, pp. 118-121. IEEE.
-
(1999)
Proceedings of the International Symposium on Circuits and Systems
, pp. 118-121
-
-
Ali, A.M.A.1
Van der Spiegel, J.2
Muller, P.3
Haentjens, G.4
Berman, J.5
-
3
-
-
0025591290
-
Neural networks for voiced/unvoiced speech classification
-
Albuquerque
-
Bendiksen, A. & Steiglitz, K. (1990). Neural networks for voiced/unvoiced speech classification. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90, Albuquerque, pp. 521-524.
-
(1990)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90
, pp. 521-524
-
-
Bendiksen, A.1
Steiglitz, K.2
-
6
-
-
0030643240
-
Sub-band-based speech recognition
-
Munich
-
Bourlard, H. & Dupont, S. (1997). Sub-band-based speech recognition. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '97, Munich, pp. 1251-1254.
-
(1997)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '97
, pp. 1251-1254
-
-
Bourlard, H.1
Dupont, S.2
-
8
-
-
0004119259
-
-
MIT Press, Cambridge, MA, U.S.A.
-
Chomsky, N. & Halle, M. (1968). The Sound Pattern of English. MIT Press, Cambridge, MA, U.S.A.
-
(1968)
The Sound Pattern of English
-
-
Chomsky, N.1
Halle, M.2
-
10
-
-
0035342414
-
Robust automatic speech recognition with missing and unreliable acoustic data
-
in press
-
Cooke, M., Green, P., Josifovski, L. & Vizinho, A. (2001). Robust automatic speech recognition with missing and unreliable acoustic data. Speech Communication, in press.
-
(2001)
Speech Communication
-
-
Cooke, M.1
Green, P.2
Josifovski, L.3
Vizinho, A.4
-
11
-
-
0002629270
-
Maximum likelihood from incomplete data via the EM algorithm
-
Dempster, A., Laird, N. & Rubin, D. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society B, 39, 1-38.
-
(1977)
Journal of the Royal Statistical Society B
, vol.39
, pp. 1-38
-
-
Dempster, A.1
Laird, N.2
Rubin, D.3
-
12
-
-
0028234947
-
A statistical approach to ASR using atomic units constructed from overlapping articulatory features
-
Deng, L. & Sun, D. (1994). A statistical approach to ASR using atomic units constructed from overlapping articulatory features. Journal of the Acoustical Society of America, 95, 2702-2719.
-
(1994)
Journal of the Acoustical Society of America
, vol.95
, pp. 2702-2719
-
-
Deng, L.1
Sun, D.2
-
13
-
-
0029816780
-
An HMM-based speech recognizer using overlapping articulatory features
-
Erler, K. & Freeman, G.H. (1996). An HMM-based speech recognizer using overlapping articulatory features. Journal of the Acoustical Society of America, 96, 2500-2513.
-
(1996)
Journal of the Acoustical Society of America
, vol.96
, pp. 2500-2513
-
-
Erler, K.1
Freeman, G.H.2
-
17
-
-
0021461632
-
Detection in noise by spectro-temporal pattern analysis
-
Hall, J.W., Haggard, M.P. & Fernandes, M.A. (1984). Detection in noise by spectro-temporal pattern analysis. Journal of the Acoustical Society of America, 76, 50-56.
-
(1984)
Journal of the Acoustical Society of America
, vol.76
, pp. 50-56
-
-
Hall, J.W.1
Haggard, M.P.2
Fernandes, M.A.3
-
19
-
-
0030365517
-
Towards ASR on partially corrupted speech
-
Philadelphia
-
Hermansky, H., Pavel, M. & Tibrewala, S. (1996). Towards ASR on partially corrupted speech. In Proceedings of the International Conference on Spoken Language Processing '96, Philadelphia, pp. 462-465.
-
(1996)
Proceedings of the International Conference on Spoken Language Processing '96
, pp. 462-465
-
-
Hermansky, H.1
Pavel, M.2
Tibrewala, S.3
-
22
-
-
0001136927
-
Computing with action potentials
-
M. Jordan, M. Kearns and S. Solla, eds), volume. MIT Press, Cambridge, MA, U.S.A.
-
Hopfield, J., Brody, C. & Rowels, S. (1998). Computing with action potentials. In Advances in Neural Information Processing Systems. (M. Jordan, M. Kearns and S. Solla, eds), volume 10, pp. 166-172. MIT Press, Cambridge, MA, U.S.A.
-
(1998)
Advances in Neural Information Processing Systems
, vol.10
, pp. 166-172
-
-
Hopfield, J.1
Brody, C.2
Rowels, S.3
-
23
-
-
0025680225
-
NTIMIT: A phonetically balanced, continuous speech, telephone bandwidth speech database
-
Albuquerque
-
Jankowsld, C., Kalyanswamy, A., Basson, S. & Spitz, J. (1990). NTIMIT: a phonetically balanced, continuous speech, telephone bandwidth speech database. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90, Albuquerque, pp. 109-112.
-
(1990)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90
, pp. 109-112
-
-
Jankowsld, C.1
Kalyanswamy, A.2
Basson, S.3
Spitz, J.4
-
25
-
-
0034297586
-
Detection of phonological features in continuous speech using neural networks
-
King, S. & Taylor, P. (2000). Detection of phonological features in continuous speech using neural networks. Computer Speech and Language, 14, 333-353.
-
(2000)
Computer Speech and Language
, vol.14
, pp. 333-353
-
-
King, S.1
Taylor, P.2
-
27
-
-
0031187171
-
Speech recognition by machines and humans
-
Lippmann, R.P. (1997). Speech recognition by machines and humans. Speech Communication, 22, 1-15.
-
(1997)
Speech Communication
, vol.22
, pp. 1-15
-
-
Lippmann, R.P.1
-
28
-
-
0029907249
-
Landmark detection for distinctive feature-based speech recognition
-
Liu, S. (1996). Landmark detection for distinctive feature-based speech recognition. Journal of the Acoustical Society of America, 100, 3417-3430.
-
(1996)
Journal of the Acoustical Society of America
, vol.100
, pp. 3417-3430
-
-
Liu, S.1
-
29
-
-
84898935332
-
A framework for multiple-instance learning
-
M. Jordan, M. Kearns and S. Solla, eds. MIT Press, Cambridge, MA, U.S.A.
-
Moron, O. & Lozano-Perez, T. (1998). A framework for multiple-instance learning. In Advances in Neural Information Processing Systems. (M. Jordan, M. Kearns and S. Solla, eds), volume 10, pp. 570-576. MIT Press, Cambridge, MA, U.S.A.
-
(1998)
Advances in Neural Information Processing Systems
, vol.10
, pp. 570-576
-
-
Moron, O.1
Lozano-Perez, T.2
-
31
-
-
0001279385
-
Union: A new approach for combining sub-band observations for noisy speech recognition
-
Tampere, Finland
-
Ming, J. & Smith, F.J. (1999). Union: a new approach for combining sub-band observations for noisy speech recognition, in Proceedings of the Workshop on Robust Methods for Speech Recognition in Adverse Conditions, Tampere, Finland, pp. 175-178.
-
(1999)
Proceedings of the Workshop on Robust Methods for Speech Recognition in Adverse Conditions
, pp. 175-178
-
-
Ming, J.1
Smith, F.J.2
-
33
-
-
0003249790
-
Sooner or later: Exploiting asynchrony in multi-band speech recognition
-
Budapest
-
Mirghafori, N. & Morgan, N. (1999). Sooner or later: exploiting asynchrony in multi-band speech recognition, in Proceedings of Eurospeech-99, Budapest, pp. 595-598.
-
(1999)
Proceedings of Eurospeech-99
, pp. 595-598
-
-
Mirghafori, N.1
Morgan, N.2
-
35
-
-
0020816083
-
Suggested formulae for calculating auditory-filter bandwidths and excitation patterns
-
Moore, B.C.J. & Glasberg, B.R. (1983). Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. Journal of the Acoustical Society of America, 74, 750-753.
-
(1983)
Journal of the Acoustical Society of America
, vol.74
, pp. 750-753
-
-
Moore, B.C.J.1
Glasberg, B.R.2
-
36
-
-
0032657797
-
Distinctive feature detection using support vector machines
-
Phoenix
-
Niyogi, P., Bnrges, C. & Ramesh, P. (1999). Distinctive feature detection using support vector machines. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '99, Phoenix, pp. 425-428.
-
(1999)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '99
, pp. 425-428
-
-
Niyogi, P.1
Bnrges, C.2
Ramesh, P.3
-
37
-
-
0031636902
-
Incorporating voice onset time to improve letter recognition accuracies
-
Seattle
-
Niyogi, P. & Ramesh, P. (1998). Incorporating voice onset time to improve letter recognition accuracies. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '98, Seattle, pp. 721-724.
-
(1998)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '98
, pp. 721-724
-
-
Niyogi, P.1
Ramesh, P.2
-
39
-
-
0004244302
-
-
Prentice Hall, Englewood Cliffs, NJ, U.S.A.
-
Rabiner, L.R. & Juang, B.H. (1993). Fundamentals of Speech Recognition. Prentice Hall, Englewood Cliffs, NJ, U.S.A.
-
(1993)
Fundamentals of Speech Recognition
-
-
Rabiner, L.R.1
Juang, B.H.2
-
40
-
-
0003913694
-
An efficient implementation of the Patterson-Holdsworth auditory filterbank
-
Cupertino, CA
-
Slaney, M. (1993). An efficient implementation of the Patterson-Holdsworth auditory filterbank. Apple Computer Technical Report 35, Cupertino, CA.
-
(1993)
Apple Computer Technical Report
, vol.35
-
-
Slaney, M.1
-
41
-
-
0025623060
-
A perceptual pitch detector
-
Albuquerque
-
Slaney, M. & Lyon, R.F. (1990). A perceptual pitch detector. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90, Albuquerque, pp. 357-360.
-
(1990)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90
, pp. 357-360
-
-
Slaney, M.1
Lyon, R.F.2
-
42
-
-
85031258263
-
A neurally motivated technique for voicing detection and FO estimation for speech
-
University of Sterling, Sterling, Scotland
-
Smith, L. (1996). A neurally motivated technique for voicing detection and FO estimation for speech. CCCN Technical Report 22, University of Sterling, Sterling, Scotland.
-
(1996)
CCCN Technical Report 22
-
-
Smith, L.1
-
43
-
-
84956679481
-
A noise-robust auditory modelling front end for voiced speech
-
Lecture Notes in Computer Science, volume. (W. Gerstner, A. Germond, M. Hasler and J.-D. Nicoud, eds). Springer-Verlag, Heidelberg, Germany
-
Smith, L. (1997). A noise-robust auditory modelling front end for voiced speech. In Artificial Neural Networks ICANN-97, Lecture Notes in Computer Science, volume 1327. (W. Gerstner, A. Germond, M. Hasler and J.-D. Nicoud, eds), pp. 97-102. Springer-Verlag, Heidelberg, Germany.
-
(1997)
Artificial Neural Networks ICANN-97
, vol.1327
, pp. 97-102
-
-
Smith, L.1
-
45
-
-
0030682291
-
Sub-band based recognition of noisy speech
-
Munich
-
Tibrewala, S. & Hermansky, H. (1997). Sub-band based recognition of noisy speech. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '97, Munich, pp. 1255-1258.
-
(1997)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '97
, pp. 1255-1258
-
-
Tibrewala, S.1
Hermansky, H.2
-
46
-
-
0004319968
-
-
Technical Report, DRA Speech Research Unit
-
Varga, A., Steeneken, H.J.M., Tomlinson, M. & Jones, D. (1992). The noisex-92 study on the effect of additive noise on automatic speech recognition. Technical Report, DRA Speech Research Unit.
-
(1992)
The noisex-92 study on the effect of additive noise on automatic speech recognition
-
-
Varga, A.1
Steeneken, H.J.M.2
Tomlinson, M.3
Jones, D.4
|