-
1
-
-
0013143830
-
Energy-conditioned spectral estimation for recognition of noisy speech
-
Jan.
-
A. Erell and M. Weintraub, "Energy-conditioned spectral estimation for recognition of noisy speech," IEEE Trans, Speech Audio Processing, vol. 1, no. 1, pp. 84-89, Jan. 1993.
-
(1993)
IEEE Trans, Speech Audio Processing
, vol.1
, Issue.1
, pp. 84-89
-
-
Erell, A.1
Weintraub, M.2
-
2
-
-
0026843273
-
Gain-adapted hidden Markov models for recognition of clean and noisy speech
-
Apr.
-
Y. Ephraim, "Gain-adapted hidden Markov models for recognition of clean and noisy speech," IEEE Trans. Acoust., Speech, Signal Processing, vol. 40, no. 4, pp. 725-735, Apr. 1992.
-
(1992)
IEEE Trans. Acoust., Speech, Signal Processing
, vol.40
, Issue.4
, pp. 725-735
-
-
Ephraim, Y.1
-
3
-
-
0025041264
-
Perceptual linear predictive (PLP) analysis of speech
-
Apr.
-
H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, Apr. 1990.
-
(1990)
J. Acoust. Soc. Amer.
, vol.87
, Issue.4
, pp. 1738-1752
-
-
Hermansky, H.1
-
4
-
-
0000030810
-
Auditory nerve representation as a basis for speech processing
-
S. Furui and M. M. Sondhi, Eds., New York: Marcel Dekker
-
O. Ghitza, "Auditory nerve representation as a basis for speech processing," in S. Furui and M. M. Sondhi, Eds., Advances in Speech Signal Processing. New York: Marcel Dekker, 1992, pp. 453-485.
-
(1992)
Advances in Speech Signal Processing.
, pp. 453-485
-
-
Ghitza, O.1
-
5
-
-
0003071809
-
Evaluation and optimization of perceptually based ASR front end
-
Jan.
-
J.-C. Junqua, H. Wakita, and H. Hermansky, "Evaluation and optimization of perceptually based ASR front end," IEEE Trims. Speech Audio Processing, vol. 1, no. 1, pp. 39-48, Jan. 1993.
-
(1993)
IEEE Trims. Speech Audio Processing
, vol.1
, Issue.1
, pp. 39-48
-
-
Junqua, J.-C.1
Wakita, H.2
Hermansky, H.3
-
6
-
-
33646938054
-
Language processing for speech understanding
-
A. Waibel and K.-F. Lee, Eds., San Mateo, CA: Morgan Kaufman
-
W. A. Woods, "Language processing for speech understanding," in A. Waibel and K.-F. Lee, Eds., Readings in Speech Recognition. San Mateo, CA: Morgan Kaufman, 1990, pp. 519-533.
-
(1990)
Readings in Speech Recognition.
, pp. 519-533
-
-
Woods, W.A.1
-
7
-
-
0344685169
-
High level knowledge sources in usable speech recognition systems
-
A. Waibel and K.-F. Lee, Eds., San Mateo, CA: Morgan Kaufmann
-
S. R. Young, A. G. Hauptmann, W. H. Ward, E. T. Smith, and P. Werner, "High level knowledge sources in usable speech recognition systems," in A. Waibel and K.-F. Lee, Eds., Readings in Speech Recognition, San Mateo, CA: Morgan Kaufmann, 1990, pp. 538-549.
-
(1990)
Readings in Speech Recognition
, pp. 538-549
-
-
Young, S.R.1
Hauptmann, A.G.2
Ward, W.H.3
Smith, E.T.4
Werner, P.5
-
8
-
-
0003539541
-
-
Ph.D. thesis, Carnegie-Mellon Univ., Pittsburgh, PA
-
K.-F. Lee, Large-Vocabulary "Speaker-independent continuous speech recognition: The SPHINX system," Ph.D. thesis, Carnegie-Mellon Univ., Pittsburgh, PA, 1988.
-
(1988)
Large-Vocabulary "Speaker-independent Continuous Speech Recognition: the SPHINX System"
-
-
Lee, K.-F.1
-
9
-
-
33646941794
-
Prosodie knowledge sources for word hypothesization in a continuous speech recognition system
-
A. Waibel and K.-F. Lee, Eds., San Mateo, CA: Morgan Kaufmann
-
A. Waibel, "Prosodie knowledge sources for word hypothesization in a continuous speech recognition system," in A. Waibel and K.-F. Lee, Eds., Readings in Speech Recognition. San Mateo, CA: Morgan Kaufmann, 1990, pp. 534-537.
-
(1990)
Readings in Speech Recognition.
, pp. 534-537
-
-
Waibel, A.1
-
10
-
-
0003699540
-
-
Ph.D. thesis, Univ. of Illinois, Urbana, IL
-
E. D. Petajan, "Automatic lipreading to enhance speech recognition," Ph.D. thesis, Univ. of Illinois, Urbana, IL, 1984.
-
(1984)
Automatic Lipreading to Enhance Speech Recognition
-
-
Petajan, E.D.1
-
11
-
-
0002365852
-
Surface learning with applications to lipreading
-
J. D. Cowan, G. Tesauro, and J. Alspector, Eds, San Francisco, CA: Morgan Kaufmann
-
C. Bregler and S. M. Omohundro, "Surface learning with applications to lipreading," in J. D. Cowan, G. Tesauro, and J. Alspector, Eds, Advances in Neural Information Processing Systems. San Francisco, CA: Morgan Kaufmann, 1994, pp. 43-50, vol. 6.
-
(1994)
Advances in Neural Information Processing Systems.
, vol.6
, pp. 43-50
-
-
Bregler, C.1
Omohundro, S.M.2
-
12
-
-
85009082168
-
A hybrid approach to bimodal speech recognition
-
Pacific Grove, CA, Nov.
-
C. Bregler, S. M. Omohundro, and Y. Konig, "A hybrid approach to bimodal speech recognition," in Proc. 28th Ann. Asilomar Conf. Signals, Syst., Comput., vol. 1, Pacific Grove, CA, Nov. 1994, pp. 556-560.
-
(1994)
Proc. 28th Ann. Asilomar Conf. Signals, Syst., Comput.
, vol.1
, pp. 556-560
-
-
Bregler, C.1
Omohundro, S.M.2
Konig, Y.3
-
13
-
-
84947971415
-
Bimodal recognition experiments with recurrent neural networks
-
P. Cosi, E. M. Caldognetto, K. Vagges, G. A. Mian, and M. Contolini, "Bimodal recognition experiments with recurrent neural networks," in Proc. Int. Conf. Acoust., Speech, Signal Processing, vol. 2, 1994, pp. 11/553-556.
-
(1994)
Proc. Int. Conf. Acoust., Speech, Signal Processing
, vol.2
, pp. 11
-
-
Cosi, P.1
Caldognetto, E.M.2
Vagges, K.3
Mian, G.A.4
Contolini, M.5
-
14
-
-
85135321224
-
See me, hear me: Integrating automatic speech recognition and lip-reading
-
P. Duchnowski, U. Meier, and A. Waibel, "See me, hear me: Integrating automatic speech recognition and lip-reading," in Proc. Int. Conf. Spoken Language Processing, 1994.
-
(1994)
Proc. Int. Conf. Spoken Language Processing
-
-
Duchnowski, P.1
Meier, U.2
Waibel, A.3
-
15
-
-
38249029471
-
Automatic optically-based recognition of speech
-
K. E. Finn and A. A. Montgomery, "Automatic optically-based recognition of speech," Patt. Recogn. Lett., vol. 8, no. 3, pp. 159-164, 1988.
-
(1988)
Patt. Recogn. Lett.
, vol.8
, Issue.3
, pp. 159-164
-
-
Finn, K.E.1
Montgomery, A.A.2
-
17
-
-
78649238564
-
Using deformable templates to infer visual speech dynamics
-
Pacific Grove, CA, Nov.
-
M. E. Hennecke, K. V. Prasad, and D. G. Stork, "Using deformable templates to infer visual speech dynamics," in Proc. 28th Ann. Asilomar Conf. Signals, Syst., Comput., vol. 1, Pacific Grove, CA, Nov. 1994, pp. 578-582.
-
(1994)
Proc. 28th Ann. Asilomar Conf. Signals, Syst., Comput.
, vol.1
, pp. 578-582
-
-
Hennecke, M.E.1
Prasad, K.V.2
Stork, D.G.3
-
18
-
-
85029619676
-
Visual speech recognition with stochastic networks
-
G. Tesauro, D. Touretzky, and T. Leen, Eds., Cambridge, MA: MIT Press
-
J. R. Movellan, "Visual speech recognition with stochastic networks," in G. Tesauro, D. Touretzky, and T. Leen, Eds., Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, vol. 7, 1995, pp. 851-858.
-
(1995)
Advances in Neural Information Processing Systems.
, vol.7
, pp. 851-858
-
-
Movellan, J.R.1
-
19
-
-
84921138344
-
"Speech recognition enhancement by lip information
-
S. Nishida, "Speech recognition enhancement by lip information," in Proc. Comput. Human Interfaces '86, pp. 198-204.
-
Proc. Comput. Human Interfaces '86
, pp. 198-204
-
-
Nishida, S.1
-
20
-
-
4244043499
-
An improved automatic lipreading system to enhance speech recognition
-
AT&T Bell Labs.
-
E. D. Petajan, "An improved automatic lipreading system to enhance speech recognition," Tech. Rep. 11251-871012-111TM, AT&T Bell Labs., 1987.
-
(1987)
Tech. Rep. 11251-871012-111TM
-
-
Petajan, E.D.1
-
21
-
-
85060684689
-
Lip modeling for visual speech recognition
-
Pacific Grove, CA, Nov.
-
R. R. Rao and R. M. Mersereau, "Lip modeling for visual speech recognition," in Proc. 28th Ann. Asilomar Conf. Signals, Syst., Comput., vol. 1, Pacific Grove, CA, Nov. 1994, pp. 587-590.
-
(1994)
Proc. 28th Ann. Asilomar Conf. Signals, Syst., Comput.
, vol.1
, pp. 587-590
-
-
Rao, R.R.1
Mersereau, R.M.2
-
24
-
-
0025503485
-
Neural network models of sensory integration for improved vowel recognition
-
Oct.
-
B. P. Yuhas, M. H. Goldstein, T. J. Sejnowski, and R. E. Jenkins, "Neural network models of sensory integration for improved vowel recognition," Proc. IEEE, vol. 78, no. 10, pp. 1658-1668, Oct. 1990.
-
(1990)
Proc. IEEE
, vol.78
, Issue.10
, pp. 1658-1668
-
-
Yuhas, B.P.1
Goldstein, M.H.2
Sejnowski, T.J.3
Jenkins, R.E.4
-
25
-
-
85132038963
-
Neural network lipreading system for improved speech recognition
-
D. G. Stork, G. Wolff, and E. Levine, "Neural network lipreading system for improved speech recognition," in Proc. Int. Joint Conf. Neural Networks, 1992, pp. 285-295.
-
(1992)
Proc. Int. Joint Conf. Neural Networks
, pp. 285-295
-
-
Stork, D.G.1
Wolff, G.2
Levine, E.3
-
26
-
-
85013580214
-
Sensory integration in audiovisual automatic speech recognition
-
Nov.
-
P. L. Silsbee, "Sensory integration in audiovisual automatic speech recognition," in 28th Ann. Asilomar Conf. Signals, Syst., Comput., vol. I, Nov. 1994, pp. 561-565.
-
(1994)
28th Ann. Asilomar Conf. Signals, Syst., Comput.
, vol.1
, pp. 561-565
-
-
Silsbee, P.L.1
-
27
-
-
2542503213
-
Visual lipreading by computer to improve automatic speech recognition accuracy
-
Univ. of Texas Comput. Vision Res. Center, Austin, TX
-
P. L. Silsbee and A. C. Bovik, "Visual lipreading by computer to improve automatic speech recognition accuracy," Tech. Rep., TR93-02-90, Univ. of Texas Comput. Vision Res. Center, Austin, TX, 1993.
-
(1993)
Tech. Rep., TR93-02-90
-
-
Silsbee, P.L.1
Bovik, A.C.2
-
28
-
-
0000585224
-
Lipreading by neural networks: Visual preprocessing, learning and sensory integration
-
J. D. Cowan, G. Tesauro, and J. Alspector, Eds., San Francisco, CA: Morgan Kaufmann
-
G. J. Wolff, K. V. Prasad, D. G. Stork, and M. E. Hennecke, "Lipreading by neural networks: Visual preprocessing, learning and sensory integration," in J. D. Cowan, G. Tesauro, and J. Alspector, Eds., Advances in Neural Information Processing Systems. San Francisco, CA: Morgan Kaufmann, 1994, pp. 1027-1034, vol. 6.
-
(1994)
Advances in Neural Information Processing Systems.
, vol.6
, pp. 1027-1034
-
-
Wolff, G.J.1
Prasad, K.V.2
Stork, D.G.3
Hennecke, M.E.4
-
29
-
-
0026369237
-
Neural network vowel recognition jointly using voice features and mouth shape image
-
J. Wu et al., "Neural network vowel recognition jointly using voice features and mouth shape image," Patt. Recogn., vol. 24, no. 10, pp. 921-927, 1991.
-
(1991)
Patt. Recogn.
, vol.24
, Issue.10
, pp. 921-927
-
-
Wu, J.1
-
31
-
-
0001048664
-
Visual contribution to speech intelligibility in noise
-
W. H. Sumby and I. Pollock, "Visual contribution to speech intelligibility in noise," J. Acoust. Soc. Amer., vol. 26, pp. 212-215, 1954.
-
(1954)
J. Acoust. Soc. Amer.
, vol.26
, pp. 212-215
-
-
Sumby, W.H.1
Pollock, I.2
-
32
-
-
0002028032
-
Some preliminaries to a comprehensive account of audio-visual speech perception
-
B. Dodd and R. Campbell, Eds., London: Lawrence Erlbaum
-
Q. Summerfield, "Some preliminaries to a comprehensive account of audio-visual speech perception," in B. Dodd and R. Campbell, Eds., Hearing by Eye: The Psychology of Lip-reading. London: Lawrence Erlbaum, 1987, pp. 3-51.
-
(1987)
Hearing by Eye: the Psychology of Lip-reading.
, pp. 3-51
-
-
Summerfield, Q.1
-
33
-
-
0002132290
-
Easy to hear but hard to understand: A lip-reading advantage with intact auditory stimuli
-
B. Dodd and R. Campbell, Eds., London: Lawrence Erlbaum
-
D. Reisbcrg, "Easy to hear but hard to understand: a lip-reading advantage with intact auditory stimuli," in B. Dodd and R. Campbell, Eds., Hearing by Eye: The Psychology of Lip-reading. London: Lawrence Erlbaum, 1987, pp. 97-113.
-
(1987)
Hearing by Eye: the Psychology of Lip-reading.
, pp. 97-113
-
-
Reisbcrg, D.1
-
34
-
-
0008745835
-
Speech perception by ear and eye
-
B. Dodd and R. Campbell, Eds., London: Lawrence Erlbaum
-
D. W. Massaro, "Speech perception by ear and eye," in B. Dodd and R. Campbell, Eds., Hearing by Eye: Tlie Psychology of Lip-reading. London: Lawrence Erlbaum, 1987, pp. 53-83.
-
(1987)
Hearing by Eye: Tlie Psychology of Lip-reading.
, pp. 53-83
-
-
Massaro, D.W.1
-
35
-
-
0017199877
-
Hearing lips and seeing voices
-
H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, vol. 264, pp. 746-748, 1976.
-
(1976)
Nature
, vol.264
, pp. 746-748
-
-
McGurk, H.1
MacDonald, J.2
-
36
-
-
0040914411
-
Lip-reading in the prelingually deaf
-
B. Dodd and R. Campbell, Eds., London: Lawrence Erlbaum
-
K. Mogford, "Lip-reading in the prelingually deaf," in B. Dodd and R. Campbell, Eds., Hearing by Eye: The Psychology of Lip-reading. London: Lawrence Erlbaum, 1987, pp. 191-211.
-
(1987)
Hearing by Eye: the Psychology of Lip-reading.
, pp. 191-211
-
-
Mogford, K.1
-
39
-
-
0017060763
-
Perceptual dimensions underlying vowel lipreading performance
-
P. L. Jackson, A. A. Montgomery, and C. A. Binnie, "Perceptual dimensions underlying vowel lipreading performance," J. Speech Hearing Res., vol. 19, pp. 796-812, 1976.
-
(1976)
J. Speech Hearing Res.
, vol.19
, pp. 796-812
-
-
Jackson, P.L.1
Montgomery, A.A.2
Binnie, C.A.3
-
40
-
-
0346080351
-
Roles of lips and teeth in lipreading vowels
-
M. McGrath, A. Q. Summerfield, and N. M. Brooke, "Roles of lips and teeth in lipreading vowels," Proc. Inst. Acoust., pp. 401-408, 1984.
-
(1984)
Proc. Inst. Acoust.
, pp. 401-408
-
-
McGrath, M.1
Summerfield, A.Q.2
Brooke, N.M.3
-
42
-
-
0023211284
-
Integration of acoustic information in a large vocabulary word recognizer
-
V. M. Gupta, M. Lennig, and P. Mermelstein, "Integration of acoustic information in a large vocabulary word recognizer," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1987, pp. 697-700.
-
(1987)
Proc. Int. Conf. Acoust., Speech, Signal Processing
, pp. 697-700
-
-
Gupta, V.M.1
Lennig, M.2
Mermelstein, P.3
-
44
-
-
0024752328
-
A new vector quantization clustering algorithm
-
Oct.
-
W. H. Equitz, "A new vector quantization clustering algorithm," IEEE Trans. Acoust., Speech, Signal-Processing, vol. 37, no. 10, pp. 1568-1575, Oct. 1989.
-
(1989)
IEEE Trans. Acoust., Speech, Signal-Processing
, vol.37
, Issue.10
, pp. 1568-1575
-
-
Equitz, W.H.1
-
45
-
-
0021412027
-
Vector quantization
-
Apr.
-
R. M. Gray, "Vector quantization," IEEE Acoust., Speech, Signal Processing Mag., vol. 2, pp. 4-29, Apr. 1984.
-
(1984)
IEEE Acoust., Speech, Signal Processing Mag.
, vol.2
, pp. 4-29
-
-
Gray, R.M.1
|