-
1
-
-
85009106482
-
Audiovisual representation of prosody in expressive speech communication
-
B. Granström and D. House, "Audiovisual representation of prosody in expressive speech communication," ISCA Int. Conf. Speech Prosody, pp. 393-400, 2004.
-
(2004)
ISCA Int. Conf. Speech Prosody
, pp. 393-400
-
-
Granström, B.1
House, D.2
-
2
-
-
0017199877
-
Hearing lips and seeing voices
-
H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, vol. 264, pp. 746-748, 1976.
-
(1976)
Nature
, vol.264
, pp. 746-748
-
-
McGurk, H.1
MacDonald, J.2
-
3
-
-
0001048664
-
Visual contributions to speech intelligibility in noise
-
W. H. Sumby and I. Pollak, "Visual contributions to speech intelligibility in noise," J. Acoust. Soc. Am., vol. 26, No. 2, pp. 212-215, 1954.
-
(1954)
J. Acoust. Soc. Am.
, vol.26
, Issue.2
, pp. 212-215
-
-
Sumby, W.H.1
Pollak, I.2
-
4
-
-
0025767028
-
Evaluating the articulation index for auditory-visual input
-
K. W. Grant and L. D. Braida, "Evaluating the articulation index for auditory-visual input," J. Acoust. Soc. Am., vol. 89, No. 6, pp. 2952-2960, 1991.
-
(1991)
J. Acoust. Soc. Am.
, vol.89
, Issue.6
, pp. 2952-2960
-
-
Grant, K.W.1
Braida, L.D.2
-
5
-
-
85027136924
-
Minimum error rate training of inter-word context dependent acoustic model units in speech recognition
-
W. Chou, C.-H. Lee, and B. H. Juang, "Minimum error rate training of inter-word context dependent acoustic model units in speech recognition," Proc. Int. Conf. Spoken Lang. Process., pp. 439-442, 1994.
-
(1994)
Proc. Int. Conf. Spoken Lang. Process.
, pp. 439-442
-
-
Chou, W.1
Lee, C.-H.2
Juang, B.H.3
-
6
-
-
0032140546
-
On stochastic feature and model compensation approaches to robust speech recognition
-
C.-H. Lee, "On stochastic feature and model compensation approaches to robust speech recognition," Speech Comm., vol. 25, No. 1, pp. 29-47, 1998.
-
(1998)
Speech Comm.
, vol.25
, Issue.1
, pp. 29-47
-
-
Lee, C.-H.1
-
7
-
-
84946801025
-
Use of real and contaminated speech for training of a hands-free in-car speech recognizer
-
M. Matassoni, M. Omologo, and P. Svaizer, "Use of real and contaminated speech for training of a hands-free in-car speech recognizer," Eurospeech, 2001.
-
(2001)
Eurospeech
-
-
Matassoni, M.1
Omologo, M.2
Svaizer, P.3
-
8
-
-
0000874053
-
Le signe de l'elevation de la voix
-
E. Lombard, "Le signe de l'elevation de la voix," Ann. Maladies Oreille, Larynx, Nez, Pharynx, vol. 37, pp. 101-119, 1911.
-
(1911)
Ann. Maladies Oreille, Larynx, Nez, Pharynx
, vol.37
, pp. 101-119
-
-
Lombard, E.1
-
9
-
-
0022915795
-
Recognition of speech under stress and in noise
-
P. Rajasekaran, G. Doddington, and J. Picone, "Recognition of speech under stress and in noise," Proc. Int. Conf. Acoust., Speech, and Sig. Process., pp. 733-736, 1986.
-
(1986)
Proc. Int. Conf. Acoust., Speech, and Sig. Process
, pp. 733-736
-
-
Rajasekaran, P.1
Doddington, G.2
Picone, J.3
-
10
-
-
0034817675
-
Optimized second-order gradient microphone for hands-free speech recordings in cars
-
R. Aubauer and D. Leckschat, "Optimized second-order gradient microphone for hands-free speech recordings in cars," Speech Comm., vol. 34, No. 1-2, pp. 13-23, 2001.
-
(2001)
Speech Comm.
, vol.34
, Issue.1-2
, pp. 13-23
-
-
Aubauer, R.1
Leckschat, D.2
-
12
-
-
85135275880
-
The SpeechDat-car multilingual speech databases for in-car applications: Some first validation results
-
H. V. den Heuvel, R. Boudy, S. Euler, A. Moreno, and G. Richard, "The SpeechDat-Car multilingual speech databases for in-car applications: Some first validation results," Eurospeech, pp. 2279-2282, 1999.
-
(1999)
Eurospeech
, pp. 2279-2282
-
-
Den Heuvel, H.V.1
Boudy, R.2
Euler, S.3
Moreno, A.4
Richard, G.5
-
13
-
-
85009152939
-
CU-move: Robust speech processing for in-vehicle speech systems
-
J. H. L. Hansen, J. Plucienkowski, S. Gallant, B. Pellom, and W. Ward, "CU-Move: Robust speech processing for in-vehicle speech systems," Proc. Int. Conf. Spoken Lang. Process., pp. 524-527, 2000.
-
(2000)
Proc. Int. Conf. Spoken Lang. Process
, pp. 524-527
-
-
Hansen, J.H.L.1
Plucienkowski, J.2
Gallant, S.3
Pellom, B.4
Ward, W.5
-
14
-
-
0013302639
-
CSDC - The MoTiV car speech data collection
-
D. Langmann, H. R. Pfitzinger, T. Schneider, R. Grudszus, A. Fischer, M. Westphal, T. Crull, and U. Jekosch, "CSDC - the MoTiV car speech data collection," Proc. Int. Conf. Lang. Resources and Eval., pp. 1107-1110, 1998.
-
(1998)
Proc. Int. Conf. Lang. Resources and Eval.
, pp. 1107-1110
-
-
Langmann, D.1
Pfitzinger, H.R.2
Schneider, T.3
Grudszus, R.4
Fischer, A.5
Westphal, M.6
Crull, T.7
Jekosch, U.8
-
15
-
-
85032752352
-
Audiovisual speech processing
-
T. Chen, "Audiovisual speech processing," IEEE Sig. Process. Magazine, vol. 18, No. 1, pp. 9-21, 2001.
-
(2001)
IEEE Sig. Process. Magazine
, vol.18
, Issue.1
, pp. 9-21
-
-
Chen, T.1
-
16
-
-
0036295989
-
Audio-visual speech modeling using coupled hidden Markov models
-
S. Chu and T. Huang, "Audio-visual speech modeling using coupled hidden Markov models," Proc. Int. Conf. Acoust., Speech, and Sig. Process., pp. 2009-2012, 2002.
-
(2002)
Proc. Int. Conf. Acoust., Speech, and Sig. Process.
, pp. 2009-2012
-
-
Chu, S.1
Huang, T.2
-
17
-
-
85009099416
-
-
http://amp.ece.cmu.edu/projects/AudioVisualSpeechProcessing/.
-
-
-
-
18
-
-
0036299249
-
CUAVE: A new audio-visual database for multimodal human-computer interface research
-
E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy, "CUAVE: A new audio-visual database for multimodal human-computer interface research," Proc. Int. Conf. Acoust., Speech, and Sig. Process., pp. 2017-2020, 2002.
-
(2002)
Proc. Int. Conf. Acoust., Speech, and Sig. Process
, pp. 2017-2020
-
-
Patterson, E.K.1
Gurbuz, S.2
Tufekci, Z.3
Gowdy, J.N.4
-
19
-
-
84948594425
-
An algorithm for linearly constrained adaptive array processing
-
O. L. Frost, III, "An algorithm for linearly constrained adaptive array processing," Proc. of IEEE, vol. 60, No. 8, pp. 926-935, 1972.
-
(1972)
Proc. of IEEE
, vol.60
, Issue.8
, pp. 926-935
-
-
Frost, O.L.1
-
20
-
-
0019928857
-
An alternative approach to linearly constrained adaptive beamforming
-
L. J. Griffiths and C. W. Jim, "An alternative approach to linearly constrained adaptive beamforming," IEEE Trans. Antennas and Propag., vol. 30, No. 1, pp. 27-34, 1982.
-
(1982)
IEEE Trans. Antennas and Propag.
, vol.30
, Issue.1
, pp. 27-34
-
-
Griffiths, L.J.1
Jim, C.W.2
-
21
-
-
0034818519
-
Multi-microphone noise reduction techniques as front-end devices for speech recognition
-
J. Bitzer, K. U. Simmer, and K.-D. Kammeyer, "Multi-microphone noise reduction techniques as front-end devices for speech recognition," Speech Comm., vol. 34, pp. 3-12, 2001.
-
(2001)
Speech Comm.
, vol.34
, pp. 3-12
-
-
Bitzer, J.1
Simmer, K.U.2
Kammeyer, K.-D.3
-
22
-
-
0032677010
-
Performance of an hmm speech recognizer using a real-time tracking microphone array as input
-
T. B. Hughes, H.-S. Kim, J. H. DiBiase, and H. F. Silverman, "Performance of an hmm speech recognizer using a real-time tracking microphone array as input," IEEE Trans. Speech and Audio Process., vol. 7, No. 3, pp. 346-349, 1999.
-
(1999)
IEEE Trans. Speech and Audio Process
, vol.7
, Issue.3
, pp. 346-349
-
-
Hughes, T.B.1
Kim, H.-S.2
DiBiase, J.H.3
Silverman, H.F.4
-
23
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multim., vol. 2, No. 3, pp. 141-151, 2000.
-
(2000)
IEEE Trans. Multim.
, vol.2
, Issue.3
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
24
-
-
0032309170
-
3D modeling and tracking of human lip motions
-
S. Basu, N. Oliver, and A. Pentland, "3D modeling and tracking of human lip motions," Proc. Sixth Int. Conf. Computer Vision, pp. 337-343, 1998.
-
(1998)
Proc. Sixth Int. Conf. Computer Vision
, pp. 337-343
-
-
Basu, S.1
Oliver, N.2
Pentland, A.3
-
25
-
-
0036844217
-
Modeling and animating realistic faces from images
-
F. Pighin, R. Szeliski, and D. H. Salesin, "Modeling and animating realistic faces from images," Int. J. of Computer Vision, vol. 50, No. 2, pp. 143-169, 2002.
-
(2002)
Int. J. of Computer Vision
, vol.50
, Issue.2
, pp. 143-169
-
-
Pighin, F.1
Szeliski, R.2
Salesin, D.H.3
-
26
-
-
0025477640
-
Speech database development at MIT: Timit and beyond
-
V. Zue, S. Seneff, and J. Glass, "Speech database development at MIT: TIMIT and beyond," Speech Comm., vol. 9, No. 4, pp. 351-356, 1990.
-
(1990)
Speech Comm.
, vol.9
, Issue.4
, pp. 351-356
-
-
Zue, V.1
Seneff, S.2
Glass, J.3
|