-
1
-
-
15044345504
-
Audio-visual automatic speech recognition: An overview
-
10MIT Press Cambridge, Mass, USA
-
G. Potamianos C. Neti J. Luettin I. Matthews G. Bailly E. Vatikiotis-Bateson P. Perrier Audio-visual automatic speech recognition: an overview. Issues in Visual and Audio-Visual Speech Processing 10 MIT Press Cambridge, Mass, USA 2004
-
(2004)
Issues in Visual and Audio-Visual Speech Processing
-
-
Potamianos, G.1
Neti, C.2
Luettin, J.3
Matthews, I.4
Bailly, G.5
Vatikiotis-Bateson, E.6
Perrier, P.7
-
2
-
-
85032752352
-
Audiovisual speech processing
-
T. Chen Audiovisual speech processing. IEEE Signal Processing Magazine 18 2001 1 9 21
-
(2001)
IEEE Signal Processing Magazine
, vol.18
, Issue.1
, pp. 9-21
-
-
Chen, T.1
-
5
-
-
0032178592
-
Quantitative association of vocal-tract and facial behavior
-
hani@cpdee.ufmg.br
-
H. Yehia hani@cpdee.ufmg.br P. Rubin E. Vatikiotis-Bateson Quantitative association of vocal-tract and facial behavior. Speech Communication 26 1-2 1998 23 43
-
(1998)
Speech Communication
, vol.26
, Issue.1-2
, pp. 23-43
-
-
Yehia, H.1
Rubin, P.2
Vatikiotis-Bateson, E.3
-
7
-
-
84899028297
-
Audio-vision: Using audio-visual synchrony to locate sounds
-
MIT Press Cambridge, Mass, USA
-
J. Hershey J. Movellan M. S. Kearns S. A. Solla D. A. Cohn Audio-vision: using audio-visual synchrony to locate sounds. Advances in Neural Information Processing Systems 11 MIT Press Cambridge, Mass, USA 1999 813 819
-
(1999)
Advances in Neural Information Processing Systems 11
, pp. 813-819
-
-
Hershey, J.1
Movellan, J.2
Kearns, M.S.3
Solla, S.A.4
Cohn, D.A.5
-
8
-
-
33947662555
-
Detecting replay attacks in audiovisual identity verification
-
Toulous, France
-
H. Bredin A. Miguel I. H. Witten G. Chollet Detecting replay attacks in audiovisual identity verification. Proceedings of the 31st IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '06) 1 Toulous, France 2006 621 624
-
(2006)
Proceedings of the 31st IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '06)
, vol.1
, pp. 621-624
-
-
Bredin, H.1
Miguel, A.2
Witten, I.H.3
Chollet, G.4
-
9
-
-
2642562769
-
Speaker association with signal-level audiovisual fusion
-
trevor@ai.mit.edu
-
J. W. Fisher III T. Darrell trevor@ai.mit.edu Speaker association with signal-level audiovisual fusion. IEEE Transactions on Multimedia 6 3 2004 406 413
-
(2004)
IEEE Transactions on Multimedia
, vol.6
, Issue.3
, pp. 406-413
-
-
Fisher Iii, J.W.1
Darrell, T.2
-
14
-
-
2642557514
-
FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks
-
MIT Press Cambridge, Mass, USA
-
M. Slaney M. Covell FaceSync: a linear operator for measuring synchronization of video facial images and audio tracks. Advances in Neural Information Processing Systems 13 MIT Press Cambridge, Mass, USA 2000 814 820
-
(2000)
Advances in Neural Information Processing Systems 13
, pp. 814-820
-
-
Slaney, M.1
Covell, M.2
-
16
-
-
0001390793
-
Speech analysis and synthesis methods developed at ECL in NTT-from LPC to LSP
-
N. Sugamura F. Itakura Speech analysis and synthesis methods developed at ECL in NTT-from LPC to LSP. Speech Communications 5 1986 2 199 215
-
(1986)
Speech Communications
, vol.5
, Issue.2
, pp. 199-215
-
-
Sugamura, N.1
Itakura, F.2
-
17
-
-
85013597845
-
"eigenlips" for robust speech recognition
-
Adelaide, Australia
-
C. Bregler Y. Konig "Eigenlips" for robust speech recognition. Proceedings of the 19th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '94) 2 Adelaide, Australia 1994 669 672
-
(1994)
Proceedings of the 19th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '94)
, vol.2
, pp. 669-672
-
-
Bregler, C.1
Konig, Y.2
-
24
-
-
0000466122
-
Survey on independent component analysis
-
A. Hyvärinen Survey on independent component analysis. Neural Computing Surveys 2 1999 94 128
-
(1999)
Neural Computing Surveys
, vol.2
, pp. 94-128
-
-
Hyvärinen, A.1
-
27
-
-
34347328717
-
-
ICA http://www.cis.hut.fi/projects/ica/fastica/
-
-
-
Ica1
-
28
-
-
34347350736
-
-
Canonical Correlation Analysis.
-
Canonical Correlation Analysis. http://people.imt.liu.se/∼magnus/cca/
-
-
-
-
29
-
-
84977421344
-
Co-inertia analysis: An alternative method for studying species-environment relationships
-
S. Dolédec D. Chessel Co-inertia analysis: an alternative method for studying species-environment relationships. Freshwater Biology 31 1994 277 294
-
(1994)
Freshwater Biology
, vol.31
, pp. 277-294
-
-
Dolédec, S.1
Chessel, D.2
-
30
-
-
84898954418
-
Learning joint statistical models for audio-visual fusion and segregation
-
MIT Press Cambridge, Mass, USA
-
J. W. Fisher T. Darrell W. T. Freeman P. Viola T. K. Leen T. G. Dietterich V. Tresp Learning joint statistical models for audio-visual fusion and segregation. Advances in Neural Information Processing Systems 13 MIT Press Cambridge, Mass, USA 2001 772 778
-
(2001)
Advances in Neural Information Processing Systems 13
, pp. 772-778
-
-
Fisher, J.W.1
Darrell, T.2
Freeman, W.T.3
Viola, P.4
Leen, T.K.5
Dietterich, T.G.6
Tresp, V.7
-
31
-
-
0036874541
-
Separation of audio-visual speech sources: A new approach exploiting the audio-visual coherence of speech stimuli
-
jacob.klinkisch@gmx.de girin@icp.inpg.fr schwartz@icp.inpg.fr sodoyer@icp.inpg.fr christian.jutten@inpg.fr
-
D. Sodoyer sodoyer@icp.inpg.fr J.-L. Schwartz schwartz@icp.inpg.fr L. Girin girin@icp.inpg.fr J. Klinkisch jacob.klinkisch@gmx.de C. Jutten christian.jutten@inpg.fr Separation of audio-visual speech sources: a new approach exploiting the audio-visual coherence of speech stimuli. EURASIP Journal on Applied Signal Processing 2002 11 2002 1165 1173
-
(2002)
EURASIP Journal on Applied Signal Processing
, vol.2002
, Issue.11
, pp. 1165-1173
-
-
Sodoyer, D.1
Schwartz, J.-L.2
Girin, L.3
Klinkisch, J.4
Jutten, C.5
-
32
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
L. R. Rabiner A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 77 2 1989 257 286
-
(1989)
Proceedings of the IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
35
-
-
0001935972
-
XM2VTSDB: The extended M2VTS database
-
Washington, DC, USA
-
K. Messer J. Matas J. Kittler J. Luettin G. Maitre XM2VTSDB: the extended M2VTS database. Proceedings of International Conference on Audio- and Video-Based Biometric Person Authentication (AVBPA '99) Washington, DC, USA 1999 72 77
-
(1999)
Proceedings of International Conference on Audio- And Video-Based Biometric Person Authentication (AVBPA '99)
, pp. 72-77
-
-
Messer, K.1
Matas, J.2
Kittler, J.3
Luettin, J.4
Maitre, G.5
-
36
-
-
34347329766
-
-
BT-DAVID http://eegalilee.swan.ac.uk/
-
-
-
Bt-David1
-
37
-
-
33749528634
-
BIOMET: A multimodal person authentication database including face, voice, fingerprint, hand and signature modalities
-
Guildford, UK
-
S. Garcia-Salicetti C. Beumier G. Chollet BIOMET: a multimodal person authentication database including face, voice, fingerprint, hand and signature modalities. Proceedings of the 4th International Conference on Audio-and Video-Based Biometric Person Authentication (AVBPA '03) Guildford, UK 2003 845 853
-
(2003)
Proceedings of the 4th International Conference on Audio-and Video-Based Biometric Person Authentication (AVBPA '03)
, pp. 845-853
-
-
Garcia-Salicetti, S.1
Beumier, C.2
Chollet, G.3
-
39
-
-
33947376189
-
Multimodal speaker identification using canonical correlation analysis
-
Toulouse, France
-
M. E. Sargin E. Erzin Y. Yemez A. M. Tekalp Multimodal speaker identification using canonical correlation analysis. Proceedings of the 31st IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '06) 1 Toulouse, France 2006 613 616
-
(2006)
Proceedings of the 31st IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '06)
, vol.1
, pp. 613-616
-
-
Sargin, M.E.1
Erzin, E.2
Yemez, Y.3
Tekalp, A.M.4
-
40
-
-
34347345601
-
-
Text Retrieval Conference Video Track.
-
Text Retrieval Conference Video Track. http://trec.nist.gov/
-
-
-
-
41
-
-
34547523367
-
Audio-visual speech synchrony measure for talking-face identity verification
-
Honolulu, Hawaii, USA
-
H. Bredin G. Chollet Audio-visual speech synchrony measure for talking-face identity verification. Proceedings of the 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '07) Honolulu, Hawaii, USA 2007
-
(2007)
Proceedings of the 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '07)
-
-
Bredin, H.1
Chollet, G.2
|