SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 17, Issue 3, 2009, Pages 436-445

Unsupervised stream-weights computation in classification and recognition tasks

(3) Sánchez Soto, Eduardo b,c Potamianos, Alexandros a Daoudi, Khalid b

a TECHNICAL UNIVERSITY OF CRETE (Greece)

b CNRS (France)

c ORANGE LABS (France)

Author keywords

Decision fusion; Multistream weights estimation; Robust speech recognition

Indexed keywords

ARTIFICIAL DATA; AUDIO-VISUAL SPEECH; BACKGROUND MODEL; CLASS-DISTANCE; CLASSIFICATION AND RECOGNITION; CLASSIFICATION ERRORS; CLASSIFICATION TASKS; DECISION FUSION; INFORMATIVENESS; MODELING ERRORS; MULTISTREAM WEIGHTS ESTIMATION; NONLINEAR FUNCTIONS; ROBUST SPEECH RECOGNITION; STREAM RELIABILITY; TESTING CONDITIONS; THEORETICAL RESULT; TWO-STREAM; USE-MODEL; WEIGHT ESTIMATION;

ALGORITHMS; ESTIMATION;

SPEECH RECOGNITION;

EID: 70350439278 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2008.2011513 Document Type: Article

Times cited : (6)

References (16)

1
- 0030355935
- A new ASR approach based on independent processing and recombination of partial frequency bands
- Philadelphia, PA, Oct
- H. Bourlard and S. Dupont, "A new ASR approach based on independent processing and recombination of partial frequency bands, " in Proc. ICSLP, Philadelphia, PA, Oct. 1996.
- (1996) Proc. ICSLP
- Bourlard, H.¹ Dupont, S.²

2
- 85009097228
- Modeling auxiliary information in bayesian network based ASR
- Aalborg, Denmark, Sep
- T. Stephenson, M. Mathew, and H. Bourlard, "Modeling auxiliary information in bayesian network based ASR, " in Proc. Eurospeech, Aalborg, Denmark, Sep. 2001.
- (2001) Proc. Eurospeech
- Stephenson, T.¹ Mathew, M.² Bourlard, H.³

3
- 4544290191
- Recent advances in the automatic recognition of audiovisual speech
- Sep
- G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. Senior, "Recent advances in the automatic recognition of audiovisual speech, " Proc. IEEE, vol. 91, no. 9, pp. 1306-1326, Sep. 2003.
- (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1326
- Potamianos, G.¹ Neti, C.² Gravier, G.³ Garg, A.⁴ Senior, A.⁵

4
- 85135374344
- Integration of acoustic and visual speech for speaker recognition
- Berlin, Germany, Sep
- C. Chibelushi, J. Mason, and F. Deravi, "Integration of acoustic and visual speech for speaker recognition, " in Proc. EUROSPEECH, Berlin, Germany, Sep. 1993.
- (1993) Proc. EUROSPEECH
- Chibelushi, C.¹ Mason, J.² Deravi, F.³

5
- 0034270644
- Audio-visual speech modeling for continuous speech recognition
- Sep
- S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition, " IEEE Trans. Multimedia, vol. 2, no. 3, pp. 141-151, Sep. 2000.
- (2000) IEEE Trans. Multimedia , vol.2 , Issue.3 , pp. 141-151
- Dupont, S.¹ Luettin, J.²

6
- 0036874527
- Noise adaptive stream weighting in audio-visual speech recognition
- Nov
- M. Heckmann, F. Berthommier, and K. Kroschel, "Noise adaptive stream weighting in audio-visual speech recognition, " EURASIP J. Appl. Signal Process., vol. 1, no. 11, pp. 1260-1273, Nov. 2002.
- (2002) EURASIP J. Appl. Signal Process , vol.1 , Issue.11 , pp. 1260-1273
- Heckmann, M.¹ Berthommier, F.² Kroschel, K.³

7
- 70350473971
- On the integration of auditory and visual parameters in an HMM-based ASR
- A. Adjoudani and C. Benoit, "On the integration of auditory and visual parameters in an HMM-based ASR, " Springer Verlag, Series F: Comput. Syst. Sci., vol. 150, pp. 465-472, 1996.
- (1996) Springer Verlag, Series F: Comput. Syst. Sci. , vol.150 , pp. 465-472
- Adjoudani, A.¹ Benoit, C.²

8
- 0034842342
- Asynchronous stream modeling for large vocabulary audio-visual speech recognition
- Salt Lake City, UT, May
- J. Luettin, G. Potamianos, and C. Neti, "Asynchronous stream modeling for large vocabulary audio-visual speech recognition, " in Proc. ICASSP, Salt Lake City, UT, May 2001, pp. 169-172.
- (2001) Proc. ICASSP , pp. 169-172
- Luettin, J.¹ Potamianos, G.² Neti, C.³

9
- 0030676381
- Maximum likelihood weighting of dynamic speech features for CDHMM speech recognition
- J. Hernando, "Maximum likelihood weighting of dynamic speech features forCDHMMspeech recognition, " in Proc. ICASSP, Munich, Germany, Apr. 1997, pp. 1267-1270.
- (1997) Proc. ICASSP , pp. 1267-1270
- Hernando, J.¹

10
- 0031624666
- Discrimative training of HMM stream exponents for audio-visual speech recognition
- Seattle, WA, May
- G. Potamianos and H. P. Graf, "Discrimative training of HMM stream exponents for audio-visual speech recognition, " in Proc. ICASSP, Seattle, WA, May 1998, pp. 3733-3736.
- (1998) Proc. ICASSP , pp. 3733-3736
- Potamianos, G.¹ Graf, H.P.²

11
- 85009091822
- Audio visual speech recognition using MCE-basedhmmsand model dependent stream weights
- Beijing, China, Oct
- C. Miyajima, K. Tokuda, and T. Kitamura, "Audio visual speech recognition using MCE-basedHMMsand model dependent stream weights, " in Proc. ICSLP, Beijing, China, Oct. 2000.
- (2000) Proc. ICSLP
- Miyajima, C.¹ Tokuda, K.² Kitamura, T.³

12
- 0034853041
- Hierarchical discriminant features for audio-visual lvcsr
- Salt Lake City, UT, May
- G. Potamianos, J. Luettin, and C. Neti, "Hierarchical discriminant features for audio-visual LVCSR, " in Proc. ICASSP, Salt Lake City, UT, May 2001, pp. 165-168.
- (2001) Proc. ICASSP , pp. 165-168
- Potamianos, G.¹ Luettin, J.² Neti, C.³

13
- 0036295828
- Robust bi-modal speech recognition based on state synchronous modeling stream weight optimization
- Orlando, FL, May
- S. Nakamura, K. Kumatani, and S. Tamura, "Robust bi-modal speech recognition based on state synchronous modeling stream weight optimization, " in Proc. ICASSP, Orlando, FL, May 2002, pp. 309-312.
- (2002) Proc. ICASSP , pp. 309-312
- Nakamura, S.¹ Kumatani, K.² Tamura, S.³

14
- 0002100804
- Adaptive determination of audio and visual weights for automatic speech recognition
- Rhodes, Greece, Sep
- A. Rogozan, P. Deléglise, and M. Alissali, "Adaptive determination of audio and visual weights for automatic speech recognition, " in Proc. Workshop Audio-Visual Speech Process., Rhodes, Greece, Sep. 1997.
- (1997) Proc. Workshop Audio-Visual Speech Process
- Rogozan, A.¹ Deléglise, P.² Alissali, M.³

15
- 0034842451
- Weighting schemes for audio-visual fusion in speech recognition
- Salt Lake City, UT, May
- H. Glotin, D. Vergyri, C. Neti, G. Potamianos, and J. Luettin, "Weighting schemes for audio-visual fusion in speech recognition, " in Proc. ICASSP, Salt Lake City, UT, May 2001.
- (2001) Proc. ICASSP
- Glotin, H.¹ Vergyri, D.² Neti, C.³ Potamianos, G.⁴ Luettin, J.⁵

16
- 85009154155
- Stream weight optimization of speech and lip image sequence for audio-visul speech recognition
- Beijing, China, Oct.G. Potamianos and C. Neti, "Stream confidence estimation for audio-visual speech recognition, " in Proc. ICSLP, Beijing, China, Oct. 2000
- S. Nakamura, H. Ito, and K. Shikano, "Stream weight optimization of speech and lip image sequence for audio-visul speech recognition, " in Proc. ICSLP, Beijing, China, Oct. 2000G. Potamianos and C. Neti, "Stream confidence estimation for audio-visual speech recognition, " in Proc. ICSLP, Beijing, China, Oct. 2000.
- (2000) Proc. ICSLP
- Nakamura, S.¹ Ito, H.² Shikano, K.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.