메뉴 건너뛰기




Volumn 17, Issue 3, 2009, Pages 436-445

Unsupervised stream-weights computation in classification and recognition tasks

Author keywords

Decision fusion; Multistream weights estimation; Robust speech recognition

Indexed keywords

ARTIFICIAL DATA; AUDIO-VISUAL SPEECH; BACKGROUND MODEL; CLASS-DISTANCE; CLASSIFICATION AND RECOGNITION; CLASSIFICATION ERRORS; CLASSIFICATION TASKS; DECISION FUSION; INFORMATIVENESS; MODELING ERRORS; MULTISTREAM WEIGHTS ESTIMATION; NONLINEAR FUNCTIONS; ROBUST SPEECH RECOGNITION; STREAM RELIABILITY; TESTING CONDITIONS; THEORETICAL RESULT; TWO-STREAM; USE-MODEL; WEIGHT ESTIMATION;

EID: 70350439278     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2011513     Document Type: Article
Times cited : (6)

References (16)
  • 1
    • 0030355935 scopus 로고    scopus 로고
    • A new ASR approach based on independent processing and recombination of partial frequency bands
    • Philadelphia, PA, Oct
    • H. Bourlard and S. Dupont, "A new ASR approach based on independent processing and recombination of partial frequency bands, " in Proc. ICSLP, Philadelphia, PA, Oct. 1996.
    • (1996) Proc. ICSLP
    • Bourlard, H.1    Dupont, S.2
  • 2
    • 85009097228 scopus 로고    scopus 로고
    • Modeling auxiliary information in bayesian network based ASR
    • Aalborg, Denmark, Sep
    • T. Stephenson, M. Mathew, and H. Bourlard, "Modeling auxiliary information in bayesian network based ASR, " in Proc. Eurospeech, Aalborg, Denmark, Sep. 2001.
    • (2001) Proc. Eurospeech
    • Stephenson, T.1    Mathew, M.2    Bourlard, H.3
  • 3
    • 4544290191 scopus 로고    scopus 로고
    • Recent advances in the automatic recognition of audiovisual speech
    • Sep
    • G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. Senior, "Recent advances in the automatic recognition of audiovisual speech, " Proc. IEEE, vol. 91, no. 9, pp. 1306-1326, Sep. 2003.
    • (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1326
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.5
  • 4
    • 85135374344 scopus 로고
    • Integration of acoustic and visual speech for speaker recognition
    • Berlin, Germany, Sep
    • C. Chibelushi, J. Mason, and F. Deravi, "Integration of acoustic and visual speech for speaker recognition, " in Proc. EUROSPEECH, Berlin, Germany, Sep. 1993.
    • (1993) Proc. EUROSPEECH
    • Chibelushi, C.1    Mason, J.2    Deravi, F.3
  • 5
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modeling for continuous speech recognition
    • Sep
    • S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition, " IEEE Trans. Multimedia, vol. 2, no. 3, pp. 141-151, Sep. 2000.
    • (2000) IEEE Trans. Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 6
    • 0036874527 scopus 로고    scopus 로고
    • Noise adaptive stream weighting in audio-visual speech recognition
    • Nov
    • M. Heckmann, F. Berthommier, and K. Kroschel, "Noise adaptive stream weighting in audio-visual speech recognition, " EURASIP J. Appl. Signal Process., vol. 1, no. 11, pp. 1260-1273, Nov. 2002.
    • (2002) EURASIP J. Appl. Signal Process , vol.1 , Issue.11 , pp. 1260-1273
    • Heckmann, M.1    Berthommier, F.2    Kroschel, K.3
  • 7
    • 70350473971 scopus 로고    scopus 로고
    • On the integration of auditory and visual parameters in an HMM-based ASR
    • A. Adjoudani and C. Benoit, "On the integration of auditory and visual parameters in an HMM-based ASR, " Springer Verlag, Series F: Comput. Syst. Sci., vol. 150, pp. 465-472, 1996.
    • (1996) Springer Verlag, Series F: Comput. Syst. Sci. , vol.150 , pp. 465-472
    • Adjoudani, A.1    Benoit, C.2
  • 8
    • 0034842342 scopus 로고    scopus 로고
    • Asynchronous stream modeling for large vocabulary audio-visual speech recognition
    • Salt Lake City, UT, May
    • J. Luettin, G. Potamianos, and C. Neti, "Asynchronous stream modeling for large vocabulary audio-visual speech recognition, " in Proc. ICASSP, Salt Lake City, UT, May 2001, pp. 169-172.
    • (2001) Proc. ICASSP , pp. 169-172
    • Luettin, J.1    Potamianos, G.2    Neti, C.3
  • 9
    • 0030676381 scopus 로고    scopus 로고
    • Maximum likelihood weighting of dynamic speech features for CDHMM speech recognition
    • J. Hernando, "Maximum likelihood weighting of dynamic speech features forCDHMMspeech recognition, " in Proc. ICASSP, Munich, Germany, Apr. 1997, pp. 1267-1270.
    • (1997) Proc. ICASSP , pp. 1267-1270
    • Hernando, J.1
  • 10
    • 0031624666 scopus 로고    scopus 로고
    • Discrimative training of HMM stream exponents for audio-visual speech recognition
    • Seattle, WA, May
    • G. Potamianos and H. P. Graf, "Discrimative training of HMM stream exponents for audio-visual speech recognition, " in Proc. ICASSP, Seattle, WA, May 1998, pp. 3733-3736.
    • (1998) Proc. ICASSP , pp. 3733-3736
    • Potamianos, G.1    Graf, H.P.2
  • 11
    • 85009091822 scopus 로고    scopus 로고
    • Audio visual speech recognition using MCE-basedhmmsand model dependent stream weights
    • Beijing, China, Oct
    • C. Miyajima, K. Tokuda, and T. Kitamura, "Audio visual speech recognition using MCE-basedHMMsand model dependent stream weights, " in Proc. ICSLP, Beijing, China, Oct. 2000.
    • (2000) Proc. ICSLP
    • Miyajima, C.1    Tokuda, K.2    Kitamura, T.3
  • 12
    • 0034853041 scopus 로고    scopus 로고
    • Hierarchical discriminant features for audio-visual lvcsr
    • Salt Lake City, UT, May
    • G. Potamianos, J. Luettin, and C. Neti, "Hierarchical discriminant features for audio-visual LVCSR, " in Proc. ICASSP, Salt Lake City, UT, May 2001, pp. 165-168.
    • (2001) Proc. ICASSP , pp. 165-168
    • Potamianos, G.1    Luettin, J.2    Neti, C.3
  • 13
    • 0036295828 scopus 로고    scopus 로고
    • Robust bi-modal speech recognition based on state synchronous modeling stream weight optimization
    • Orlando, FL, May
    • S. Nakamura, K. Kumatani, and S. Tamura, "Robust bi-modal speech recognition based on state synchronous modeling stream weight optimization, " in Proc. ICASSP, Orlando, FL, May 2002, pp. 309-312.
    • (2002) Proc. ICASSP , pp. 309-312
    • Nakamura, S.1    Kumatani, K.2    Tamura, S.3
  • 14
    • 0002100804 scopus 로고    scopus 로고
    • Adaptive determination of audio and visual weights for automatic speech recognition
    • Rhodes, Greece, Sep
    • A. Rogozan, P. Deléglise, and M. Alissali, "Adaptive determination of audio and visual weights for automatic speech recognition, " in Proc. Workshop Audio-Visual Speech Process., Rhodes, Greece, Sep. 1997.
    • (1997) Proc. Workshop Audio-Visual Speech Process
    • Rogozan, A.1    Deléglise, P.2    Alissali, M.3
  • 15
    • 0034842451 scopus 로고    scopus 로고
    • Weighting schemes for audio-visual fusion in speech recognition
    • Salt Lake City, UT, May
    • H. Glotin, D. Vergyri, C. Neti, G. Potamianos, and J. Luettin, "Weighting schemes for audio-visual fusion in speech recognition, " in Proc. ICASSP, Salt Lake City, UT, May 2001.
    • (2001) Proc. ICASSP
    • Glotin, H.1    Vergyri, D.2    Neti, C.3    Potamianos, G.4    Luettin, J.5
  • 16
    • 85009154155 scopus 로고    scopus 로고
    • Stream weight optimization of speech and lip image sequence for audio-visul speech recognition
    • Beijing, China, Oct.G. Potamianos and C. Neti, "Stream confidence estimation for audio-visual speech recognition, " in Proc. ICSLP, Beijing, China, Oct. 2000
    • S. Nakamura, H. Ito, and K. Shikano, "Stream weight optimization of speech and lip image sequence for audio-visul speech recognition, " in Proc. ICSLP, Beijing, China, Oct. 2000G. Potamianos and C. Neti, "Stream confidence estimation for audio-visual speech recognition, " in Proc. ICSLP, Beijing, China, Oct. 2000.
    • (2000) Proc. ICSLP
    • Nakamura, S.1    Ito, H.2    Shikano, K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.