메뉴 건너뛰기




Volumn , Issue , 2004, Pages 235-242

A segment-based Audio-Visual speech recognizer: Data collection, development, and initial experiments

Author keywords

Audio visual corpora; Audio visual speech recognition

Indexed keywords

APPROXIMATION THEORY; COMPUTER SIMULATION; DATA ACQUISITION; INFORMATION ANALYSIS; MARKOV PROCESSES; MATHEMATICAL MODELS;

EID: 14944353581     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (108)

References (22)
  • 1
    • 0039228740 scopus 로고
    • The intrinsic bimodality of speech communication and the synthesis of talking faces
    • Hungary, September
    • C. Benoit. The intrinsic bimodality of speech communication and the synthesis of talking faces. In Journal on Communications of the Scientific Society for Telecommunications, Hungary, number 43, pages 32-40, September 1992.
    • (1992) Journal on Communications of the Scientific Society for Telecommunications , Issue.43 , pp. 32-40
    • Benoit, C.1
  • 4
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modeling for continuous speech recognition
    • September
    • S. Dupont and J. Luettin. Audio-visual speech modeling for continuous speech recognition. In IEEE Transactions on Multimedia, number 2, pages 141-151, September 2000.
    • (2000) IEEE Transactions on Multimedia , Issue.2 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 5
    • 0038359548 scopus 로고    scopus 로고
    • A probabilistic framework for segment-based speech recognition
    • To appear in
    • J. Glass. A probabilistic framework for segment-based speech recognition. To appear in Computer Speech and Language, 2003.
    • (2003) Computer Speech and Language
    • Glass, J.1
  • 6
    • 85128407852 scopus 로고    scopus 로고
    • Heterogeneous measurements and multiple classifiers for speech recognition
    • Sydney, Australia, November
    • A. Halberstadt and J. Glass. Heterogeneous measurements and multiple classifiers for speech recognition. In Proceedings of ICSLP 98, Sydney, Australia, November 1998.
    • (1998) Proceedings of ICSLP 98
    • Halberstadt, A.1    Glass, J.2
  • 9
    • 14944355052 scopus 로고    scopus 로고
    • Intel's AVCSR Toolkit source code can be downloaded from http://sourceforge.net/projects/opencvlibrary/.
  • 10
    • 0024768209 scopus 로고
    • Speaker-independent phone recognition using hidden markov models
    • November
    • K. F. Lee and H. W. Hon. Speaker-independent phone recognition using hidden Markov models. In IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 37, no. 11, pp. 1641-1648, November 1989.
    • (1989) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.37 , Issue.11 , pp. 1641-1648
    • Lee, K.F.1    Hon, H.W.2
  • 15
  • 17
    • 85009230873 scopus 로고    scopus 로고
    • Audio-visual speech recognition in challenging environments
    • Geneva, Switzerland, September
    • G. Potamianos and C. Neti. Audio-visual speech recognition in challenging environments. In Proc. Of EUROSPEECH, pp. 1293-1296, Geneva, Switzerland, September 2003.
    • (2003) Proc. of EUROSPEECH , pp. 1293-1296
    • Potamianos, G.1    Neti, C.2
  • 18
    • 14944351246 scopus 로고    scopus 로고
    • Articulatory features for robust visual speech recognition
    • In these proceedings, State College, Pennsylvania
    • K. Saenko, T. Darrel, and J. Glass. Articulatory features for robust visual speech recognition In these proceedings, ICMI'04, State College, Pennsylvania, 2004.
    • (2004) ICMI'04
    • Saenko, K.1    Darrel, T.2    Glass, J.3
  • 19
    • 0041355006 scopus 로고    scopus 로고
    • The VidTIMIT database
    • Martigny, Switzerland
    • C. Sanderson. The VidTIMIT Database. IDIAP Communication 02-06, Martigny, Switzerland, 2002.
    • (2002) IDIAP Communication , vol.2 , Issue.6
    • Sanderson, C.1
  • 22
    • 0025477640 scopus 로고
    • Speech database development: TIMIT and beyond
    • V. Zue, S. Seneff, and J. Glass. Speech database development: TIMIT and beyond. Speech Communication, vol. 9, no. 4, pp. 351-356, 1990.
    • (1990) Speech Communication , vol.9 , Issue.4 , pp. 351-356
    • Zue, V.1    Seneff, S.2    Glass, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.