



2004, Pages 2489-2492

AVICAR: Audio-Visual Speech Corpus in a Car Environment

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO EQUIPMENT; SIGNAL TO NOISE RATIO; SPEECH ANALYSIS; VIDEO CAMERAS

EID: 85009135251     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 127

References (26)
  • 1
    • 85009106482
    • Audiovisual representation of prosody in expressive speech communication
    • B. Granström and D. House, "Audiovisual representation of prosody in expressive speech communication," ISCA Int. Conf. Speech Prosody, pp. 393-400, 2004.
    • (2004) ISCA Int. Conf. Speech Prosody , pp. 393-400
    • Granström, B.1    House, D.2
  • 2
    • 0017199877
    • Hearing lips and seeing voices
    • H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, vol. 264, pp. 746-748, 1976.
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 3
    • 0001048664
    • Visual contributions to speech intelligibility in noise
    • W. H. Sumby and I. Pollack, "Visual contributions to speech intelligibility in noise," J. Acoust. Soc. Am., vol. 26, No. 2, pp. 212-215, 1954.
    • (1954) J. Acoust. Soc. Am. , vol.26 , Issue.2 , pp. 212-215
    • Sumby, W.H.1    Pollack, I.2
  • 4
    • 0025767028
    • Evaluating the articulation index for auditory-visual input
    • K. W. Grant and L. D. Braida, "Evaluating the articulation index for auditory-visual input," J. Acoust. Soc. Am., vol. 89, No. 6, pp. 2952-2960, 1991.
    • (1991) J. Acoust. Soc. Am. , vol.89 , Issue.6 , pp. 2952-2960
    • Grant, K.W.1    Braida, L.D.2
  • 5
    • 85027136924
    • Minimum error rate training of inter-word context dependent acoustic model units in speech recognition
    • W. Chou, C.-H. Lee, and B. H. Juang, "Minimum error rate training of inter-word context dependent acoustic model units in speech recognition," Proc. Int. Conf. Spoken Lang. Process., pp. 439-442, 1994.
    • (1994) Proc. Int. Conf. Spoken Lang. Process. , pp. 439-442
    • Chou, W.1    Lee, C.-H.2    Juang, B.H.3
  • 6
    • 0032140546
    • On stochastic feature and model compensation approaches to robust speech recognition
    • C.-H. Lee, "On stochastic feature and model compensation approaches to robust speech recognition," Speech Comm., vol. 25, No. 1, pp. 29-47, 1998.
    • (1998) Speech Comm. , vol.25 , Issue.1 , pp. 29-47
    • Lee, C.-H.1
  • 7
    • 84946801025
    • Use of real and contaminated speech for training of a hands-free in-car speech recognizer
    • M. Matassoni, M. Omologo, and P. Svaizer, "Use of real and contaminated speech for training of a hands-free in-car speech recognizer," Eurospeech, 2001.
    • (2001) Eurospeech
    • Matassoni, M.1    Omologo, M.2    Svaizer, P.3
  • 10
    • 0034817675
    • Optimized second-order gradient microphone for hands-free speech recordings in cars
    • R. Aubauer and D. Leckschat, "Optimized second-order gradient microphone for hands-free speech recordings in cars," Speech Comm., vol. 34, No. 1-2, pp. 13-23, 2001.
    • (2001) Speech Comm. , vol.34 , Issue.1-2 , pp. 13-23
    • Aubauer, R.1    Leckschat, D.2
  • 12
    • 85135275880
    • The SpeechDat-Car multilingual speech databases for in-car applications: Some first validation results
    • H. V. den Heuvel, R. Boudy, S. Euler, A. Moreno, and G. Richard, "The SpeechDat-Car multilingual speech databases for in-car applications: Some first validation results," Eurospeech, pp. 2279-2282, 1999.
    • (1999) Eurospeech , pp. 2279-2282
    • Den Heuvel, H.V.1    Boudy, R.2    Euler, S.3    Moreno, A.4    Richard, G.5
  • 15
    • 85032752352
    • Audiovisual speech processing
    • T. Chen, "Audiovisual speech processing," IEEE Sig. Process. Magazine, vol. 18, No. 1, pp. 9-21, 2001.
    • (2001) IEEE Sig. Process. Magazine , vol.18 , Issue.1 , pp. 9-21
    • Chen, T.1
  • 17
    • 85009099416
    • http://amp.ece.cmu.edu/projects/AudioVisualSpeechProcessing/.
  • 19
    • 84948594425
    • An algorithm for linearly constrained adaptive array processing
    • O. L. Frost, III, "An algorithm for linearly constrained adaptive array processing," Proc. of IEEE, vol. 60, No. 8, pp. 926-935, 1972.
    • (1972) Proc. of IEEE , vol.60 , Issue.8 , pp. 926-935
    • Frost, O.L.1
  • 20
    • 0019928857
    • An alternative approach to linearly constrained adaptive beamforming
    • L. J. Griffiths and C. W. Jim, "An alternative approach to linearly constrained adaptive beamforming," IEEE Trans. Antennas and Propag., vol. 30, No. 1, pp. 27-34, 1982.
    • (1982) IEEE Trans. Antennas and Propag. , vol.30 , Issue.1 , pp. 27-34
    • Griffiths, L.J.1    Jim, C.W.2
  • 21
    • 0034818519
    • Multi-microphone noise reduction techniques as front-end devices for speech recognition
    • J. Bitzer, K. U. Simmer, and K.-D. Kammeyer, "Multi-microphone noise reduction techniques as front-end devices for speech recognition," Speech Comm., vol. 34, pp. 3-12, 2001.
    • (2001) Speech Comm. , vol.34 , pp. 3-12
    • Bitzer, J.1    Simmer, K.U.2    Kammeyer, K.-D.3
  • 22
    • 0032677010
    • Performance of an HMM speech recognizer using a real-time tracking microphone array as input
    • T. B. Hughes, H.-S. Kim, J. H. DiBiase, and H. F. Silverman, "Performance of an HMM speech recognizer using a real-time tracking microphone array as input," IEEE Trans. Speech and Audio Process., vol. 7, No. 3, pp. 346-349, 1999.
    • (1999) IEEE Trans. Speech and Audio Process. , vol.7 , Issue.3 , pp. 346-349
    • Hughes, T.B.1    Kim, H.-S.2    DiBiase, J.H.3    Silverman, H.F.4
  • 23
    • 0034270644
    • Audio-visual speech modeling for continuous speech recognition
    • S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multim., vol. 2, No. 3, pp. 141-151, 2000.
    • (2000) IEEE Trans. Multim. , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 25
    • 0036844217
    • Modeling and animating realistic faces from images
    • F. Pighin, R. Szeliski, and D. H. Salesin, "Modeling and animating realistic faces from images," Int. J. of Computer Vision, vol. 50, No. 2, pp. 143-169, 2002.
    • (2002) Int. J. of Computer Vision , vol.50 , Issue.2 , pp. 143-169
    • Pighin, F.1    Szeliski, R.2    Salesin, D.H.3
  • 26
    • 0025477640
    • Speech database development at MIT: TIMIT and beyond
    • V. Zue, S. Seneff, and J. Glass, "Speech database development at MIT: TIMIT and beyond," Speech Comm., vol. 9, No. 4, pp. 351-356, 1990.
    • (1990) Speech Comm. , vol.9 , Issue.4 , pp. 351-356
    • Zue, V.1    Seneff, S.2    Glass, J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.