SCOPUS 정보 검색 플랫폼

8th International Conference on Spoken Language Processing, ICSLP 2004

Volumn , Issue , 2004, Pages 2489-2492

AVICAR: Audio-Visual Speech Corpus in a Car Environment

(7) Lee, Bowon a Hasegawa Johnson, Mark a Goudeseune, Camille a Kamdar, Suketu a Borys, Sarah a Liu, Ming a Huang, Thomas a

a University of Illinois at Urbana Champaign (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO EQUIPMENT; SIGNAL TO NOISE RATIO; SPEECH ANALYSIS; VIDEO CAMERAS;

AUDIO-VISUAL SPEECH; MULTI-SENSORY; NOISE CONDITIONS; PHONE NUMBER;

ACOUSTIC NOISE;

EID: 85009135251 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (127)

References (26)

1
- 85009106482
- Audiovisual representation of prosody in expressive speech communication
- B. Granström and D. House, "Audiovisual representation of prosody in expressive speech communication," ISCA Int. Conf. Speech Prosody, pp. 393-400, 2004.
- (2004) ISCA Int. Conf. Speech Prosody , pp. 393-400
- Granström, B.¹ House, D.²

2
- 0017199877
- Hearing lips and seeing voices
- H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, vol. 264, pp. 746-748, 1976.
- (1976) Nature , vol.264 , pp. 746-748
- McGurk, H.¹ MacDonald, J.²

3
- 0001048664
- Visual contributions to speech intelligibility in noise
- W. H. Sumby and I. Pollak, "Visual contributions to speech intelligibility in noise," J. Acoust. Soc. Am., vol. 26, No. 2, pp. 212-215, 1954.
- (1954) J. Acoust. Soc. Am. , vol.26 , Issue.2 , pp. 212-215
- Sumby, W.H.¹ Pollak, I.²

4
- 0025767028
- Evaluating the articulation index for auditory-visual input
- K. W. Grant and L. D. Braida, "Evaluating the articulation index for auditory-visual input," J. Acoust. Soc. Am., vol. 89, No. 6, pp. 2952-2960, 1991.
- (1991) J. Acoust. Soc. Am. , vol.89 , Issue.6 , pp. 2952-2960
- Grant, K.W.¹ Braida, L.D.²

5
- 85027136924
- Minimum error rate training of inter-word context dependent acoustic model units in speech recognition
- W. Chou, C.-H. Lee, and B. H. Juang, "Minimum error rate training of inter-word context dependent acoustic model units in speech recognition," Proc. Int. Conf. Spoken Lang. Process., pp. 439-442, 1994.
- (1994) Proc. Int. Conf. Spoken Lang. Process. , pp. 439-442
- Chou, W.¹ Lee, C.-H.² Juang, B.H.³

6
- 0032140546
- On stochastic feature and model compensation approaches to robust speech recognition
- C.-H. Lee, "On stochastic feature and model compensation approaches to robust speech recognition," Speech Comm., vol. 25, No. 1, pp. 29-47, 1998.
- (1998) Speech Comm. , vol.25 , Issue.1 , pp. 29-47
- Lee, C.-H.¹

7
- 84946801025
- Use of real and contaminated speech for training of a hands-free in-car speech recognizer
- M. Matassoni, M. Omologo, and P. Svaizer, "Use of real and contaminated speech for training of a hands-free in-car speech recognizer," Eurospeech, 2001.
- (2001) Eurospeech
- Matassoni, M.¹ Omologo, M.² Svaizer, P.³

8
- 0000874053
- Le signe de l'elevation de la voix
- E. Lombard, "Le signe de l'elevation de la voix," Ann. Maladies Oreille, Larynx, Nez, Pharynx, vol. 37, pp. 101-119, 1911.
- (1911) Ann. Maladies Oreille, Larynx, Nez, Pharynx , vol.37 , pp. 101-119
- Lombard, E.¹

9
- 0022915795
- Recognition of speech under stress and in noise
- P. Rajasekaran, G. Doddington, and J. Picone, "Recognition of speech under stress and in noise," Proc. Int. Conf. Acoust., Speech, and Sig. Process., pp. 733-736, 1986.
- (1986) Proc. Int. Conf. Acoust., Speech, and Sig. Process , pp. 733-736
- Rajasekaran, P.¹ Doddington, G.² Picone, J.³

10
- 0034817675
- Optimized second-order gradient microphone for hands-free speech recordings in cars
- R. Aubauer and D. Leckschat, "Optimized second-order gradient microphone for hands-free speech recordings in cars," Speech Comm., vol. 34, No. 1-2, pp. 13-23, 2001.
- (2001) Speech Comm. , vol.34 , Issue.1-2 , pp. 13-23
- Aubauer, R.¹ Leckschat, D.²

11
- 0009590598
- Springer Verlag
- M. S. Brandstein and D. B. Ward, Microphone Arrays: Signal Processing Techniques and Applications. Springer Verlag, 2001.
- (2001) Microphone Arrays: Signal Processing Techniques and Applications
- Brandstein, M.S.¹ Ward, D.B.²

12
- 85135275880
- The SpeechDat-car multilingual speech databases for in-car applications: Some first validation results
- H. V. den Heuvel, R. Boudy, S. Euler, A. Moreno, and G. Richard, "The SpeechDat-Car multilingual speech databases for in-car applications: Some first validation results," Eurospeech, pp. 2279-2282, 1999.
- (1999) Eurospeech , pp. 2279-2282
- Den Heuvel, H.V.¹ Boudy, R.² Euler, S.³ Moreno, A.⁴ Richard, G.⁵

13
- 85009152939
- CU-move: Robust speech processing for in-vehicle speech systems
- J. H. L. Hansen, J. Plucienkowski, S. Gallant, B. Pellom, and W. Ward, "CU-Move: Robust speech processing for in-vehicle speech systems," Proc. Int. Conf. Spoken Lang. Process., pp. 524-527, 2000.
- (2000) Proc. Int. Conf. Spoken Lang. Process , pp. 524-527
- Hansen, J.H.L.¹ Plucienkowski, J.² Gallant, S.³ Pellom, B.⁴ Ward, W.⁵

14
- 0013302639
- CSDC - The MoTiV car speech data collection
- D. Langmann, H. R. Pfitzinger, T. Schneider, R. Grudszus, A. Fischer, M. Westphal, T. Crull, and U. Jekosch, "CSDC - the MoTiV car speech data collection," Proc. Int. Conf. Lang. Resources and Eval., pp. 1107-1110, 1998.
- (1998) Proc. Int. Conf. Lang. Resources and Eval. , pp. 1107-1110
- Langmann, D.¹ Pfitzinger, H.R.² Schneider, T.³ Grudszus, R.⁴ Fischer, A.⁵ Westphal, M.⁶ Crull, T.⁷ Jekosch, U.⁸

15
- 85032752352
- Audiovisual speech processing
- T. Chen, "Audiovisual speech processing," IEEE Sig. Process. Magazine, vol. 18, No. 1, pp. 9-21, 2001.
- (2001) IEEE Sig. Process. Magazine , vol.18 , Issue.1 , pp. 9-21
- Chen, T.¹

16
- 0036295989
- Audio-visual speech modeling using coupled hidden Markov models
- S. Chu and T. Huang, "Audio-visual speech modeling using coupled hidden Markov models," Proc. Int. Conf. Acoust., Speech, and Sig. Process., pp. 2009-2012, 2002.
- (2002) Proc. Int. Conf. Acoust., Speech, and Sig. Process. , pp. 2009-2012
- Chu, S.¹ Huang, T.²

17
- 85009099416
- http://amp.ece.cmu.edu/projects/AudioVisualSpeechProcessing/.

18
- 0036299249
- CUAVE: A new audio-visual database for multimodal human-computer interface research
- E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy, "CUAVE: A new audio-visual database for multimodal human-computer interface research," Proc. Int. Conf. Acoust., Speech, and Sig. Process., pp. 2017-2020, 2002.
- (2002) Proc. Int. Conf. Acoust., Speech, and Sig. Process , pp. 2017-2020
- Patterson, E.K.¹ Gurbuz, S.² Tufekci, Z.³ Gowdy, J.N.⁴

19
- 84948594425
- An algorithm for linearly constrained adaptive array processing
- O. L. Frost, III, "An algorithm for linearly constrained adaptive array processing," Proc. of IEEE, vol. 60, No. 8, pp. 926-935, 1972.
- (1972) Proc. of IEEE , vol.60 , Issue.8 , pp. 926-935
- Frost, O.L.¹

20
- 0019928857
- An alternative approach to linearly constrained adaptive beamforming
- L. J. Griffiths and C. W. Jim, "An alternative approach to linearly constrained adaptive beamforming," IEEE Trans. Antennas and Propag., vol. 30, No. 1, pp. 27-34, 1982.
- (1982) IEEE Trans. Antennas and Propag. , vol.30 , Issue.1 , pp. 27-34
- Griffiths, L.J.¹ Jim, C.W.²

21
- 0034818519
- Multi-microphone noise reduction techniques as front-end devices for speech recognition
- J. Bitzer, K. U. Simmer, and K.-D. Kammeyer, "Multi-microphone noise reduction techniques as front-end devices for speech recognition," Speech Comm., vol. 34, pp. 3-12, 2001.
- (2001) Speech Comm. , vol.34 , pp. 3-12
- Bitzer, J.¹ Simmer, K.U.² Kammeyer, K.-D.³

22
- 0032677010
- Performance of an hmm speech recognizer using a real-time tracking microphone array as input
- T. B. Hughes, H.-S. Kim, J. H. DiBiase, and H. F. Silverman, "Performance of an hmm speech recognizer using a real-time tracking microphone array as input," IEEE Trans. Speech and Audio Process., vol. 7, No. 3, pp. 346-349, 1999.
- (1999) IEEE Trans. Speech and Audio Process , vol.7 , Issue.3 , pp. 346-349
- Hughes, T.B.¹ Kim, H.-S.² DiBiase, J.H.³ Silverman, H.F.⁴

23
- 0034270644
- Audio-visual speech modeling for continuous speech recognition
- S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multim., vol. 2, No. 3, pp. 141-151, 2000.
- (2000) IEEE Trans. Multim. , vol.2 , Issue.3 , pp. 141-151
- Dupont, S.¹ Luettin, J.²

24
- 0032309170
- 3D modeling and tracking of human lip motions
- S. Basu, N. Oliver, and A. Pentland, "3D modeling and tracking of human lip motions," Proc. Sixth Int. Conf. Computer Vision, pp. 337-343, 1998.
- (1998) Proc. Sixth Int. Conf. Computer Vision , pp. 337-343
- Basu, S.¹ Oliver, N.² Pentland, A.³

25
- 0036844217
- Modeling and animating realistic faces from images
- F. Pighin, R. Szeliski, and D. H. Salesin, "Modeling and animating realistic faces from images," Int. J. of Computer Vision, vol. 50, No. 2, pp. 143-169, 2002.
- (2002) Int. J. of Computer Vision , vol.50 , Issue.2 , pp. 143-169
- Pighin, F.¹ Szeliski, R.² Salesin, D.H.³

26
- 0025477640
- Speech database development at MIT: Timit and beyond
- V. Zue, S. Seneff, and J. Glass, "Speech database development at MIT: TIMIT and beyond," Speech Comm., vol. 9, No. 4, pp. 351-356, 1990.
- (1990) Speech Comm. , vol.9 , Issue.4 , pp. 351-356
- Zue, V.¹ Seneff, S.² Glass, J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.