메뉴 건너뛰기




Volumn , Issue , 2009, Pages

Hough transform-based mouth localization for audio-visual speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

FACE RECOGNITION; HOUGH TRANSFORMS; SPEECH RECOGNITION;

EID: 84898888944     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.5244/C.23.78     Document Type: Conference Paper
Times cited : (18)

References (29)
  • 1
    • 0001492549 scopus 로고    scopus 로고
    • Shape quantization and recognition with randomized trees
    • Y. Amit and D. Geman. Shape quantization and recognition with randomized trees. Neural Computation, 9(7):1545-1588, 1997.
    • (1997) Neural Computation , vol.9 , Issue.7 , pp. 1545-1588
    • Amit, Y.1    Geman, D.2
  • 2
    • 0019397313 scopus 로고
    • Generalizing the hough transform to detect arbitrary shapes
    • D. H. Ballard. Generalizing the hough transform to detect arbitrary shapes. Pattern Recognition, 13(2):111-122, 1981.
    • (1981) Pattern Recognition , vol.13 , Issue.2 , pp. 111-122
    • Ballard, D.H.1
  • 4
    • 0035478854 scopus 로고    scopus 로고
    • Random forests
    • L. Breiman. Random forests. Machine Learning, 45(1):5-32, 2001.
    • (2001) Machine Learning , vol.45 , Issue.1 , pp. 5-32
    • Breiman, L.1
  • 5
    • 17444403426 scopus 로고    scopus 로고
    • A multi-stage approach to facial feature detection
    • London, England
    • D. Cristinacce, T. Cootes, and I. Scott. A multi-stage approach to facial feature detection. In British Machine Vision Conference, London, England, pages 277-286, 2004.
    • (2004) British Machine Vision Conference , pp. 277-286
    • Cristinacce, D.1    Cootes, T.2    Scott, I.3
  • 7
    • 0036875002 scopus 로고    scopus 로고
    • A support vector machine-based dynamic network for visual speech recognition applications
    • M. Gordan, C. Kotropoulos, and I. Pitas. A support vector machine-based dynamic network for visual speech recognition applications. EURASIP J. Appl. Signal Process., 2002(1):1248-1259, 2002.
    • (2002) EURASIP J. Appl. Signal Process , vol.2002 , Issue.1 , pp. 1248-1259
    • Gordan, M.1    Kotropoulos, C.2    Pitas, I.3
  • 10
    • 70450273199 scopus 로고    scopus 로고
    • Information theoretic feature extraction for audio-visual speech recognition
    • M. Gurban and J. P. Thiran. Information theoretic feature extraction for audio-visual speech recognition. IEEE Transactions on Signal Processing, 2009.
    • (2009) IEEE Transactions on Signal Processing
    • Gurban, M.1    Thiran, J.P.2
  • 12
    • 84957886748 scopus 로고    scopus 로고
    • Real-time lip tracking for audio-visual speech recognition applications
    • R. Kaucic, B. Dalton, and A. Blake. Real-time lip tracking for audio-visual speech recognition applications. In European Conference on Computer Vision, pages 376-387, 1996.
    • (1996) European Conference on Computer Vision , pp. 376-387
    • Kaucic, R.1    Dalton, B.2    Blake, A.3
  • 13
    • 39749124915 scopus 로고    scopus 로고
    • Robust object detection with interleaved categorization and segmentation
    • B. Leibe, A. Leonardis, and B. Schiele. Robust object detection with interleaved categorization and segmentation. International Journal of Computer Vision, 77(1-3):259-289, 2008.
    • (2008) International Journal of Computer Vision , vol.77 , Issue.1-3 , pp. 259-289
    • Leibe, B.1    Leonardis, A.2    Schiele, B.3
  • 15
    • 84957810405 scopus 로고    scopus 로고
    • A comparison of active shape model and scale decomposition based features for visual speech recognition
    • I Matthews, J. A. Bangham, R. Harvey, and S. Cox. A comparison of active shape model and scale decomposition based features for visual speech recognition. In European Conference on Computer Vision, pages 514-528, 1998.
    • (1998) European Conference on Computer Vision , pp. 514-528
    • Matthews, J.I.1    Bangham, A.2    Harvey, R.3    Cox, S.4
  • 16
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • H. McGurk and J. MacDonald. Hearing lips and seeing voices. Nature, 264:746-748, 1976.
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 18
    • 0036874756 scopus 로고    scopus 로고
    • Moving-talker, speaker-independent feature study, and baseline results using the cuave multimodal speech corpus
    • ISSN 1110-8657
    • E. K. Patterson, S. Gurbuz, Z. Tufekci, and J.N. Gowdy. Moving-talker, speaker-independent feature study, and baseline results using the cuave multimodal speech corpus. EURASIP J. Appl. Signal Process., 2002(1):1189-1201, 2002. ISSN 1110-8657.
    • (2002) EURASIP J. Appl. Signal Process. , vol.2002 , Issue.1 , pp. 1189-1201
    • Patterson, E.K.1    Gurbuz, S.2    Tufekci, Z.3    Gowdy, J.N.4
  • 20
    • 84863802260 scopus 로고    scopus 로고
    • Exploiting lower face symmetry in appearance-based automatic speechreading
    • G. Potamianos and P. Scanlon. Exploiting lower face symmetry in appearance-based automatic speechreading. In Audio-Visual Speech Process., pages 79-84, 2005.
    • (2005) Audio-Visual Speech Process , pp. 79-84
    • Potamianos, G.1    Scanlon, P.2
  • 23
    • 84898429387 scopus 로고    scopus 로고
    • BioID Technology Research, 2001. http://www.bioid.de/.
    • (2001)


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.