메뉴 건너뛰기




Volumn , Issue , 2006, Pages 154-157

An asynchronous DBN for audio-visual speech recognition

Author keywords

Speech recognition

Indexed keywords

AUDIO ACOUSTICS; BAYESIAN NETWORKS; ERROR ANALYSIS; INFERENCE ENGINES; LINGUISTICS; RIVERS; SPEECH; SPEECH ANALYSIS; UNDERWATER ACOUSTICS; ACOUSTIC NOISE;

EID: 48749083240     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SLT.2006.326841     Document Type: Conference Paper
Times cited : (15)

References (16)
  • 2
    • 0036293559 scopus 로고    scopus 로고
    • The Graphical Models Toolkit: An open source software system for speech and time-series processing, in Proc
    • J. Bilmes and G. Zweig, "The Graphical Models Toolkit: An open source software system for speech and time-series processing," in Proc. ICASSP, 2002.
    • ICASSP, 2002
    • Bilmes, J.1    Zweig, G.2
  • 5
    • 85156254941 scopus 로고
    • Factorial hidden Markov models
    • Systems, D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, eds, MIT Press, Cambridge, MA, USA
    • Z. Ghahramani and M. Jordan, "Factorial hidden Markov models," in Proc. Conference Advances in Neural Information Processing Systems, D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, eds., vol. 8, pp. 472-478, MIT Press, Cambridge, MA, USA, 1995.
    • (1995) Proc. Conference Advances in Neural Information Processing , vol.8 , pp. 472-478
    • Ghahramani, Z.1    Jordan, M.2
  • 6
    • 4544343002 scopus 로고    scopus 로고
    • DBN based multi-stream models for audio-visual speech recognition
    • J. Gowdy, A. Subramanya, C. Bartels, and J. Bilmes, "DBN based multi-stream models for audio-visual speech recognition," in Proc. ICASSP, 2004.
    • (2004) Proc. ICASSP
    • Gowdy, J.1    Subramanya, A.2    Bartels, C.3    Bilmes, J.4
  • 8
    • 14944341906 scopus 로고    scopus 로고
    • Feature-based pronunciation modeling for speech recognition
    • K. Livescu and J. Glass, "Feature-based pronunciation modeling for speech recognition," in Proc. HLT/NAACL, 2004.
    • (2004) Proc. HLT/NAACL
    • Livescu, K.1    Glass, J.2
  • 9
    • 78651465434 scopus 로고    scopus 로고
    • Feature-based pronunciation modeling with trainable asynchrony probabilities
    • K. Livescu and J. Glass, "Feature-based pronunciation modeling with trainable asynchrony probabilities," in Proc. ICSLP, 2004.
    • (2004) Proc. ICSLP
    • Livescu, K.1    Glass, J.2
  • 12
    • 84957551318 scopus 로고    scopus 로고
    • Loosely-Coupled HMMs for ASR
    • H. Nock and S. Young, "Loosely-Coupled HMMs for ASR," in Proc. ICSLP, 2000.
    • (2000) Proc. ICSLP
    • Nock, H.1    Young, S.2
  • 13
    • 0036299249 scopus 로고    scopus 로고
    • CUAVE: A new audio-visual database for multimodal human-computer interface research, in Proc
    • E.K. Patterson, S. Gurbuz, Z. Tufekci, and J.N. Gowdy, "CUAVE: A new audio-visual database for multimodal human-computer interface research," in Proc. ICASSP, 2002.
    • ICASSP, 2002
    • Patterson, E.K.1    Gurbuz, S.2    Tufekci, Z.3    Gowdy, J.N.4
  • 14


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.