메뉴 건너뛰기




Volumn 31, Issue 9, 2009, Pages 1700-1707

Multistream articulatory feature-based models for visual speech recognition

Author keywords

Articulatory features; Dynamic Bayesian networks; Support vector machines; Visual speech recognition

Indexed keywords

ARTICULATORY FEATURES; ASYNCHRONY; BASELINE MODELS; DYNAMIC BAYESIAN NETWORK; DYNAMIC BAYESIAN NETWORKS; HIDDEN STATE; MULTI-STREAM; MULTIPLE SEQUENCES; OBSERVATION MODEL; VISUAL SPEECH RECOGNITION; WORD MODELS;

EID: 67650911345     PISSN: 01628828     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPAMI.2008.303     Document Type: Article
Times cited : (30)

References (38)
  • 1
    • 27144455475 scopus 로고    scopus 로고
    • On Soft Evidence in Bayesian Networks,
    • Technical Report UWEETR-2004-00016, Electrical Eng. Dept, Univ. of Washington
    • J. Bilmes, "On Soft Evidence in Bayesian Networks," Technical Report UWEETR-2004-00016, Electrical Eng. Dept., Univ. of Washington, 2004.
    • (2004)
    • Bilmes, J.1
  • 3
    • 85032752364 scopus 로고    scopus 로고
    • Graphical Model Architectures for Speech Recognition
    • Sept
    • J.A. Bilmes and C. Bartels, "Graphical Model Architectures for Speech Recognition," IEEE Signal Processing Magazine, vol. 22, no. 5, pp. 89-100, Sept. 2005.
    • (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 89-100
    • Bilmes, J.A.1    Bartels, C.2
  • 4
    • 0027024362 scopus 로고
    • Articulatory Phonology: An Overview
    • C.P. Browman and L. Goldstein, "Articulatory Phonology: An Overview," Phonetica, vol. 49, nos. 3/4, pp. 155-180, 1992.
    • (1992) Phonetica , vol.49 , Issue.3-4 , pp. 155-180
    • Browman, C.P.1    Goldstein, L.2
  • 5
    • 34547497796 scopus 로고    scopus 로고
    • O. Cetin et al., An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling Proc. Int'l Conf. Acoustics, Speech, and Signal Proc., pp. IV-645-IV-648, Apr. 2007.
    • O. Cetin et al., "An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling" Proc. Int'l Conf. Acoustics, Speech, and Signal Proc., pp. IV-645-IV-648, Apr. 2007.
  • 7
    • 84990553353 scopus 로고
    • A Model for Reasoning About Persistence and Causation
    • Feb
    • T. Dean and K. Kanazawa, "A Model for Reasoning About Persistence and Causation," Computational Intelligence, vol. 5, no. 2, pp. 142-150, Feb. 1989.
    • (1989) Computational Intelligence , vol.5 , Issue.2 , pp. 142-150
    • Dean, T.1    Kanazawa, K.2
  • 8
    • 0002629270 scopus 로고
    • Maximum Likelihood from Incomplete Data via the EM Algorithm
    • A.P. Dempster, N.M. Laird, and D.B. Rubin, "Maximum Likelihood from Incomplete Data via the EM Algorithm," J. Royal Statistical Soc. Series B, vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. Royal Statistical Soc. Series B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 9
    • 0031198059 scopus 로고    scopus 로고
    • Production Models as a Structural Basis for Automatic Speech Recognition
    • Aug
    • L. Deng, G. Ramsay, and D. Sun, "Production Models as a Structural Basis for Automatic Speech Recognition," Speech Comm., vol. 22, nos. 2/3, pp. 93-111, Aug. 1997.
    • (1997) Speech Comm , vol.22 , Issue.2-3 , pp. 93-111
    • Deng, L.1    Ramsay, G.2    Sun, D.3
  • 10
    • 0036875002 scopus 로고    scopus 로고
    • A Support Vector Machine-Based Dynamic Network for Visual Speech Recognition Applications
    • M. Gordan, C. Kotropoulos, and I. Pitas, "A Support Vector Machine-Based Dynamic Network for Visual Speech Recognition Applications," EURASIP J. Applied Signal Processing, vol. 2002, no. 11, pp. 1248-1259, 2002.
    • (2002) EURASIP J. Applied Signal Processing , vol.2002 , Issue.11 , pp. 1248-1259
    • Gordan, M.1    Kotropoulos, C.2    Pitas, I.3
  • 13
    • 14944353581 scopus 로고    scopus 로고
    • T.J. Hazen, K. Saenko, C.-H. La, and J.R. Glass, A Segment-Based Audio-Visual Speech Recognizer: Data Collection, Development, and Initial Experiments Proc. Int'l Conf. Multimodal Interfaces, pp. 235-242, Oct. 2004.
    • T.J. Hazen, K. Saenko, C.-H. La, and J.R. Glass, "A Segment-Based Audio-Visual Speech Recognizer: Data Collection, Development, and Initial Experiments" Proc. Int'l Conf. Multimodal Interfaces, pp. 235-242, Oct. 2004.
  • 15
    • 33846680938 scopus 로고    scopus 로고
    • Speech Production Knowledge in Automatic Speech Recognition
    • Feb
    • S. King et al., "Speech Production Knowledge in Automatic Speech Recognition," J. Acoustical Soc. of Am., vol. 121, no. 2, pp. 723-742, Feb. 2007.
    • (2007) J. Acoustical Soc. of Am , vol.121 , Issue.2 , pp. 723-742
    • King, S.1
  • 16
    • 0034297586 scopus 로고    scopus 로고
    • Detection of Phonological Features in Continuous Speech Using Neural Networks
    • Oct
    • S. King and P. Taylor, "Detection of Phonological Features in Continuous Speech Using Neural Networks," Computer Speech and Language, vol. 14, no. 4, pp. 333-353, Oct. 2000.
    • (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 333-353
    • King, S.1    Taylor, P.2
  • 17
    • 0036642567 scopus 로고    scopus 로고
    • Combining Acoustic and Articulatory Feature Information for Robust Speech Recognition
    • July
    • K. Kirchhoff, G.A. Fink, and G. Sagerer, "Combining Acoustic and Articulatory Feature Information for Robust Speech Recognition," Speech Comm., vol. 37, nos. 3/4, pp. 303-319, July 2002.
    • (2002) Speech Comm , vol.37 , Issue.3-4 , pp. 303-319
    • Kirchhoff, K.1    Fink, G.A.2    Sagerer, G.3
  • 20
    • 78651465434 scopus 로고    scopus 로고
    • Feature-Based Pronunciation Modeling with Trainable Asynchrony Probabilities
    • Oct
    • K. Livescu and J. Glass, "Feature-Based Pronunciation Modeling with Trainable Asynchrony Probabilities" Proc. Int'l Conf. Spoken Language, pp. 677-680, Oct. 2004.
    • (2004) Proc. Int'l Conf. Spoken Language , pp. 677-680
    • Livescu, K.1    Glass, J.2
  • 21
    • 34547548915 scopus 로고    scopus 로고
    • Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition
    • Johns Hopkins Univ, Center for Language and Speech Processing
    • K. Livescu et al., "Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: JHU Summer Workshop Final Report," Johns Hopkins Univ., Center for Language and Speech Processing, 2007.
    • (2007) JHU Summer Workshop Final Report
    • Livescu, K.1
  • 22
    • 0017199877 scopus 로고
    • Hearing Lips and Seeing Voices
    • Dec
    • H. McGurk and J. McDonald, "Hearing Lips and Seeing Voices," Nature vol. 264, no. 5588, pp. 746-748, Dec. 1976.
    • (1976) Nature , vol.264 , Issue.5588 , pp. 746-748
    • McGurk, H.1    McDonald, J.2
  • 23
  • 24
    • 0013288412 scopus 로고    scopus 로고
    • Dynamic Bayesian Networks: Representation, Inference and Learning,
    • PhD dissertation, Computer Science Division, Univ. of California
    • K. Murphy, "Dynamic Bayesian Networks: Representation, Inference and Learning," PhD dissertation, Computer Science Division, Univ. of California, 2002.
    • (2002)
    • Murphy, K.1
  • 27
    • 0036081023 scopus 로고    scopus 로고
    • Modelling Asynchrony in Automatic Speech Recognition Using Loosely Coupled Hidden Markov Models
    • May/June
    • H. Nock and S. Young, "Modelling Asynchrony in Automatic Speech Recognition Using Loosely Coupled Hidden Markov Models," Cognitive Science, vol. 26, no. 3, pp. 283-301, May/June 2002.
    • (2002) Cognitive Science , vol.26 , Issue.3 , pp. 283-301
    • Nock, H.1    Young, S.2
  • 28
    • 1542303714 scopus 로고    scopus 로고
    • A Fused Hidden Markov Model with Application to Bimodal Speech Processing
    • Mar
    • H. Pan, S.E. Levinson, T.S. Huang, and Z. Liang, "A Fused Hidden Markov Model with Application to Bimodal Speech Processing," IEEE Trans. Signal Processing, vol. 52, no. 3, pp. 573-581, Mar. 2004.
    • (2004) IEEE Trans. Signal Processing , vol.52 , Issue.3 , pp. 573-581
    • Pan, H.1    Levinson, S.E.2    Huang, T.S.3    Liang, Z.4
  • 30
    • 0021541159 scopus 로고
    • Automatic Lipreading to Enhance Speech Recognition
    • E. Petajan, "Automatic Lipreading to Enhance Speech Recognition," Proc. Global Telecomm. Conf., pp. 265-272, 1984.
    • (1984) Proc. Global Telecomm. Conf , pp. 265-272
    • Petajan, E.1
  • 31
    • 0003243224 scopus 로고    scopus 로고
    • Probabilities for SV Machines
    • A.J. Smola, P.L. Bartlett, B. Schoelkopf, and D. Schuurmans, eds, pp, MIT Press
    • J. Platt, "Probabilities for SV Machines," Advances in Large Margin Classifiers, A.J. Smola, P.L. Bartlett, B. Schoelkopf, and D. Schuurmans, eds., pp. 61-73, MIT Press, 2000.
    • (2000) Advances in Large Margin Classifiers , pp. 61-73
    • Platt, J.1
  • 33
    • 0037697284 scopus 로고    scopus 로고
    • Hidden Articulator Markov Models for Speech Recognition
    • Oct
    • M. Richardson, J. Bilmes, and C. Diorio, "Hidden Articulator Markov Models for Speech Recognition," Speech Comm., vol. 41, nos. 2/3, pp. 511-529, Oct. 2003.
    • (2003) Speech Comm , vol.41 , Issue.2-3 , pp. 511-529
    • Richardson, M.1    Bilmes, J.2    Diorio, C.3
  • 35
    • 33646822127 scopus 로고    scopus 로고
    • K. Saenko, K. Livescu, J. Glass, and T. Darrell, Production Domain Modeling of Pronunciation for Visual Speech Recognition, Proc. Int'l Conf. Acoustics, Speech, and Signal Processing, pp. v/473-v/ 476, Mar. 2005.
    • K. Saenko, K. Livescu, J. Glass, and T. Darrell, "Production Domain Modeling of Pronunciation for Visual Speech Recognition," Proc. Int'l Conf. Acoustics, Speech, and Signal Processing, pp. v/473-v/ 476, Mar. 2005.
  • 37
    • 0025477640 scopus 로고
    • Speech Database Development: TIMIT and Beyond
    • Aug
    • V. Zue, S. Seneff, and J. Glass, "Speech Database Development: TIMIT and Beyond," Speech Comm., vol. 9, no. 4, pp. 351-356, Aug. 1990.
    • (1990) Speech Comm , vol.9 , Issue.4 , pp. 351-356
    • Zue, V.1    Seneff, S.2    Glass, J.3
  • 38
    • 0004158153 scopus 로고    scopus 로고
    • Speech Recognition Using Dynamic Bayesian Networks,
    • PhD dissertation, Computer Science Division, Univ. of California
    • G. Zweig, "Speech Recognition Using Dynamic Bayesian Networks," PhD dissertation, Computer Science Division, Univ. of California, 1998.
    • (1998)
    • Zweig, G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.