메뉴 건너뛰기




Volumn , Issue , 2007, Pages 1751-1756

Coarse speech recognition by audio-visual integration based on missing feature theory

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO-VISUAL; AUDIO-VISUAL SPEECH RECOGNITION; INTERNATIONAL CONFERENCES; MISSING FEATURE THEORY; REAL-WORLD;

EID: 51349110555     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IROS.2007.4399300     Document Type: Conference Paper
Times cited : (10)

References (21)
  • 1
    • 10444237268 scopus 로고    scopus 로고
    • Improvement of recognition of simultaneous speech signals using AV integration and scattering theory for humanoid robots
    • K. Nakadai et al., "Improvement of recognition of simultaneous speech signals using AV integration and scattering theory for humanoid robots", Speech Communication, vol.44, 2004, pp.97-112.
    • (2004) Speech Communication , vol.44 , pp. 97-112
    • Nakadai, K.1
  • 2
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models
    • C.J. Leggetter et al., "Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models", Computer Speech and Language, vol.9, 1995, pp. 171-185.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1
  • 6
    • 51349162990 scopus 로고    scopus 로고
    • K. Nakadai et al., Real-time auditory and visual multiple-object tracking for robots, IJCAI-2001, MIT Press, pp.1424-1432.
    • K. Nakadai et al., "Real-time auditory and visual multiple-object tracking for robots", IJCAI-2001, MIT Press, pp.1424-1432.
  • 7
    • 0035386489 scopus 로고    scopus 로고
    • A cascade visual front end for speaker independent automatic speechreading
    • G. Potamianos et al., "A cascade visual front end for speaker independent automatic speechreading", Speech Technology, Special Issue on Multimedia, vol.4, 2001, pp.193-208.
    • (2001) Speech Technology , vol.4 , pp. 193-208
    • Potamianos, G.1
  • 8
    • 33646814706 scopus 로고    scopus 로고
    • A stream-weight optimization method for multi-stream hmms based on likelihood value normalization
    • SP
    • S. Tamura et al., "A stream-weight optimization method for multi-stream hmms based on likelihood value normalization", Proc. of Int'l Conf. on Acoustics, Speech and Signal Processing, 2005, SP-P5.2.
    • (2005) Proc. of Int'l Conf. on Acoustics, Speech and Signal Processing
    • Tamura, S.1
  • 9
    • 0030638031 scopus 로고    scopus 로고
    • A post-processing systems to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
    • IEEE
    • J. Fiscus, "A post-processing systems to yield reduced word error rates: Recognizer output voting error reduction (ROVER)", in Proc. of the Workshop on Automatic Speech Recognition and Understanding, IEEE, 1997, pp.347-354.
    • (1997) Proc. of the Workshop on Automatic Speech Recognition and Understanding , pp. 347-354
    • Fiscus, J.1
  • 10
    • 85009106519 scopus 로고    scopus 로고
    • Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise
    • ESCA
    • J. Barker et al., "Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise", Proc. of 7fh European Conference on Speech Communication Technology, 2001, pp.213-216. ESCA.
    • (2001) Proc. of 7fh European Conference on Speech Communication Technology , pp. 213-216
    • Barker, J.1
  • 11
    • 85009143830 scopus 로고    scopus 로고
    • Comparison of HMM experts with MLP experts in the full combination multi-band approach to robust ASR
    • A. Hagen et al., "Comparison of HMM experts with MLP experts in the full combination multi-band approach to robust ASR", Proc. of Int'l Conf. on Spoken Language Processing, 2000, pp.345-348.
    • (2000) Proc. of Int'l Conf. on Spoken Language Processing , pp. 345-348
    • Hagen, A.1
  • 12
    • 0030355935 scopus 로고    scopus 로고
    • A new ASR approach based on independent processing and recombination of partial frequency bands
    • H. Bourlard et al., "A new ASR approach based on independent processing and recombination of partial frequency bands", Proc. of Int'l Conf. on Spoken Language Processing, 1996, pp.426-429.
    • (1996) Proc. of Int'l Conf. on Spoken Language Processing , pp. 426-429
    • Bourlard, H.1
  • 13
    • 48149111531 scopus 로고    scopus 로고
    • Speech recognition for a humanoid with motor noise utilizing missing feature theory
    • IEEE
    • Y. Nishimura et al., "Speech recognition for a humanoid with motor noise utilizing missing feature theory", Proc. of Int'l Conf. on Humanoid Robots, 2006, pp.26-33. IEEE.
    • (2006) Proc. of Int'l Conf. on Humanoid Robots , pp. 26-33
    • Nishimura, Y.1
  • 14
    • 84955023511 scopus 로고
    • An analysis of perceptual confusions among some english consonants
    • G. Miller et al., "An analysis of perceptual confusions among some english consonants", JASA, vol.27, 1955, pp.338-352.
    • (1955) JASA , vol.27 , pp. 338-352
    • Miller, G.1
  • 15
    • 0017357502 scopus 로고
    • Effect of training on the visual recognition of consonants
    • B. Walden et al., "Effect of training on the visual recognition of consonants", J, of Speech and Hearing Research, vol.20, 1977, pp. 130-145.
    • (1977) J, of Speech and Hearing Research , vol.20 , pp. 130-145
    • Walden, B.1
  • 16
    • 0037221164 scopus 로고    scopus 로고
    • Look at the big picture (details will follow)
    • D. Ringach, "Look at the big picture (details will follow)", Nature Neuroscience, vol.6, 2003, no.l, pp.7-8.
    • (2003) Nature Neuroscience , vol.6 , Issue.L , pp. 7-8
    • Ringach, D.1
  • 18
    • 0029229987 scopus 로고
    • Markov model based phoneme class partitioning for improved constrained iterative speech enhancement
    • J.H.L. Hansen et al., "Markov model based phoneme class partitioning for improved constrained iterative speech enhancement", IEEE Trans. on Speech and Audio Processing, vol.3, 1995, no.l, pp.98-104.
    • (1995) IEEE Trans. on Speech and Audio Processing , vol.3 , Issue.L , pp. 98-104
    • Hansen, J.H.L.1
  • 19
    • 85032689322 scopus 로고
    • Minimum cost based phoneme class detection for improved iterative speech enhancement
    • L. Arslan et al., "Minimum cost based phoneme class detection for improved iterative speech enhancement", Proc. of Int'l Conf. on Acoustics, Speech and Signal Processing, vol.11, 1994, pp.45-48.
    • (1994) Proc. of Int'l Conf. on Acoustics, Speech and Signal Processing , vol.11 , pp. 45-48
    • Arslan, L.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.