메뉴 건너뛰기




Volumn 5, Issue 2, 2004, Pages 77-80

Robust speech processing using multi-sensor multi-source information fusion - An overview of the state of the art

Author keywords

Multisensor information fusion; Robust speech processing; Speech recognition

Indexed keywords

CAMERAS; COMPUTATIONAL COMPLEXITY; COMPUTATIONAL METHODS; DATA PROCESSING; MICROPHONES; SENSOR DATA FUSION; SIGNAL TO NOISE RATIO; SPEECH RECOGNITION;

EID: 1842843686     PISSN: 15662535     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.inffus.2004.02.001     Document Type: Article
Times cited : (12)

References (39)
  • 2
    • 0035458007 scopus 로고    scopus 로고
    • Robust sound localization using multi-source audio-visual information fusion
    • Aarabi P., Zaky S. Robust sound localization using multi-source audio-visual information fusion. Information Fusion. 3(2):2001;209-223.
    • (2001) Information Fusion , vol.3 , Issue.2 , pp. 209-223
    • Aarabi, P.1    Zaky, S.2
  • 4
    • 0034215560 scopus 로고    scopus 로고
    • Elucidative fusion systems - An exposition
    • Dasarathy B.V. Elucidative fusion systems - an exposition. Information Fusion. 1(1):2000;5-15.
    • (2000) Information Fusion , vol.1 , Issue.1 , pp. 5-15
    • Dasarathy, B.V.1
  • 10
    • 0032180188 scopus 로고    scopus 로고
    • Adaptive fusion of acoustic and visual sources for automatic speech recognition
    • Rogozan A., Deleglise P. Adaptive fusion of acoustic and visual sources for automatic speech recognition. Speech Communication. 26(1-2):1998;149-161.
    • (1998) Speech Communication , vol.26 , Issue.1-2 , pp. 149-161
    • Rogozan, A.1    Deleglise, P.2
  • 13
    • 0025503485 scopus 로고
    • Neural network models of sensory integration for improved vowel recognition
    • Yuhas B.P., Goldstein M.H., Sejnowski T.J., Jenkins R.E. Neural network models of sensory integration for improved vowel recognition. Proceedings of the IEEE. 78(10):1990;1658-1668.
    • (1990) Proceedings of the IEEE , vol.78 , Issue.10 , pp. 1658-1668
    • Yuhas, B.P.1    Goldstein, M.H.2    Sejnowski, T.J.3    Jenkins, R.E.4
  • 15
    • 0030247984 scopus 로고    scopus 로고
    • Computer lip-reading for improved accuracy in automatic speech recognition
    • Silsbee P.L., Bovik A.C. Computer lip-reading for improved accuracy in automatic speech recognition. IEEE Transactions on Speech and Audio Processing. 4(5):1996;337-351.
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 337-351
    • Silsbee, P.L.1    Bovik, A.C.2
  • 17
    • 0042826822 scopus 로고    scopus 로고
    • Independent component analysis: Algorithms and applications
    • Hyvrinen A., Oja E. Independent component analysis: algorithms and applications. Neural Networks. 13(4-5):2000;411-430.
    • (2000) Neural Networks , vol.13 , Issue.4-5 , pp. 411-430
    • Hyvrinen, A.1    Oja, E.2
  • 20
    • 0029411030 scopus 로고
    • An information maximization approach to blind separation and blind deconvolution
    • Bell A., Sejnowski T. An information maximization approach to blind separation and blind deconvolution. Neural Computation. 7:1995;1129-1159.
    • (1995) Neural Computation , vol.7 , pp. 1129-1159
    • Bell, A.1    Sejnowski, T.2
  • 24
    • 0037469886 scopus 로고    scopus 로고
    • Algorithms for acoustic localization based on microphone array in service robotics
    • Enzo M., Massimiliano N., Gianni V. Algorithms for acoustic localization based on microphone array in service robotics. Robotics and Autonomous Systems. 42(2):2003;69-88.
    • (2003) Robotics and Autonomous Systems , vol.42 , Issue.2 , pp. 69-88
    • Enzo, M.1    Massimiliano, N.2    Gianni, V.3
  • 32
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modeling for continuous speech recognition
    • Dupont S., Luettin J. Audio-visual speech modeling for continuous speech recognition. IEEE Transactions on Multimedia. 2(3):2000;141-151.
    • (2000) IEEE Transactions on Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 33
    • 0036154801 scopus 로고    scopus 로고
    • Speakers' direction finding using estimated time delays in the frequency domain
    • Berdugo B., Rosenhouse J., Azhari H. Speakers' direction finding using estimated time delays in the frequency domain. Signal Processing. 82(1):2002;19-30.
    • (2002) Signal Processing , vol.82 , Issue.1 , pp. 19-30
    • Berdugo, B.1    Rosenhouse, J.2    Azhari, H.3
  • 34
    • 0038381727 scopus 로고    scopus 로고
    • Audio-visual speech recognition based on optimized product HMMs and GMM based-MCE-GPD stream weight estimation
    • Kumatani K., Nakamura S. Audio-visual speech recognition based on optimized product HMMs and GMM based-MCE-GPD stream weight estimation. IEICE Transactions on Information and Systems E. 86-D(3):2003;454-463.
    • (2003) IEICE Transactions on Information and Systems E , vol.86 D , Issue.3 , pp. 454-463
    • Kumatani, K.1    Nakamura, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.