Volume 23, Issue 5, 2007, Pages 840-851

Enabling multimodal human-robot interaction for the Karlsruhe humanoid robot

Author keywords

Audiovisual perception; Human centered robotics; Human robot interaction; Multimodal interaction

Indexed keywords

COMPUTER VISION; GESTURE RECOGNITION; MAN MACHINE SYSTEMS; SPEECH RECOGNITION; TRACKING (POSITION)

EID: 35348860213     PISSN: 15523098     EISSN: None     Source Type: Journal    
DOI: 10.1109/TRO.2007.907484     Document Type: Article
Times cited: 118

References (57)
  • 1
    • 35348832224
    • The SFB 588 Website. (2004). [Online]. Available: http://www.sfb588.uni-karlsruhe.de/
    • (2004) The SFB 588 Website
  • 2
    • 14044260929
    • Special Issue on Human-Friendly Robots
    • "Special Issue on Human-Friendly Robots," J. Robot. Soc. Jpn., vol. 16, no. 3, 1998.
    • (1998) J. Robot. Soc. Jpn., vol. 16, no. 3
  • 7
    • 14944372689
    • Facial orientation during multi-party interaction with information kiosks
    • I. Bakx, K. van Turnhout, and J. Terken, "Facial orientation during multi-party interaction with information kiosks," presented at the Interact 2003, Zurich, Switzerland.
    • (2003) Interact
    • Bakx, I.; van Turnhout, K.; Terken, J.
  • 8
    • 0031185845
    • Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection
    • P. N. Belhumeur, J. P. Hespanha, and D. J. Kriegman, "Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection," IEEE Trans. Pattern Anal. Mach. Intell., vol. 19, no. 7, pp. 711-720, Jul. 1997.
    • (1997) IEEE Trans. Pattern Anal. Mach. Intell., vol. 19, no. 7, pp. 711-720
    • Belhumeur, P.N.; Hespanha, J.P.; Kriegman, D.J.
  • 9
    • 2442491029
    • Social interactions in HRI: The robot view
    • C. Breazeal, "Social interactions in HRI: The robot view," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 34, no. 2, pp. 181-186, May 2004.
    • (2004) IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 34, no. 2, pp. 181-186
    • Breazeal, C.
  • 11
    • 0347338032
    • Robust time delay estimation exploiting redundancy among multiple microphones
    • J. Chen, J. Benesty, and Y. Huang, "Robust time delay estimation exploiting redundancy among multiple microphones," IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 549-557, Nov. 2003.
    • (2003) IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 549-557
    • Chen, J.; Benesty, J.; Huang, Y.
  • 13
    • 4644351478
    • Rapid prototyping for spoken dialogue systems
    • M. Denecke, "Rapid prototyping for spoken dialogue systems," in Proc. 19th Int. Conf. Comput. Linguist., Taipei, Taiwan, 2002, vol. 1, pp. 1-7.
    • (2002) Proc. 19th Int. Conf. Comput. Linguist., vol. 1, pp. 1-7
    • Denecke, M.
  • 14
    • 84896294568
    • A generic face representation approach for local appearance based face verification
    • H. Ekenel and R. Stiefelhagen, "A generic face representation approach for local appearance based face verification," presented at the Workshop Face Recognit. Grand Challenge Exp., 2005.
    • (2005) Workshop Face Recognit. Grand Challenge Exp.
    • Ekenel, H.; Stiefelhagen, R.
  • 16
    • 84863693528
    • Local appearance-based face recognition using discrete cosine transform
    • H. K. Ekenel and R. Stiefelhagen, "Local appearance-based face recognition using discrete cosine transform," presented at the 13th Eur. Signal Process. Conf., Antalya, Turkey, 2005.
    • (2005) 13th Eur. Signal Process. Conf.
    • Ekenel, H.K.; Stiefelhagen, R.
  • 17
    • 33845541933
    • Analysis of local appearance-based face recognition on FRGC 2.0 database
    • H. Ekenel and R. Stiefelhagen, "Analysis of local appearance-based face recognition on FRGC 2.0 database," presented at the Face Recognit. Grand Challenge Workshop (FRGC), Arlington, VA, Mar. 2006.
    • (2006) Face Recognit. Grand Challenge Workshop (FRGC)
    • Ekenel, H.; Stiefelhagen, R.
  • 18
  • 19
    • 84878388839
    • Tight coupling of speech recognition and dialog management - dialog-context dependent grammar weighting for speech recognition
    • C. Fügen, H. Holzapfel, and A. Waibel, "Tight coupling of speech recognition and dialog management - dialog-context dependent grammar weighting for speech recognition," in Proc. Int. Conf. Spoken Lang. Process., Jeju-Islands, Korea, 2004.
    • (2004) Proc. Int. Conf. Spoken Lang. Process.
    • Fügen, C.; Holzapfel, H.; Waibel, A.
  • 20
    • 85009112391
    • Recent advances in speech recognition system for IBM DARPA communicator
    • Y. Gao, H. Erdogan, Y. Li, V. Goel, and M. Picheny, "Recent advances in speech recognition system for IBM DARPA communicator," presented at the Eurospeech, Aalborg, Denmark, Sep. 2001.
    • (2001) Eurospeech
    • Gao, Y.; Erdogan, H.; Li, Y.; Goel, V.; Picheny, M.
  • 23
    • 14944340532
    • Implementation and evaluation of a constraint based multimodal fusion system for speech and 3D pointing gestures
    • H. Holzapfel, K. Nickel, and R. Stiefelhagen, "Implementation and evaluation of a constraint based multimodal fusion system for speech and 3D pointing gestures," presented at the Int. Conf. Multimodal Interfaces, State College, PA, 2004.
    • (2004) Int. Conf. Multimodal Interfaces
    • Holzapfel, H.; Nickel, K.; Stiefelhagen, R.
  • 25
    • 44949181975
    • A multilingual expectations model for contextual utterances in mixed-initiative spoken dialogue
    • H. Holzapfel and A. Waibel, "A multilingual expectations model for contextual utterances in mixed-initiative spoken dialogue," in Proc. Interspeech, Pittsburgh, PA, 2006.
    • (2006) Proc. Interspeech
    • Holzapfel, H.; Waibel, A.
  • 26
    • 0032136153
    • Condensation - conditional density propagation for visual tracking
    • M. Isard and A. Blake, "Condensation - conditional density propagation for visual tracking," Int. J. Comput. Vis., vol. 29, no. 1, pp. 5-28, 1998.
    • (1998) Int. J. Comput. Vis., vol. 29, no. 1, pp. 5-28
    • Isard, M.; Blake, A.
  • 27
    • 0014913763
    • Effects of eye position on person perception
    • J. W. Tankard, "Effects of eye position on person perception," Percept. Mot. Skills, vol. 31, no. 3, pp. 883-893, 1970.
    • (1970) Percept. Mot. Skills, vol. 31, no. 3, pp. 883-893
    • Tankard, J.W.
  • 28
    • 14944370978
    • Identifying the addressee in human-human-robot interactions based on head pose and speech
    • M. Katzenmaier, R. Stiefelhagen, T. Schultz, I. Rogina, and A. Waibel, "Identifying the addressee in human-human-robot interactions based on head pose and speech," presented at the Int. Conf. Multimodal Interfaces, State College, PA, Oct. 2004.
    • (2004) Int. Conf. Multimodal Interfaces
    • Katzenmaier, M.; Stiefelhagen, R.; Schultz, T.; Rogina, I.; Waibel, A.
  • 29
    • 35348872450
    • GuRoo: Autonomous humanoid platform for walking gait research
    • D. Kee, G. Wyeth, A. Hood, and A. Drury, "GuRoo: Autonomous humanoid platform for walking gait research," presented at the Conf. Auton. Minirobots Res. Edutainment, Brisbane, Australia, Feb. 2003.
    • (2003) Conf. Auton. Minirobots Res. Edutainment
    • Kee, D.; Wyeth, G.; Hood, A.; Drury, A.
  • 31
    • 0006696629
    • Effects of self-attributed and other-attributed gaze in interpersonal evaluations between males and females
    • C. L. Kleinke, A. A. Bustos, F. B. Meeker, and R. A. Staneski, "Effects of self-attributed and other-attributed gaze in interpersonal evaluations between males and females," J. Exp. Soc. Psychol., no. 9, pp. 154-163, 1973.
    • (1973) J. Exp. Soc. Psychol., no. 9, pp. 154-163
    • Kleinke, C.L.; Bustos, A.A.; Meeker, F.B.; Staneski, R.A.
  • 32
    • 45949092150
    • Temporal ICA for classification of acoustic events in a kitchen environment
    • F. Kraft, R. Malkin, T. Schaaf, and A. Waibel, "Temporal ICA for classification of acoustic events in a kitchen environment," presented at the Interspeech, Lisbon, Portugal, 2005.
    • (2005) Interspeech
    • Kraft, F.; Malkin, R.; Schaaf, T.; Waibel, A.
  • 33
    • 33846632781
    • Mechanical design of humanoid robot platform KHR-3 (KAIST humanoid robot-3: Hubo)
    • J. Lee, I. Park, J. Kim, and J. Oh, "Mechanical design of humanoid robot platform KHR-3 (KAIST humanoid robot-3: Hubo)," in Proc. IEEE-RAS Int. Conf. Humanoid Robots, Dec. 2005, pp. 321-326.
    • (2005) Proc. IEEE-RAS Int. Conf. Humanoid Robots, pp. 321-326
    • Lee, J.; Park, I.; Kim, J.; Oh, J.
  • 34
    • 35348877140
    • Context-sensitive speech recognition in ISU-dialogue systems: Results for the grammar-switching approach
    • O. Lemon, "Context-sensitive speech recognition in ISU-dialogue systems: Results for the grammar-switching approach," presented at the Catalog'04, 8th Workshop Semantics Pragmatics Dialogue, Barcelona, Spain.
    • Catalog'04, 8th Workshop Semantics Pragmatics Dialogue
    • Lemon, O.
  • 35
    • 17744406666
    • An extended set of Haar-like features for rapid object detection
    • R. Lienhart and J. Maydt, "An extended set of Haar-like features for rapid object detection," in Proc. Int. Conf. Image Process., Sep. 2002, vol. 1, pp. 900-903.
    • (2002) Proc. Int. Conf. Image Process., vol. 1, pp. 900-903
    • Lienhart, R.; Maydt, J.
  • 37
    • 84887145372
    • F. Metze, Q. Jin, C. Fügen, Y. Pan, and T. Schultz, "Issues in meeting transcription - The Meeting Transcription System," presented at the Interspeech 2004 - ICSLP, Jeju Island, Korea, Oct. 2004.
  • 38
    • 0034326217
    • Bayesian face recognition
    • B. Moghaddam, T. Jebara, and A. Pentland, "Bayesian face recognition," Pattern Recognit., vol. 33, no. 11, pp. 1771-1782, 2000.
    • (2000) Pattern Recognit., vol. 33, no. 11, pp. 1771-1782
    • Moghaddam, B.; Jebara, T.; Pentland, A.
  • 40
    • 35348837140
    • Visual recognition of pointing gestures for human-robot interaction
    • K. Nickel and R. Stiefelhagen, "Visual recognition of pointing gestures for human-robot interaction," Image Vis. Comput., to be published.
    • Image Vis. Comput., to be published
    • Nickel, K.; Stiefelhagen, R.
  • 42
    • 85079244419
    • M. Omologo and P. Svaizer, "Acoustic event localization using a crosspower-spectrum phase based technique," in Proc. Int. Conf. Acoust. Speech Signal Process., Adelaide, Australia, Apr. 1994, pp. II-273-II-276.
  • 43
    • 84966270011
    • The CU communicator: An architecture for dialogue systems
    • B. Pellom, W. Ward, and S. Pradhan, "The CU communicator: An architecture for dialogue systems," presented at the Int. Conf. Spoken Lang. Process., Beijing, China, Oct. 2000.
    • (2000) Int. Conf. Spoken Lang. Process.
    • Pellom, B.; Ward, W.; Pradhan, S.
  • 44
    • 44949181000
    • Rapid simulation-driven reinforcement learning of multimodal dialog strategies in human-robot interaction
    • T. Prommer, H. Holzapfel, and A. Waibel, "Rapid simulation-driven reinforcement learning of multimodal dialog strategies in human-robot interaction," presented at the Interspeech (Int. Conf. Spoken Lang. Process.), 2006.
    • (2006) Interspeech (Int. Conf. Spoken Lang. Process.)
    • Prommer, T.; Holzapfel, H.; Waibel, A.
  • 45
    • 0035151883
    • Looking means listening: Coordinating displays of engagement in doctor-patient interaction
    • J. Ruusuvuori, "Looking means listening: Coordinating displays of engagement in doctor-patient interaction," Soc. Sci. Med., vol. 52, pp. 1093-1108, 2001.
    • (2001) Soc. Sci. Med., vol. 52, pp. 1093-1108
    • Ruusuvuori, J.
  • 47
    • 35348858601
    • K. Scheffler and S. Young, "Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning," presented at the Hum. Lang. Technol., San Diego, CA, 2002.
  • 48
    • 85009166685
    • Organization, communication, and control in the Galaxy-II conversational system
    • S. Seneff, R. Lau, and J. Polifroni, "Organization, communication, and control in the Galaxy-II conversational system," presented at the Eurospeech, Budapest, Hungary, Sep. 1999.
    • (1999) Eurospeech
    • Seneff, S.; Lau, R.; Polifroni, J.
  • 49
    • 15044355748
    • Large-scale evaluation of multimodal biometric authentication using state-of-the-art systems
    • R. Snelick, U. Uludag, A. Mink, M. Indovina, and A. K. Jain, "Large-scale evaluation of multimodal biometric authentication using state-of-the-art systems," IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 3, pp. 450-455, Mar. 2005.
    • (2005) IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 3, pp. 450-455
    • Snelick, R.; Uludag, U.; Mink, A.; Indovina, M.; Jain, A.K.
  • 50
    • 84962868641
    • H. Soltau, F. Metze, C. Fügen, and A. Waibel, "A one pass-decoder based on polymorphic linguistic context assignment," presented at the Autom. Speech Recognit. Understanding, Madonna di Campiglio, Trento, Italy, Dec. 2001.
  • 52
    • 84963787957
    • Tracking focus of attention in meetings
    • R. Stiefelhagen, "Tracking focus of attention in meetings," in Proc. Int. Conf. Multimodal Interfaces. Pittsburgh, PA: IEEE, Oct. 2002, pp. 273-280.
    • (2002) Proc. Int. Conf. Multimodal Interfaces, pp. 273-280
    • Stiefelhagen, R.
  • 53
    • 0026065565
    • Eigenfaces for recognition
    • M. Turk and A. Pentland, "Eigenfaces for recognition," J. Cogn. Neurosci., vol. 3, no. 1, pp. 71-86, 1991.
    • (1991) J. Cogn. Neurosci., vol. 3, no. 1, pp. 71-86
    • Turk, M.; Pentland, A.
  • 54
    • 0346870000
    • Robust real-time object detection
    • P. Viola and M. Jones, "Robust real-time object detection," presented at the ICCV Workshop Stat. Comput. Theories Vis., San Diego, CA, Jul. 2001.
    • (2001) ICCV Workshop Stat. Comput. Theories Vis.
    • Viola, P.; Jones, M.
  • 56
    • 84946714447
    • Y.-Y. Wang, A. Acero, and C. Chelba, "Is word error rate a good indicator for spoken language understanding accuracy," presented at the Autom. Speech Recognit. Understanding, St. Thomas, U.S. Virgin Islands, 2003.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.