메뉴 건너뛰기




Volumn 42, Issue 4, 2011, Pages 215-245

Integration of a voice recognition system in a social robot

Author keywords

ASR; audition system; automatic speech recognition; computing confidence score; dialogue; human computer interaction; human robot interaction; maggie; microphone system; natural language processing; natural language understanding; personal robot; robot audition; social robot; speech recognition; voice recognition

Indexed keywords

ASR; AUDITION SYSTEM; AUTOMATIC SPEECH RECOGNITION; CONFIDENCE SCORE; DIALOGUE; HUMAN-COMPUTER; MAGGIE; MICROPHONE SYSTEM; NATURAL LANGUAGE PROCESSING; NATURAL LANGUAGE UNDERSTANDING; PERSONAL ROBOT; ROBOT AUDITION; SOCIAL ROBOTS;

EID: 79958703093     PISSN: 01969722     EISSN: 10876553     Source Type: Journal    
DOI: 10.1080/01969722.2011.583593     Document Type: Article
Times cited : (42)

References (50)
  • 3
    • 21244505005 scopus 로고    scopus 로고
    • HERMES - A versatile personal robotic assistant
    • DOI 10.1109/JPROC.2004.835381, Human Interactive Robots for Psychological Enrichment
    • Bischoff, R. and Graefe, V. 2004. HERMES - a versatile personal robotic assistant. Proceedings of the IEEE 92, no. 11:1759-1779. doi:10.1109/JPROC.2004. 835381. http://ieeexplore.ieee.org/xpl/freeabs-all.jsp?arnumber=1347457 (Pubitemid 40890670)
    • (2004) Proceedings of the IEEE , vol.92 , Issue.11 , pp. 1759-1778
    • Bischoff, R.1    Graefe, V.2
  • 4
    • 0037219850 scopus 로고    scopus 로고
    • Emotive qualities in lip-synchronized robot speech
    • May:, doi:10.1163=156855303321165079
    • Breazeal, C. 2003. Emotive qualities in lip-synchronized robot speech. Advanced Robotics 17, no. 2(May):97-113. doi:10.1163=156855303321165079. http://www.ingentaconnect.com/content/vsp/arb/2003/00000017/00000002/art00003
    • (2003) Advanced Robotics , vol.17 , Issue.2 , pp. 97-113
    • Breazeal, C.1
  • 8
    • 78049374466 scopus 로고    scopus 로고
    • Evaluation of semantic role labeling and dependency parsing of automatic speech recognition output
    • doi:10.1109=ICASSP.2010.5494946
    • Favre, B., Benoit, B., and Hakkani-Tur, D. 2010a. Evaluation of semantic role labeling and dependency parsing of automatic speech recognition output. 2010 IEEE International Conference on Acoustics, Speech and Signal Processing 1:5342-5345. doi:10.1109=ICASSP.2010.5494946. http://ieeexplore.ieee.org/lpdocs/ epic03/wrapper.htm?arnumber=5494946.
    • (2010) 2010 IEEE International Conference on Acoustics, Speech and Signal Processing , vol.1 , pp. 5342-5345
    • Favre, B.1    Benoit, B.2    Hakkani-Tur, D.3
  • 9
    • 78049374466 scopus 로고    scopus 로고
    • Evaluation of semantic role labeling and dependency parsing of automatic speech recognition output
    • Dallas USA: IEEE. doi:10.1109/ICASSP.2010.5494946
    • - 2010b. Evaluation of semantic role labeling and dependency parsing of automatic speech recognition output. In 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 5342-5345. Dallas (USA): IEEE. doi:10.1109/ICASSP.2010.5494946. http://ieeexplore.ieee.org/xpl/freeabs-all.jsp? arnumber=5494946.
    • (2010) 2010 IEEE International Conference on Acoustics, Speech and Signal Processing , pp. 5342-5345
    • Favre, B.1    Benoit, B.2    Hakkani-Tur, D.3
  • 13
    • 0001309032 scopus 로고    scopus 로고
    • Building ears for robots: Sound localization and separation
    • December:, doi:10.1007=BF02471133
    • Huang, J, Ohnishi, N., and Sugie, N. 1997. Building ears for robots: Sound localization and separation. Artificial Life and Robotics 1, no. 4(December):157-163. doi:10.1007=BF02471133. http://www.springerlink.com/ content/upw68k6138152679/
    • (1997) Artificial Life and Robotics , vol.1 , Issue.4 , pp. 157-163
    • Huang, J.1    Ohnishi, N.2    Sugie, N.3
  • 14
    • 0026374868 scopus 로고
    • Improved acoustic modeling with the SPHINX speech recognition system
    • 1991 International Conference on, Toronto Canada: IEEE. doi:10.1109/ICASSP.1991.150347
    • Huang, X. D., Lee, K. F., Hon, H. W., and Hwang, M. Y. 1991. Improved acoustic modeling with the SPHINX speech recognition system. In Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, 345-348. Toronto (Canada): IEEE. doi:10.1109/ICASSP.1991.150347. http://ieeexplore.ieee.org/xpl/freeabs-all.jsp?arnumber=150347
    • (1991) Acoustics, Speech, and Signal Processing, 1991. ICASSP-91 , pp. 345-348
    • Huang, X.D.1    Lee, K.F.2    Hon, H.W.3    Hwang, M.Y.4
  • 15
    • 48149086425 scopus 로고    scopus 로고
    • Robust speech recognition system for communication robots in real environments
    • Genoa Italy: IEEE, December. doi:10.1109/ICHR.2006. 321294
    • Ishi, C., Matsuda, S., Kanda, T., Jitsuhiro, T., Ishiguro, H., Nakamura, S., and Hagita, N. 2006. Robust Speech Recognition System for Communication Robots in Real Environments. In 2006 6th IEEE-RAS International Conference on Humanoid Robots, 340-345. Genoa (Italy): IEEE, December. doi:10.1109/ICHR.2006. 321294. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=4115624
    • (2006) 2006 6th IEEE-RAS International Conference on Humanoid Robots , pp. 340-345
    • Ishi, C.1    Matsuda, S.2    Kanda, T.3    Jitsuhiro, T.4    Ishiguro, H.5    Nakamura, S.6    Hagita, N.7
  • 16
    • 33646126917 scopus 로고    scopus 로고
    • A computational model of intention reading in imitation
    • May:, doi:10.1016/j.robot.2006.01.006
    • Jansen, B. and Belpaeme, T. 2006. A computational model of intention reading in imitation. Robotics and Autonomous Systems 54, no. 5(May):394-402. doi:10.1016/j.robot.2006.01.006. http://linkinghub.elsevier.com/retrieve/pii/ S0921889006000194
    • (2006) Robotics and Autonomous Systems , vol.54 , Issue.5 , pp. 394-402
    • Jansen, B.1    Belpaeme, T.2
  • 17
    • 78049393024 scopus 로고    scopus 로고
    • A general framework for building natural language understanding modules in voice search
    • Dallas: IEEE, March. doi:10.1109/ICASSP.2010.5494951
    • Junlan, F. 2010. A general framework for building natural language understanding modules in voice search. In 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 5362-5365. Dallas: IEEE, March. doi:10.1109/ICASSP.2010.5494951. http://ieeexplore.ieee.org/xpl/freeabs-all.jsp? arnumber=5494951
    • (2010) 2010 IEEE International Conference on Acoustics, Speech and Signal Processing , pp. 5362-5365
    • Junlan, F.1
  • 19
    • 33846702321 scopus 로고    scopus 로고
    • Human-robot interaction in real environments by audio-visual integration
    • Kim, H. and Choi, J. 2007. Human-robot interaction in real environments by audiovisual integration. International Journal of Control 5(1):61-69. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.126.8455&rep= rep1&type=pdf (Pubitemid 46198189)
    • (2007) International Journal of Control, Automation and Systems , vol.5 , Issue.1 , pp. 61-69
    • Kim, H.-D.1    Choi, J.-S.2    Kim, M.3
  • 20
    • 48749088932 scopus 로고    scopus 로고
    • Improving speech recognition using semantic and reference features in a multimodal dialog system
    • Jeju Island Korea: IEEE. doi:10.1109/ROMAN. 2007.4415120
    • Kim, K., Jeong, M., and Lee, G. G. 2007. Improving Speech Recognition Using Semantic and Reference Features in a Multimodal Dialog System. In RO-MAN 2007 - The 16th IEEE International Symposium on Robot and Human Interactive Communication, 416-420. Jeju Island (Korea): IEEE. doi:10.1109/ROMAN. 2007.4415120. http://ieeexplore.ieee.org/xpl/freeabs-all.jsp?arnumber=4415120
    • (2007) RO-MAN 2007 - The 16th IEEE International Symposium on Robot and Human Interactive Communication , pp. 416-420
    • Kim, K.1    Jeong, M.2    Lee, G.G.3
  • 23
    • 79951801958 scopus 로고    scopus 로고
    • Computing confidence score of any input phrases for a spoken dialog system
    • Berkeley, California USA: IEEE, December. doi:10.1109/SLT.2010. 5700867
    • Lin, F. and Weng, F. 2010. Computing confidence score of any input phrases for a spoken dialog system. In 2010 IEEE Spoken Language Technology Workshop, 295-300. Berkeley, California (USA): IEEE, December. doi:10.1109/SLT.2010. 5700867. http://ieeexplore.ieee.org/xpl/freeabs-all.jsp? arnumber=5700867
    • (2010) 2010 IEEE Spoken Language Technology Workshop , pp. 295-300
    • Lin, F.1    Weng, F.2
  • 24
    • 84904701644 scopus 로고    scopus 로고
    • El papel de la fonética en el desarrollo de las tecnologías del habla
    • Cadiz Spain: Servicio de Publicaciones de la Universidad de Cádiz
    • Llisterri, J., Carbó, C, Machuca, M., De la Mota, C, Riera, M., and Rios, A. 2003. El papel de la fonética en el desarrollo de las tecnologías del habla. In Memorias de las VII Jornadas de Linguística. Cadiz (Spain): Servicio de Publicaciones de la Universidad de Cádiz. http://liceu.uab.es/~joaquim/speech-technology/UNAM-03/UNAM03- Guion-Bib.pdf
    • (2003) Memorias de las VII Jornadas de Linguística
    • Llisterri, J.1    Carbó, C.2    Machuca, M.3    De La Mota, C.4    Riera, M.5    Rios, A.6
  • 25
    • 56749180541 scopus 로고    scopus 로고
    • Beyond the individual: New insights on language, cognition and robots
    • December:, doi:10.1080/09540090802518661
    • Lopes L. Seabra and Belpaeme, T. 2008. Beyond the individual: new insights on language, cognition and robots. Connection Science 204(December):231-237. doi:10.1080/09540090802518661. http://www.informaworld. com/openurl?genre=article&doi=10.1080/09540090802518661&magic=crossref) D404A21C5BB053405B1A640AFFD44AE3
    • (2008) Connection Science , vol.204 , pp. 231-237
    • Seabra, L.L.1    Belpaeme, T.2
  • 26
    • 34250613163 scopus 로고    scopus 로고
    • Robovie-IV: A communication robot interacting with people daily in an office
    • DOI 10.1109/IROS.2006.282594, 4059225, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2006
    • Mitsunaga, N., Miyashita, T., Ishiguro, H., Kogure, K., and Hagita, N. 2006. Robovie-IV: A Communication Robot Interacting with People Daily in an Office. In Intelligent Robots and Systems, 2006 IEEE/RSJ International Conference on, 5066-5072. Beijing: IEEE. doi:10.1109/IROS.2006.282594. http://ieeexplore.ieee.org/xpl/freeabs-all.jsp?arnumber=4059225 (Pubitemid 46928732)
    • (2006) IEEE International Conference on Intelligent Robots and Systems , pp. 5066-5072
    • Mitsunaga, N.1    Miyashita, T.2    Ishiguro, H.3    Kogure, K.4    Hagita, N.5
  • 29
    • 85011464740 scopus 로고    scopus 로고
    • Architecture for adaptive multimodal dialog systems based on VoiceXML
    • Scandinavia: Association for Computational Linguistics Morristown
    • Niklfeld, G. and Finan, R. 2001. Architecture for adaptive multimodal dialog systems based on VoiceXML. In Proceedings of EuroSpeech, 1-4. Scandinavia: Association for Computational Linguistics Morristown. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.21.7658&rep= rep1&type=pdf
    • (2001) Proceedings of EuroSpeech , pp. 1-4
    • Niklfeld, G.1    Finan, R.2
  • 31
  • 32
    • 44849118877 scopus 로고    scopus 로고
    • Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech
    • Kyoto Japan: IEEE. doi:10.1109/ASRU.2007.4430093
    • Okuno Hiroshi G. 2007. Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech. In IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), 111-116. Kyoto (Japan): IEEE. doi:10.1109/ASRU.2007.4430093. http://ieeexplore.ieee.org/lpdocs/ epic03/wrapper.htm?arnumber=4430093
    • (2007) IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) , pp. 111-116
    • Okuno Hiroshi, G.1
  • 35
    • 79958710696 scopus 로고    scopus 로고
    • Spoken dialog management for robots
    • Roy, N., Pineau, J., and Thrun, S. 1998. Spoken Dialog Management for Robots. Management.
    • (1998) Management
    • Roy, N.1    Pineau, J.2    Thrun, S.3
  • 36
    • 84880707672 scopus 로고    scopus 로고
    • Spoken dialogue management using probabilistic reasoning
    • Morristown, NJ, USA: Association for Computational Linguistics, October. doi:10.3115/1075218.1075231
    • -. 2000. Spoken dialogue management using probabilistic reasoning. Proceedings of the 38th Annual Meeting on Association for Computational Linguistics - ACL'00. Morristown, NJ, USA: Association for Computational Linguistics, October. doi:10.3115/1075218.1075231. http://portal.acm.org/ citation.cfm?id=1075218.1075231.
    • (2000) Proceedings of the 38th Annual Meeting on Association for Computational Linguistics - ACL'00
    • Roy, N.1    Pineau, J.2    Thrun, S.3
  • 38
    • 85009115792 scopus 로고    scopus 로고
    • A multi-modal dialog system for a mobile robot
    • Jeju Island Korea: IEEE, doi=10.1.1.59.9237
    • Shuyin, I. Toptsis, Li, S., Wrede, B., and Fink, G. A. 2004. A Multi-modal Dialog System for a Mobile Robot. In Int. Conf. on Spoken Language Processing, 273-276. Jeju Island (Korea): IEEE. http://citeseerx.ist.psu.edu/ viewdoc/summary? doi=10.1.1.59.9237
    • (2004) Int. Conf. on Spoken Language Processing , pp. 273-276
    • Shuyin, I.1    Toptsis, L.S.2    Wrede, B.3    Fink, G.A.4
  • 42
    • 34250621563 scopus 로고    scopus 로고
    • Three ring microphone array for 3D sound localization and separation for mobile robot audition
    • DOI 10.1109/IROS.2005.1545095, 1545095, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS
    • Tamai, Y., Sasaki, Y., Kagami, S., and Mizoguchi, H. 2005. Three Ring Microphone Array for 3D Sound Localization and Separation for Mobile Robot Audition. In IEEE/RSJ International Conference on Intelligent Robots and Systems, 903-908. Edmonton (Canada): IEEE. doi:10.1109/IROS.2005.1545095. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=1545095 (Pubitemid 43896422)
    • (2005) 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS , vol.1
    • Tamai, Y.1    Sasaki, Y.2    Kagami, S.3    Mizoguchi, H.4
  • 43
    • 79958721294 scopus 로고    scopus 로고
    • Development of zonal beamformer and its application to robot audition
    • Tanaka, N., Ogawa, T., Akagiri, K., and Kobayashi, T. 2010. DEVELOPMENT OF ZONAL BEAMFORMER AND ITS APPLICATION TO ROBOT AUDITION. In Signal Processing, 1:1529-1533. http://www.eurasip.org/Proceedings/Eusipco/Eusipco2010/Contents/ papers/1569292345.pdf
    • (2010) Signal Processing , vol.1 , pp. 1529-1533
    • Tanaka, N.1    Ogawa, T.2    Akagiri, K.3    Kobayashi, T.4
  • 44
    • 14044260635 scopus 로고    scopus 로고
    • Enhanced robot audition based on microphone array source separation with post-filter
    • FP1-B4, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
    • Valin, J.-M., Rouat, J., and Michaud, F. 2004. Enhanced robot audition based on microphone array source separation with post-filter. In 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2123-2128. Sendai (Japan): IEEE. doi:10.1109/IROS.2004.1389723. http://ieeexplore.ieee.org/ xpl/freeabs-all.jsp?arnumber=1389723 (Pubitemid 40275913)
    • (2004) 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) , vol.3 , pp. 2123-2128
    • Valin, J.-M.1    Rouat, J.2    Michaud, F.3
  • 46
    • 0030351171 scopus 로고    scopus 로고
    • A multi-level lexical-semantics based language model design for guided integrated continuous speech recognition
    • Philadelphia USA: IEEE. doi:10.1109/ICSLP.1996.607082
    • Valverde-Albacete, F. J. and Pardo, J. M. 1996. A multi-level lexical-semantics based language model design for guided integrated continuous speech recognition. In Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP'96, 224-227. Philadelphia (USA): IEEE. doi:10.1109/ICSLP.1996.607082. http://ieeexplore.ieee.org/xpl/freeabs-all.jsp? arnumber=607082
    • (1996) Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP ' , vol.96 , pp. 224-227
    • Valverde-Albacete, F.J.1    Pardo, J.M.2
  • 47
    • 79958759278 scopus 로고
    • Speech understanding through syntactic and semantic analysis
    • April:, doi:10.1109/TC.1976.1674625
    • Walker, D. E. 1976. Speech Understanding Through Syntactic and Semantic Analysis. IEEE Transactions on Computers C-25, no. 4(April):432-439. doi:10.1109/TC.1976.1674625. http://ieeexplore.ieee.org/xpl/freeabs-all.jsp? arnumber=1674625
    • (1976) IEEE Transactions on Computers C-25 , Issue.4 , pp. 432-439
    • Walker, D.E.1
  • 48
    • 79960390823 scopus 로고    scopus 로고
    • July 15
    • Wallis, P. 2010. A robot in the kitchen (July 15):25-30. http://portal.acm.org/citation.cfm?id=1870559.1870564
    • (2010) A Robot in the Kitchen , pp. 25-30
    • Wallis, P.1
  • 49
    • 77950563943 scopus 로고    scopus 로고
    • Automatic speech recognition improved by two-layered audio-visual integration for robot audition
    • Paris France: IEEE, December. doi:10.1109/ICHR.2009.5379586
    • Yoshida, T., Kazuhiro, N., and Okuno, H. G. 2009. Automatic speech recognition improved by two-layered audio-visual integration for robot audition. In 2009 9th IEEE-RAS International Conference on Humanoid Robots, 604-609. Paris (France): IEEE, December. doi:10.1109/ICHR.2009.5379586. http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5379586
    • (2009) 2009 9th IEEE-RAS International Conference on Humanoid Robots , pp. 604-609
    • Yoshida, T.1    Kazuhiro, N.2    Okuno, H.G.3
  • 50
    • 73849146246 scopus 로고    scopus 로고
    • Intelligent decision support system based on natural language understanding
    • Beijing China: IEEE, September. doi:10.1109/ICMSS.2009.5302806
    • Yu, X., Zhou, F., Zhang, F., and Yang, B. 2009. Intelligent Decision Support System Based on Natural Language Understanding. In 2009 International Conference on Management and Service Science, 1-4. Beijing (China): IEEE, September. doi:10.1109/ICMSS.2009.5302806. http://ieeexplore.ieee.org/xpl/ freeabs-all.jsp?arnumber=5302806
    • (2009) 2009 International Conference on Management and Service Science , pp. 1-4
    • Yu, X.1    Zhou, F.2    Zhang, F.3    Yang, B.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.