SCOPUS Information Retrieval Platform

IEEE Transactions on Human-Machine Systems

Volume 44, Issue 5, 2014, Pages 625-637

Multiparty interaction understanding using smart multimodal digital signage

(4) Tung, Tony a; Gomez, Randy a,b; Kawahara, Tatsuya a; Matsuyama, Takashi a

a KYOTO UNIVERSITY (Japan)

b HONDA RESEARCH INSTITUTE JAPAN CO LTD (Japan)

Author keywords

Human machine system; multimodal interaction dynamics; multiparty interaction; smart digital signage

Indexed keywords

DIGITAL SIGNAGE; HUMAN-MACHINE SYSTEMS; MULTI-MODAL; MULTI-MODAL INTERACTIONS; MULTI-PARTY INTERACTIONS;

EID: 84907215298 PISSN: 21682291 EISSN: None Source Type: Journal
DOI: 10.1109/THMS.2014.2326873 Document Type: Article

Times cited : (17)

References (42)

1
- 84907220060
- S. Renals, and S. Bengio Eds. New York, NY, USA: Springer
- L. Chen, R. Rose, Y. Qiao, I. Kimbara, F. Parrill, H. Welji, T. Han, J. Tu, Z. Huang, M. Harper, F. Quek, Y. Xiong, D. McNeill, R. Tuttle, and T. Huang, "VACE multimodal meeting corpus," Machine Learning for Multimodal Interaction, S. Renals and S. Bengio, Eds. New York, NY, USA: Springer, 2006.
- (2006) "VACE Multimodal Meeting Corpus," Machine Learning for Multimodal Interaction
- Chen, L.¹ Rose, R.² Qiao, Y.³ Kimbara, I.⁴ Parrill, F.⁵ Welji, H.⁶ Han, T.⁷ Tu, J.⁸ Huang, Z.⁹ Harper, M.¹⁰ Quek, F.¹¹ Xiong, Y.¹² McNeill, D.¹³ Tuttle, R.¹⁴ Huang, T.¹⁵

2
- 41349108337
- A multimodal annotated corpus of consensus decision making meetings
- F. Pianesi, M. Zancanaro, B. Lepri, and A. Cappelletti, "A multimodal annotated corpus of consensus decision making meetings," Lang. Resources Eval., vol. 41, pp. 409-429, 2007.
- (2007) Lang. Resources Eval. , vol.41 , pp. 409-429
- Pianesi, F.¹ Zancanaro, M.² Lepri, B.³ Cappelletti, A.⁴

3
- 67650667995
- Meeting behavior detection in smart environments: Nonverbal cues that help to obtain natural interaction
- M. Poel, R. Poppe, and A. Nijholt, "Meeting behavior detection in smart environments: Nonverbal cues that help to obtain natural interaction," in Proc. IEEE Int Conf. Autom. Face Gesture Recog., 2008, pp. 1-6.
- (2008) Proc. IEEE Int Conf. Autom. Face Gesture Recog. , pp. 1-6
- Poel, M.¹ Poppe, R.² Nijholt, A.³

4
- 78650942732
- Analysis environment of conversational structure with nonverbal multimodal data
- Y. Sumi, M. Yano, and T. Nishida, "Analysis environment of conversational structure with nonverbal multimodal data," in Proc. Int. Conf. Multimodal Interfaces Workshop Mach. Learn. Multimodal Interact., 2010, pp. 44-1-44-4.
- (2010) Proc. Int. Conf. Multimodal Interfaces Workshop Mach. Learn. Multimodal Interact. , pp. 441-444
- Sumi, Y.¹ Yano, M.² Nishida, T.³

5
- 77955686025
- Robust speech recognition based on dereverberation parameter optimization using acoustic model likelihood
- Sep.
- R. Gomez and T. Kawahara, "Robust speech recognition based on dereverberation parameter optimization using acoustic model likelihood," IEEE Trans. Audio, Speech Lang. Process., vol. 18, no. 7, pp. 1708-1716, Sep. 2010.
- (2010) IEEE Trans. Audio, Speech Lang. Process. , vol.18 , Issue.7 , pp. 1708-1716
- Gomez, R.¹ Kawahara, T.²

6
- 84930032976
- New York, NY, USA: Springer
- T. Matsuyama, S. Nobuhara, T. Takai, and T. Tung, 3D Video and Its Applications. New York, NY, USA: Springer, 2012.
- (2012) 3D Video and Its Applications
- Matsuyama, T.¹ Nobuhara, S.² Takai, T.³ Tung, T.⁴

7
- 84887363649
- Interval-based modeling of human communication dynamics via hybrid dynamical systems
- H. Kawashima and T. Matsuyama, "Interval-based modeling of human communication dynamics via hybrid dynamical systems," in Proc. NIPS Workshop Modeling Human Commun. Dyn., 2010, pp. 34-37.
- (2010) Proc. NIPS Workshop Modeling Human Commun. Dyn. , pp. 34-37
- Kawashima, H.¹ Matsuyama, T.²

8
- 84974148892
- Backchannels across cultures: A study of Americans and Japanese
- S. White, "Backchannels across cultures: A study of Americans and Japanese," Lang. Soc., vol. 18, pp. 59-76, 1989.
- (1989) Lang. Soc. , vol.18 , pp. 59-76
- White, S.¹

9
- 84878405160
- Prediction of turn-taking by combining prosodic and eye-gaze information in poster conversations
- T. Kawahara, T. Iwatate, and K. Takanashi, "Prediction of turn-taking by combining prosodic and eye-gaze information in poster conversations," in Proc. Conf. Interspeech, 2012, pp. 1-4.
- (2012) Proc. Conf. Interspeech , pp. 1-4
- Kawahara, T.¹ Iwatate, T.² Takanashi, K.³

10
- 0030655038
- Video skimming and characterization through the combination of image and language understanding techniques
- M. A. Smith and T. Kanade, "Video skimming and characterization through the combination of image and language understanding techniques," in Proc. IEEE Conf. Comput. Vision Pattern Recog., 1997, pp. 775-781.
- (1997) Proc. IEEE Conf. Comput. Vision Pattern Recog. , pp. 775-781
- Smith, M.A.¹ Kanade, T.²

11
- 34648875730
- Extractive summarization of meeting recordings
- G. Murray, S. Renals, and J. Carletta, "Extractive summarization of meeting recordings," in Proc. Conf. Interspeech, 2005, pp. 775-781.
- (2005) Proc. Conf. Interspeech , pp. 775-781
- Murray, G.¹ Renals, S.² Carletta, J.³

12
- 85122848536
- Active audition for humanoid
- K. Nakadai, T. Lourens, H. G. Okuno, and H. Kitano, "Active audition for humanoid," in Proc. Nat. Conf. Artif. Intell., 2000, pp. 832-839.
- (2000) Proc. Nat. Conf. Artif. Intell. , pp. 832-839
- Nakadai, K.¹ Lourens, T.² Okuno, H.G.³ Kitano, H.⁴

13
- 84907196006
- [Online]. Available
- HARK 2.0. (2013). [Online]. Available: http://winnie.kuis.kyoto-u.ac.jp/hark
- (2013)
- Hark 2.0¹

14
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Feb.
- L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.R.¹

15
- 77956357907
- An interaction-embedded HMM framework for human behavior understanding: With nursing environments as examples
- Sep.
- C.-D. Liu, Y.-N. Chung, and P.-C. Chung, "An interaction-embedded HMM framework for human behavior understanding: With nursing environments as examples," IEEE Trans. Inf. Technol. Biomed., vol. 14, no. 5, pp. 1236-1246, Sep. 2010.
- (2010) IEEE Trans. Inf. Technol. Biomed. , vol.14 , Issue.5 , pp. 1236-1246
- Liu, C.-D.¹ Chung, Y.-N.² Chung, P.-C.³

16
- 84867723212
- Group dynamics and multimodal interaction modeling using a smart digital signage
- Springer LNCS
- T. Tung, R. Gomez, T. Kawahara, and T. Matsuyama, "Group dynamics and multimodal interaction modeling using a smart digital signage," in Eur. Conf. Comput. Vision Workshop, Springer LNCS, Part I, vol. 7583, 2012, pp. 362-371.
- (2012) Eur. Conf. Comput. Vision Workshop , vol.7583 , pp. 362-371
- Tung, T.¹ Gomez, R.² Kawahara, T.³ Matsuyama, T.⁴

17
- 84880673306
- Multi-party human-machine interaction using a smart multimodal digital signage
- Springer LNCS
- T. Tung, R. Gomez, T. Kawahara, and T. Matsuyama, "Multi-party human-machine interaction using a smart multimodal digital signage," in Proc. Int Conf. Human-Comput. Interact, Springer LNCS, Part IV, vol. 8007, 2013, pp. 408-415.
- (2013) Proc. Int Conf. Human-Comput. Interact , vol.8007 , pp. 408-415
- Tung, T.¹ Gomez, R.² Kawahara, T.³ Matsuyama, T.⁴

18
- 0028812427
- An optimum computer-generated pulse signal suitable for the measurement of very long impulse responses
- Y. Suzuki, F. Asano, H.-Y. Kim, and T. Sone, "An optimum computer-generated pulse signal suitable for the measurement of very long impulse responses," J. Acoust. Soc. Amer., vol. 97, no. 2, pp. 1119-1123, 1995.
- (1995) J. Acoust. Soc. Amer. , vol.97 , Issue.2 , pp. 1119-1123
- Suzuki, Y.¹ Asano, F.² Kim, H.-Y.³ Sone, T.⁴

19
- 17544395704
- Polar coordinate based nonlinear function for frequency-domain blind source separation
- H. Sawada, R. Mukai, S. Araki, and S. Makino, "Polar coordinate based nonlinear function for frequency-domain blind source separation," in Proc. IEEE Int Conf. Acoust., Speech, Signal Process., 2002, pp. 1001-1004.
- (2002) Proc. IEEE Int Conf. Acoust., Speech, Signal Process. , pp. 1001-1004
- Sawada, H.¹ Mukai, R.² Araki, S.³ Makino, S.⁴

20
- 51449090519
- Adaptive stepsize parameter control for real world blind source separation
- H. Nakajima, K. Nakadai, Y. Hasegawa, and H. Tsujino, "Adaptive stepsize parameter control for real world blind source separation," in Proc. IEEE Int Conf. Acoust., Speech, Signal Process., 2008, pp. 149-152.
- (2008) Proc. IEEE Int Conf. Acoust., Speech, Signal Process. , pp. 149-152
- Nakajima, H.¹ Nakadai, K.² Hasegawa, Y.³ Tsujino, H.⁴

21
- 0003786003
- Cambridge, MA, USA: MIT Press
- F. Jelinek, Statistical Methods for Speech Recognition. Cambridge, MA, USA: MIT Press, 1997.
- (1997) Statistical Methods for Speech Recognition
- Jelinek, F.¹

22
- 84859957286
- Multi-party human-robot interaction with distant-talking speech recognition
- R. Gomez, T. Kawahara, K. Nakamura, and K. Nakadai, "Multi-party human-robot interaction with distant-talking speech recognition," in Proc. ACM/IEEE Int Conf. Human-Robot Interaction, 2012, pp. 439-446.
- (2012) Proc. ACM/IEEE Int Conf. Human-Robot Interaction , pp. 439-446
- Gomez, R.¹ Kawahara, T.² Nakamura, K.³ Nakadai, K.⁴

23
- 84907200036
- Automatic distance compensation for robust voice-based human-computer interaction
- R. Gomez, K. Nakamura, and K. Nakadai, "Automatic distance compensation for robust voice-based human-computer interaction," Int. J. Comput., Inf., Mechatronics Syst. Sci. Eng., vol. 7, no. 7, pp. 398-407, 2013.
- (2013) Int. J. Comput., Inf., Mechatronics Syst. Sci. Eng. , vol.7 , Issue.7 , pp. 398-407
- Gomez, R.¹ Nakamura, K.² Nakadai, K.³

24
- 80052878786
- Real-time human pose recognition in parts from a single depth image
- J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp, M. Finocchio, R. Moore, A. Kipman, and A. Blake, "Real-time human pose recognition in parts from a single depth image," in Proc. IEEE Conf. Comput. Vis. Pattern Recog., 2011, pp. 1297-1304.
- (2011) Proc. IEEE Conf. Comput. Vis. Pattern Recog. , pp. 1297-1304
- Shotton, J.¹ Fitzgibbon, A.² Cook, M.³ Sharp, T.⁴ Finocchio, M.⁵ Moore, R.⁶ Kipman, A.⁷ Blake, A.⁸

25
- 77249176311
- Human motion tracking using a color-based particle filter driven by optical flow
- T. Tung and T. Matsuyama, "Human motion tracking using a color-based particle filter driven by optical flow," Eur. Conf. Comput. Vision Workshop, MLVMA, pp. 1-12, 2008.
- (2008) Eur. Conf. Comput. Vision Workshop, MLVMA , pp. 1-12
- Tung, T.¹ Matsuyama, T.²

26
- 2142812371
- Robust real-time face detection
- P. Viola and M. Jones, "Robust real-time face detection," Int J. Comput. Vision, vol. 57, no. 2, pp. 137-154, 2004.
- (2004) Int J. Comput. Vision , vol.57 , Issue.2 , pp. 137-154
- Viola, P.¹ Jones, M.²

27
- 84957810778
- Active appearance models
- T. F. Cootes, G. J. Edwards, and C. J. Taylor, "Active appearance models," in Proc. Eur. Conf. Comput. Vis., 1998, pp. 484-498.
- (1998) Proc. Eur. Conf. Comput. Vis. , pp. 484-498
- Cootes, T.F.¹ Edwards, G.J.² Taylor, C.J.³

28
- 80053039628
- Real time head pose estimation from consumer depth cameras
- G. Fanelli, T. Weise, J. Gall, and L. V. Gool, "Real time head pose estimation from consumer depth cameras," in Proc. DAGM - Int. Conf. Pattern Recog., 2011, pp. 101-110.
- (2011) Proc. DAGM - Int. Conf. Pattern Recog. , pp. 101-110
- Fanelli, G.¹ Weise, T.² Gall, J.³ Gool, L.V.⁴

29
- 77953873725
- User-oriented document summarization through vision-based eye-tracking
- S. Xu, H. Jiang, and F. C. Lau, "User-oriented document summarization through vision-based eye-tracking," in Proc. 13th ACM Int Conf. Intell. User Interfaces, 2009, pp. 7-16.
- (2009) Proc. 13th ACM Int Conf. Intell. User Interfaces , pp. 7-16
- Xu, S.¹ Jiang, H.² Lau, F.C.³

30
- 84856635786
- Inferring human gaze from appearance via adaptive linear regression
- L. Feng, Y. Sugano, T. Okabe, and Y. Sato, "Inferring human gaze from appearance via adaptive linear regression," in Proc. IEEE Int Conf. Comput. Vision, 2011, pp. 153-160.
- (2011) Proc. IEEE Int Conf. Comput. Vision , pp. 153-160
- Feng, L.¹ Sugano, Y.² Okabe, T.³ Sato, Y.⁴

31
- 76749102346
- Complete multi-view reconstruction of dynamic scenes from probabilistic fusion of narrow and wide baseline stereo
- T. Tung, S. Nobuhara, and T. Matsuyama, "Complete multi-view reconstruction of dynamic scenes from probabilistic fusion of narrow and wide baseline stereo," in Proc. IEEE Int Conf. Comput. Vision, 2009, pp. 1709-1716.
- (2009) Proc. IEEE Int Conf. Comput. Vision , pp. 1709-1716
- Tung, T.¹ Nobuhara, S.² Matsuyama, T.³

32
- 84906283243
- Estimation of interest and comprehension level of audience through multi-modal behaviors in poster conversations
- T. Kawahara, S. Hayashi, and K. Takanashi, "Estimation of interest and comprehension level of audience through multi-modal behaviors in poster conversations," in Proc. Conf. Interspeech, 2013, pp. 1882-1885.
- (2013) Proc. Conf. Interspeech , pp. 1882-1885
- Kawahara, T.¹ Hayashi, S.² Takanashi, K.³

33
- 0037312530
- Dynamic textures
- G. Doretto, A. Chiuso, Y. Wu, and S. Soatto, "Dynamic textures," Int J. Comput. Vis, vol. 51, no. 2, pp. 91-109, 2003.
- (2003) Int J. Comput. Vis , vol.51 , Issue.2 , pp. 91-109
- Doretto, G.¹ Chiuso, A.² Wu, Y.³ Soatto, S.⁴

34
- 70450184786
- View-invariant dynamic texture recognition using a bag of dynamical systems
- A. Ravichandran, R. Chaudhry, and R. Vidal, "View-invariant dynamic texture recognition using a bag of dynamical systems," in Proc. IEEE Conf. Comput. Vision Pattern Recog., 2009, pp. 1932-1939.
- (2009) Proc. IEEE Conf. Comput. Vision Pattern Recog. , pp. 1932-1939
- Ravichandran, A.¹ Chaudhry, R.² Vidal, R.³

35
- 70450173435
- Histograms of oriented optical flow and Binet-Cauchy kernels on nonlinear dynamical systems for the recognition of human actions
- R. Chaudhry, A. Ravichandran, G. Hager, and R. Vidal, "Histograms of oriented optical flow and Binet-Cauchy kernels on nonlinear dynamical systems for the recognition of human actions," in Proc. IEEE Conf. Comput. Vision Pattern Recog., 2009, pp. 1932-1939.
- (2009) Proc. IEEE Conf. Comput. Vision Pattern Recog. , pp. 1932-1939
- Chaudhry, R.¹ Ravichandran, A.² Hager, G.³ Vidal, R.⁴

36
- 84887379466
- Intrinsic characterization of dynamic surfaces
- T. Tung and T. Matsuyama, "Intrinsic characterization of dynamic surfaces," in Proc. IEEE Conf. Comput. Vision Pattern Recog., 2013, pp. 233-240.
- (2013) Proc. IEEE Conf. Comput. Vision Pattern Recog. , pp. 233-240
- Tung, T.¹ Matsuyama, T.²

37
- 84911420890
- Timing-based local descriptor for dynamic surfaces
- T. Tung and T. Matsuyama, "Timing-based local descriptor for dynamic surfaces," in Proc. IEEE Conf. Comput. Vision Pattern Recog., 2014.
- (2014) Proc. IEEE Conf. Comput. Vision Pattern Recog.
- Tung, T.¹ Matsuyama, T.²

38
- 33745956456
- Mixtures of dynamic textures
- A. B. Chan and N. Vasconcelos, "Mixtures of dynamic textures," in Proc. IEEE Int Conf. Comput. Vis., 2005, pp. 641-647.
- (2005) Proc. IEEE Int Conf. Comput. Vis. , pp. 641-647
- Chan, A.B.¹ Vasconcelos, N.²

39
- 0000113585
- Surface shape and curvature scales
- J. Koenderink and A. van Doorn, "Surface shape and curvature scales," Image Vis. Comput., vol. 10, pp. 557-564, 1992.
- (1992) Image Vis. Comput. , vol.10 , pp. 557-564
- Koenderink, J.¹ Van Doorn, A.²

40
- 49249085803
- Performance capture from sparse multi-view video
- E. de Aguiar, C. Stoll, C. Theobalt, N. Ahmed, H.-P. Seidel, and S. Thrun, "Performance capture from sparse multi-view video," ACM Trans. Graph., vol. 27, no. 3, pp. 98-1-98-10, 2008.
- (2008) ACM Trans. Graph. , vol.27 , Issue.3 , pp. 98-1-98-10
- De Aguiar, E.¹ Stoll, C.² Theobalt, C.³ Ahmed, N.⁴ Seidel, H.-P.⁵ Thrun, S.⁶

41
- 84868005473
- 4th ed New York, NY, USA: Norton
- D. Freedman, R. Pisani, and R. Purves, Statistics, 4th ed. New York, NY, USA: Norton, 2007.
- (2007) Statistics
- Freedman, D.¹ Pisani, R.² Purves, R.³

42
- 84862678656
- Topology dictionary for 3D video understanding
- Aug.
- T. Tung and T. Matsuyama, "Topology dictionary for 3D video understanding," IEEE Trans. Pattern Anal. Mach. Intell., vol. 34, no. 8, pp. 1645-1657, Aug. 2012.
- (2012) IEEE Trans. Pattern Anal. Mach. Intell. , vol.34 , Issue.8 , pp. 1645-1657
- Tung, T.¹ Matsuyama, T.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.