-
1
-
-
84907220060
-
-
S. Renals, and S. Bengio Eds. New York, NY, USA: Springer
-
L. Chen, R. Rose, Y. Qiao, I. Kimbara, F. Parrill, H. Welji, T. Han, J. Tu, Z. Huang, M. Harper, F. Quek, Y. Xiong, D. McNeill, R. Tuttle, and T. Huang, "Vace multimodalmeeting corpus," Machine Learning for Multimodal Interaction, S. Renals, and S. Bengio Eds. New York, NY, USA: Springer, 2006.
-
(2006)
"vace Multimodalmeeting Corpus," Machine Learning for Multimodal Interaction
-
-
Chen, L.1
Rose, R.2
Qiao, Y.3
Kimbara, I.4
Parrill, F.5
Welji, H.6
Han, T.7
Tu, J.8
Huang, Z.9
Harper, M.10
Quek, F.11
Xiong, Y.12
McNeill, D.13
Tuttle, R.14
Huang, T.15
-
2
-
-
41349108337
-
A multimodal annotated corpus of concensus decision making meetings
-
F. Pianesi, M. Zancanaro, B. Lepri, and A. Cappelletti, "A multimodal annotated corpus of concensus decision making meetings," Lang. Resources Eval., vol. 41, pp. 409-429, 2007.
-
(2007)
Lang. Resources Eval.
, vol.41
, pp. 409-429
-
-
Pianesi, F.1
Zancanaro, M.2
Lepri, B.3
Cappelletti, A.4
-
3
-
-
67650667995
-
Meeting behavior detection in smart environments: Nonverbal cues that help to obtain natural interaction
-
M. Poel, R. Poppe, and A. Nijholt, "Meeting behavior detection in smart environments: Nonverbal cues that help to obtain natural interaction," in Proc. IEEE Int Conf. Autom. Face Gesture Recog., 2008, pp. 1-6.
-
(2008)
Proc. IEEE Int Conf. Autom. Face Gesture Recog.
, pp. 1-6
-
-
Poel, M.1
Poppe, R.2
Nijholt, A.3
-
4
-
-
78650942732
-
Analysis environment of conversational structure with nonverbal multimodal data
-
Y. Sumi, M. Yano, and T. Nishida, "Analysis environment of conversational structure with nonverbal multimodal data," in Proc. Int. Conf. Multimodal Interfaces Workshop Mach. Learn. Multimodal Interact., 2010, pp. 44-1-44-4.
-
(2010)
Proc. Int. Conf. Multimodal Interfaces Workshop Mach. Learn. Multimodal Interact.
, pp. 441-444
-
-
Sumi, Y.1
Yano, M.2
Nishida, T.3
-
5
-
-
77955686025
-
Robust speech recognition based on dereverberation parameter optimization using acoustic model likelihood
-
Sep.
-
R.Gomez and T. Kawahara, "Robust speech recognition based on dereverberation parameter optimization using acoustic model likelihood," IEEE Trans. Audio, Speech Lang. Process., vol. 18, no. 7, pp. 1708-1716, Sep. 2010.
-
(2010)
IEEE Trans. Audio, Speech Lang. Process.
, vol.18
, Issue.7
, pp. 1708-1716
-
-
Gomez, R.1
Kawahara, T.2
-
6
-
-
84930032976
-
-
New York, NY, USA: Springer
-
T. Matsuyama, S. Nobuhara, T. Takai, and T. Tung, 3D Video and Its Applications. New York, NY, USA: Springer, 2012.
-
(2012)
3D Video and Its Applications
-
-
Matsuyama, T.1
Nobuhara, S.2
Takai, T.3
Tung, T.4
-
7
-
-
84887363649
-
Interval-based modeling of human communication dynamics via hybrid dynamical systems
-
H. Kawashima and T. Matsuyama, "Interval-based modeling of human communication dynamics via hybrid dynamical systems," in Proc. NIPS Workshop Modeling Human Commun. Dyn., 2010, pp. 34-37.
-
(2010)
Proc. NIPS Workshop Modeling Human Commun. Dyn.
, pp. 34-37
-
-
Kawashima, H.1
Matsuyama, T.2
-
8
-
-
84974148892
-
Backchannels across cultures: A study of Americans and Japanese
-
S. White, "Backchannels across cultures: A study of Americans and Japanese," Lang. Soc., vol. 18, pp. 59-76, 1989.
-
(1989)
Lang. Soc.
, vol.18
, pp. 59-76
-
-
White, S.1
-
9
-
-
84878405160
-
Prediction of turn-taking by combining prosodic and eye-gaze information in poster conversations
-
T. Kawahara, T. Iwatate, and K. Takanashi, "Prediction of turn-taking by combining prosodic and eye-gaze information in poster conversations," in Proc. Conf. Interspeech, 2012, pp. 1-4.
-
(2012)
Proc. Conf. Interspeech
, pp. 1-4
-
-
Kawahara, T.1
Iwatate, T.2
Takanashi, K.3
-
10
-
-
0030655038
-
Video skimming and characterization through the combination of image and language understanding techniques
-
M. A. Smith and T. Kanade, "Video skimming and characterization through the combination of image and language understanding techniques," in Proc. IEEE Conf. Comput. Vision Pattern Recog., 1997, pp. 775-781.
-
(1997)
Proc. IEEE Conf. Comput. Vision Pattern Recog.
, pp. 775-781
-
-
Smith, M.A.1
Kanade, T.2
-
11
-
-
34648875730
-
Extractive summarization of meeting recordings
-
G. Murray, S. Renals, and J. Carletta, "Extractive summarization of meeting recordings," in Proc. Conf. Interspeech, 2005, pp. 775-781.
-
(2005)
Proc. Conf. Interspeech
, pp. 775-781
-
-
Murray, G.1
Renals, S.2
Carletta, J.3
-
12
-
-
85122848536
-
Active audition for humanoid
-
K. Nakadai, T. Lourens, H. G. Okuno, and H. Kitano, "Active audition for humanoid," in Proc. Nat. Conf. Artif. Intell., 2000, pp. 832-839.
-
(2000)
Proc. Nat. Conf. Artif. Intell.
, pp. 832-839
-
-
Nakadai, K.1
Lourens, T.2
Okuno, H.G.3
Kitano, H.4
-
13
-
-
84907196006
-
-
[Online]. Available
-
Hark 2.0. (2013). [Online]. Available: http://winnie.kuis.kyotou. ac.jp/hark
-
(2013)
-
-
Hark 2.01
-
14
-
-
0024610919
-
A tutorial on hidden Markow models and selected applications in speech recognition
-
Feb.
-
L. R. Rabiner, "A tutorial on hidden Markow models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
-
(1989)
Proc. IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
15
-
-
77956357907
-
An interaction-embedded HMM framework for human behavior understanding: With nursing environments as examples
-
Sep.
-
C.-D. Liu, Y.-N. Chung, and P.-C. Chung, "An interaction-embedded HMM framework for human behavior understanding: With nursing environments as examples," IEEE Trans. Inf. Technol. Biomed., vol. 14, no. 5, pp. 1236-1246, Sep. 2010.
-
(2010)
IEEE Trans. Inf. Technol. Biomed.
, vol.14
, Issue.5
, pp. 1236-1246
-
-
Liu, C.-D.1
Chung, Y.-N.2
Chung, P.-C.3
-
16
-
-
84867723212
-
Group dynamics and multimodal interaction modeling using a smart digital signage
-
Springer LNCS
-
T. Tung, R. Gomez, T. Kawahara, and T. Matsuyama, "Group dynamics and multimodal interaction modeling using a smart digital signage," in Eur. Conf. Comput. Vision Workshop, Springer LNCS, Part I, vol. 7583, 2012, pp. 362-371.
-
(2012)
Eur. Conf. Comput. Vision Workshop
, vol.7583
, pp. 362-371
-
-
Tung, T.1
Gomez, R.2
Kawahara, T.3
Matsuyama, T.4
-
17
-
-
84880673306
-
Multi-party humanmachine interaction using a smart multimodal digital signage
-
Springer LNCS
-
T. Tung, R. Gomez, T. Kawahara, and T. Matsuyama, "Multi-party humanmachine interaction using a smart multimodal digital signage," in Proc. Int Conf. Human-Comput. Interact, Springer LNCS, Part IV, vol. 8007, 2013, pp. 408-415.
-
(2013)
Proc. Int Conf. Human-Comput. Interact
, vol.8007
, pp. 408-415
-
-
Tung, T.1
Gomez, R.2
Kawahara, T.3
Matsuyama, T.4
-
18
-
-
0028812427
-
An optimum computergenerated pulse signal suitable for the measurement of very long impulse responses
-
Y. Suzuki, F. Asano, H.-Y. Kim, and T. Sone, "An optimum computergenerated pulse signal suitable for the measurement of very long impulse responses," J. Acoust. Soc. Amer., vol. 97, no. 2, pp. 1119-1123, 1995.
-
(1995)
J. Acoust. Soc. Amer.
, vol.97
, Issue.2
, pp. 1119-1123
-
-
Suzuki, Y.1
Asano, F.2
Kim, H.-Y.3
Sone, T.4
-
19
-
-
17544395704
-
Polar coordinate based nonlinear function for frequency-domain blind source separation
-
H. Sawada, R. Mukai, S. Araki, and S. Makino, "Polar coordinate based nonlinear function for frequency-domain blind source separation," in Proc. IEEE Int Conf. Acoust., Speech, Signal Process., 2002, pp. 1001-1004.
-
(2002)
Proc. IEEE Int Conf. Acoust., Speech, Signal Process.
, pp. 1001-1004
-
-
Sawada, H.1
Mukai, R.2
Araki, S.3
Makino, S.4
-
20
-
-
51449090519
-
Adaptive stepsize parameter control for real world blind source separation
-
H. Nakajima, K. Nakadai, Y. Hasegawa, and H. Tsujino, "Adaptive stepsize parameter control for real world blind source separation," in Proc. IEEE Int Conf. Acoust., Speech, Signal Process., 2008, pp. 149-152.
-
(2008)
Proc. IEEE Int Conf. Acoust., Speech, Signal Process.
, pp. 149-152
-
-
Nakajima, H.1
Nakadai, K.2
Hasegawa, Y.3
Tsujino, H.4
-
22
-
-
84859957286
-
Multi-party human-robot interaction with distant-talking speech recognition
-
R. Gomez, T. Kawahara, K. Nakamura, and K. Nakadai, "Multi-party human-robot interaction with distant-talking speech recognition," in Proc. ACM/IEEE Int Conf. Human-Robot Interaction, 2012, pp. 439-446.
-
(2012)
Proc. ACM/IEEE Int Conf. Human-Robot Interaction
, pp. 439-446
-
-
Gomez, R.1
Kawahara, T.2
Nakamura, K.3
Nakadai, K.4
-
23
-
-
84907200036
-
Automatic distance compensation for robust voice-based human-computer interaction
-
R. Gomez, K. Nakamura, and K. Nakadai, "Automatic distance compensation for robust voice-based human-computer interaction," Int. J. Comput., Inf., Mechatronics Syst. Sci. Eng., vol. 7, no. 7, pp. 398-407, 2013.
-
(2013)
Int. J. Comput., Inf., Mechatronics Syst. Sci. Eng.
, vol.7
, Issue.7
, pp. 398-407
-
-
Gomez, R.1
Nakamura, K.2
Nakadai, K.3
-
24
-
-
80052878786
-
Real-time human pose recognition in parts from a single depth image
-
J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp, M. Finocchio, R. Moore, A. Kipman, and A. Blake, "Real-time human pose recognition in parts from a single depth image," in Proc. IEEE Conf. Comput. Vis. Pattern Recog., 2011, pp. 1297-1304.
-
(2011)
Proc. IEEE Conf. Comput. Vis. Pattern Recog.
, pp. 1297-1304
-
-
Shotton, J.1
Fitzgibbon, A.2
Cook, M.3
Sharp, T.4
Finocchio, M.5
Moore, R.6
Kipman, A.7
Blake, A.8
-
25
-
-
77249176311
-
Human motion tracking using a color-based particle filter driven by optical flow
-
T. Tung and T. Matsuyama, "Human motion tracking using a color-based particle filter driven by optical flow," Eur. Conf. Comput. VisionWorkshop, MLVMA, pp. 1-12, 2008.
-
(2008)
Eur. Conf. Comput. VisionWorkshop, MLVMA
, pp. 1-12
-
-
Tung, T.1
Matsuyama, T.2
-
26
-
-
2142812371
-
Robust real-time face detection
-
P. Viola and M. Jones, "Robust real-time face detection," Int J. Comput. Vision, vol. 57, no. 2, pp. 137-154, 2004.
-
(2004)
Int J. Comput. Vision
, vol.57
, Issue.2
, pp. 137-154
-
-
Viola, P.1
Jones, M.2
-
27
-
-
84957810778
-
Active appearance models
-
T. F. Cootes, G. J. Edwards, and C. J. Taylor, "Active appearance models," in Proc. Eur. Conf. Comput. Vis., 1998, pp. 484-498.
-
(1998)
Proc. Eur. Conf. Comput. Vis.
, pp. 484-498
-
-
Cootes, T.F.1
Edwards, G.J.2
Taylor, C.J.3
-
28
-
-
80053039628
-
Real time head pose estimation from consumer depth cameras
-
G. Fanelli, T.Weise, J. Gall, and L. V. Gool, "Real time head pose estimation from consumer depth cameras," in Proc. DAGM - Int. Conf. Pattern Recog., 2011, pp. 101-110.
-
(2011)
Proc. DAGM - Int. Conf. Pattern Recog.
, pp. 101-110
-
-
Fanelli, G.1
Weise, T.2
Gall, J.3
Gool, L.V.4
-
29
-
-
77953873725
-
User-oriented document summarization through vision-based eye-tracking
-
S. Xu, H. Jiang, and F. C. Lau, "User-oriented document summarization through vision-based eye-tracking," in Proc. 13th ACM Int Conf. Intell. User Interfaces, 2009, pp. 7-16.
-
(2009)
Proc. 13th ACM Int Conf. Intell. User Interfaces
, pp. 7-16
-
-
Xu, S.1
Jiang, H.2
Lau, F.C.3
-
30
-
-
84856635786
-
Inferring human gaze from appearance via adaptive linear regression
-
L. Feng,Y. Sugano, T. Okabe, andY. Sato, "Inferring human gaze from appearance via adaptive linear regression," in Proc. IEEE Int Conf. Comput. Vision, 2011, pp. 153-160.
-
(2011)
Proc. IEEE Int Conf. Comput. Vision
, pp. 153-160
-
-
Feng, L.1
Sugano, Y.2
Okabe, T.3
Sato, Y.4
-
31
-
-
76749102346
-
Complete multi-view reconstruction of dynamic scenes from probabilistic fusion of narrow and wide baseline stereo
-
T. Tung, S. Nobuhara, and T. Matsuyama, "Complete multi-view reconstruction of dynamic scenes from probabilistic fusion of narrow and wide baseline stereo," in Proc. IEEE Int Conf. Comput. Vision, 2009, pp. 1709- 1716.
-
(2009)
Proc. IEEE Int Conf. Comput. Vision
, pp. 1709-1716
-
-
Tung, T.1
Nobuhara, S.2
Matsuyama, T.3
-
32
-
-
84906283243
-
Estimation of interest and comprehension level of audience through multi-modal behaviors in poster conversations
-
T. Kawahara, S. Hayashi, and K. Takanashi, "Estimation of interest and comprehension level of audience through multi-modal behaviors in poster conversations," in Proc. Conf. Interspeech, 2013, pp. 1882-1885.
-
(2013)
Proc. Conf. Interspeech
, pp. 1882-1885
-
-
Kawahara, T.1
Hayashi, S.2
Takanashi, K.3
-
33
-
-
0037312530
-
Dynamic textures
-
G. Doretto, A. Chiuso, Y. Wu, and S. Soatto, "Dynamic textures," Int J. Comput. Vis, vol. 51, no. 2, pp. 91-109, 2003.
-
(2003)
Int J. Comput. Vis
, vol.51
, Issue.2
, pp. 91-109
-
-
Doretto, G.1
Chiuso, A.2
Wu, Y.3
Soatto, S.4
-
34
-
-
70450184786
-
View-invariant dynamic texture recognition using a bag of dynamical systems
-
A. Ravichandran, R. Chaudhry, and R. Vidal, "View-invariant dynamic texture recognition using a bag of dynamical systems," in Proc. IEEE Conf. Comput. Vision Pattern Recog., 2009, pp. 1932-1939.
-
(2009)
Proc. IEEE Conf. Comput. Vision Pattern Recog.
, pp. 1932-1939
-
-
Ravichandran, A.1
Chaudhry, R.2
Vidal, R.3
-
35
-
-
70450173435
-
Histograms of oriented optical flow and binet-cauchy kernels on nonlinear dynamical systems for the recognition of human actions
-
R. Chaudhry, A. Ravichandran, G. Hager, and R. Vidal, "Histograms of oriented optical flow and binet-cauchy kernels on nonlinear dynamical systems for the recognition of human actions," in Proc. IEEE Conf. Comput. Vision Pattern Recog., 2009, pp. 1932-1939.
-
(2009)
Proc. IEEE Conf. Comput. Vision Pattern Recog.
, pp. 1932-1939
-
-
Chaudhry, R.1
Ravichandran, A.2
Hager, G.3
Vidal, R.4
-
39
-
-
0000113585
-
Surface shape and curvature scales
-
J. Koenderink and A. van Doorn, "Surface shape and curvature scales," Image Vis. Comput., vol. 10, pp. 557-564, 1992.
-
(1992)
Image Vis. Comput.
, vol.10
, pp. 557-564
-
-
Koenderink, J.1
Van Doorn, A.2
-
40
-
-
49249085803
-
Performance capture from sparsemulti-viewvideo
-
E. de Aguiar, C. Stoll, C. Theobalt, N. Ahmed, H.-P. Seidel, and S. Thrun, "Performance capture from sparsemulti-viewvideo," ACMTrans.Graph., vol. 27, no. 3, pp. 98-1-98-10, 2008.
-
(2008)
ACMTrans.Graph.
, vol.27
, Issue.3
, pp. 981-9810
-
-
De Aguiar, E.1
Stoll, C.2
Theobalt, C.3
Ahmed, N.4
Seidel, H.-P.5
Thrun, S.6
-
41
-
-
84868005473
-
-
4th ed New York, NY, USA: Norton
-
F. David, R. Pisani, and R. Purves, Statistics, 4th ed. New York, NY, USA: Norton, 2007.
-
(2007)
Statistics
-
-
David, F.1
Pisani, R.2
Purves, R.3
-
42
-
-
84862678656
-
Topology dictionary for 3D video understanding
-
Aug.
-
T. Tung and T. Matsuyama, "Topology dictionary for 3D video understanding," IEEE Trans. Pattern Anal. Mach. Intell., vol. 34, no. 8, pp. 1645-1657, Aug. 2012.
-
(2012)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.34
, Issue.8
, pp. 1645-1657
-
-
Tung, T.1
Matsuyama, T.2
|