SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2013, Pages 3647-3651

Speaker trait characterization in web videos: Uniting speech, language, and facial features

(5) Weninger, Felix a Wagner, Claudia b Wollmer, Martin a,c Schuller, Bjorn a,b Morency, Louis Philippe d

a TECHNICAL UNIVERSITY OF MUNICH (Germany)

b Institute of Information Systems and Information Management (Austria)

c BMW GROUP (Germany)

d University of Southern California ^* (United States)

Author keywords

computational paralinguistics; multi modal fusion; speaker classification

Indexed keywords

AUTOMATIC FEATURE EXTRACTION; AUTOMATIC SPEECH RECOGNITION; LINGUISTIC FEATURES; MULTI-MODAL APPROACH; MULTI-MODAL FUSION; PARALINGUISTICS; RACE CLASSIFICATION; SPEAKER CLASSIFICATION;

EYE PROTECTION; FACE RECOGNITION; LINGUISTICS; SIGNAL PROCESSING; SPEECH RECOGNITION; VIDEO STREAMING;

CLASSIFICATION (OF INFORMATION);

EID: 84890532851 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2013.6638338 Document Type: Conference Paper

Times cited : (6)

References (26)

1
- 34547542381
- Comparison of four approaches to age and gender recognition for telephone applications
- Honolulu, Hawaii
- F. Metze, J. Ajmera, R. Englert, U. Bub, F. Burkhardt, J. Stegmann, C. Muller, R. Huber, B. Andrassy, J. G. Bauer, and B. Littel, "Comparison of four approaches to age and gender recognition for telephone applications," in Proc. of ICASSP, Honolulu, Hawaii, 2007, pp. 1089-1092.
- (2007) Proc. of ICASSP , pp. 1089-1092
- Metze, F.¹ Ajmera, J.² Englert, R.³ Bub, U.⁴ Burkhardt, F.⁵ Stegmann, J.⁶ Muller, C.⁷ Huber, R.⁸ Andrassy, B.⁹ Bauer, J.G.¹⁰ Littel, B.¹¹

2
- 77949917470
- Estimation of unknown speakers height from speech
- I. Mporas and T. Ganchev, "Estimation of unknown speakers height from speech," International Journal of Speech Technology, vol. 12, no. 4, pp. 149-160, 2009.
- (2009) International Journal of Speech Technology , vol.12 , Issue.4 , pp. 149-160
- Mporas, I.¹ Ganchev, T.²

3
- 84867336059
- Semantic speech tagging: Towards combined analysis of speaker traits
- K. Brandenburg and M. Sandler, Eds., Ilmenau, Germany, Audio Engineering Society
- B. Schuller, M. Wollmer, F. Eyben, G. Rigoll, and D. Arsic, "Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits," in Proceedings AES 42nd International Conference, K. Brandenburg and M. Sandler, Eds., Ilmenau, Germany, 2011, pp. 89-97, Audio Engineering Society.
- (2011) Proceedings AES 42nd International Conference , pp. 89-97
- Schuller, B.¹ Wollmer, M.² Eyben, F.³ Rigoll, G.⁴ Arsic, D.⁵

4
- 85032750851
- The computational paralinguistics challenge
- July
- B. Schuller, "The Computational Paralinguistics Challenge," IEEE Signal Processing Magazine, vol. 29, no. 4, pp. 97-101, July 2012.
- (2012) IEEE Signal Processing Magazine , vol.29 , Issue.4 , pp. 97-101
- Schuller, B.¹

5
- 84872229639
- The voice of leadership: Models and performances of automatic analysis in on-line speeches
- F. Weninger, J. Krajewski, A. Batliner, and B. Schuller, "The Voice of Leadership: Models and Performances of Automatic Analysis in On-Line Speeches," IEEE Transactions on Affective Computing, 2012, http://doi.ieeecomputersociety.org/10.1109/T-AFFC.2012.15.
- (2012) IEEE Transactions on Affective Computing
- Weninger, F.¹ Krajewski, J.² Batliner, A.³ Schuller, B.⁴

6
- 84878398325
- Age estimation from telephone speech using ivectors
- Portland, OR, USA, no pagination
- M. H. Bahari, M. McLaren, H. Van hamme, and D. Van Leeuwen, "Age Estimation from Telephone Speech using ivectors," in Proc. of INTERSPEECH, Portland, OR, USA, 2012, no pagination.
- (2012) Proc. of INTERSPEECH
- Bahari, M.H.¹ McLaren, M.² Van Hamme, H.³ Van Leeuwen, D.⁴

7
- 84867332081
- Paralinguistics in speech and language-state-of-The-art and the challenge
- January
- B. Schuller, S. Steidl, A. Batliner, F. Burkhardt, L. Devillers, C. Muller, and S. Narayanan, "Paralinguistics in Speech and Language-State-of-The-Art and the Challenge," Computer Speech and Language, Special Issue on Paralinguistics in Naturalistic Speech and Language, vol. 27, no. 1, pp. 4-39, January 2013.
- (2013) Computer Speech and Language, Special Issue on Paralinguistics in Naturalistic Speech and Language , vol.27 , Issue.1 , pp. 4-39
- Schuller, B.¹ Steidl, S.² Batliner, A.³ Burkhardt, F.⁴ Devillers, L.⁵ Muller, C.⁶ Narayanan, S.⁷

8
- 79951945340
- Revisiting linear discriminant techniques in gender recognition
- J. Bekios-Calfa, J. M. Buenaposada, and L. Baumela, "Revisiting Linear Discriminant Techniques in Gender Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 33, no. 4, pp. 858-864, 2011.
- (2011) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.33 , Issue.4 , pp. 858-864
- Bekios-Calfa, J.¹ Buenaposada, J.M.² Baumela, L.³

9
- 77957902440
- Combining motion and appearance for gender classification from video sequences
- Tampa, FL, USA, IEEE, no pagination
- A. Hadid and M. Pietikainen, "Combining motion and appearance for gender classification from video sequences," in Proc. 19th International Conference on Pattern Recognition (ICPR 2008), Tampa, FL, USA, 2008, IEEE, no pagination.
- (2008) Proc. 19th International Conference on Pattern Recognition (ICPR 2008)
- Hadid, A.¹ Pietikainen, M.²

10
- 79551698511
- Automated person categorization for video surveillance using soft biometrics
- M. Demirkus, K. Garg, and S. Guler, "Automated person categorization for video surveillance using soft biometrics," in Biometric Technology for Human Identification VII, SPIE. 2010.
- (2010) Biometric Technology for Human Identification VII, SPIE.
- Demirkus, M.¹ Garg, K.² Guler, S.³

11
- 84865009952
- Soft biometric trait classification from real-world face videos conditioned on head pose estimation
- M. Demirkus, D. Precup, J. Clark, and T. Arbel, "Soft Biometric Trait Classification from Real-world Face Videos Conditioned on Head Pose Estimation," in Proc. IEEE Computer Society Workshop on Biometrics in association with IEEE CVPR, 2012, pp. 130-137.
- (2012) Proc. IEEE Computer Society Workshop on Biometrics in Association with IEEE CVPR , pp. 130-137
- Demirkus, M.¹ Precup, D.² Clark, J.³ Arbel, T.⁴

12
- 81855180780
- Analyzing facial behavioral features from videos
- A. A. Salah and B. Lepri, Eds. of Lecture Notes in Computer Science. Springer Berlin Heidelberg
- A. Hadid, "Analyzing facial behavioral features from videos," in Human Behavior Understanding, A. A. Salah and B. Lepri, Eds., vol. 7065 of Lecture Notes in Computer Science, pp. 52-61. Springer Berlin Heidelberg, 2011.
- (2011) Human Behavior Understanding , pp. 52-61
- Hadid, A.¹

13
- 33747179515
- The identity of bloggers: Openness and gender in personal weblogs
- S. Nowson and J. Oberlander, "The identity of bloggers: Openness and gender in personal weblogs," in In Proceedings of the AAAI Spring Symposia on Computational Approaches to Analyzing Weblogs, 2006.
- (2006) Proceedings of the AAAI Spring Symposia on Computational Approaches to Analyzing Weblogs
- Nowson, S.¹ Oberlander, J.²

14
- 84890542151
- Using liwc and coh-metrix to investigate gender differences in linguistic styles
- IGI Global doi:10.4018/978-1-60960-741-8.ch032
- C. M. Bell, P. M. McCarthy, and D. S. McNamara, "Using LIWC and Coh-Metrix to Investigate Gender Differences in Linguistic Styles," in Applied Natural Language Processing: Identification, Investigation and Resolution, P. McCarthy and C. Boonthum-Denecke, Eds., pp. 545-556. IGI Global, 2012, doi:10.4018/978-1-60960-741-8.ch032.
- (2012) Applied Natural Language Processing: Identification, Investigation and Resolution, P. McCarthy and C. Boonthum-Denecke, Eds. , pp. 545-556
- Bell, C.M.¹ McCarthy, P.M.² McNamara, D.S.³

15
- 79959816279
- Can conversational word usage be used to predict speaker demographics
- Makuhari, Japan
- D. Gillick, "Can conversational word usage be used to predict speaker demographics?," in Proc. of Interspeech, Makuhari, Japan, 2010, pp. 1381-1384.
- (2010) Proc. of Interspeech , pp. 1381-1384
- Gillick, D.¹

16
- 42549115295
- Audio-visual gender recognition
- M. Liu, X. Xu, and T. S. Huang, "Audio-visual gender recognition," in MIPPR 2007: Pattern Recognition and Computer Vision, vol. 6788 of SPIE, pp. 678803-678803-5. 2007.
- (2007) MIPPR 2007: Pattern Recognition and Computer Vision 6788 of SPIE , pp. 678803-678803
- Liu, M.¹ Xu, X.² Huang, T.S.³

17
- 84890497827
- Tech. Rep. Idiap-RR-73-2008, Idiap, November
- M. Pronobis and M. Magimai-Doss, "Integrating audio and vision for robust automatic gender recognition.," Tech. Rep. Idiap-RR-73-2008, Idiap, November 2008.
- (2008) Integrating Audio and Vision for Robust Automatic Gender Recognition.
- Pronobis, M.¹ Magimai-Doss, M.²

18
- 84884134833
- YouTube movie reviews: In cross, and open-domain sentiment analysis in an audiovisual context
- to appear
- M. Wollmer, F. Weninger, T. Knaup, B. Schuller, C. Sun, K. Sagae, and L.-P. Morency, "YouTube Movie Reviews: In, Cross, and Open-Domain Sentiment Analysis in an Audiovisual Context," IEEE Intelligent Systems Magazine, Special Issue on Concept-Level Opinion and Sentiment Analysis, 2013, to appear.
- (2013) IEEE Intelligent Systems Magazine, Special Issue on Concept-Level Opinion and Sentiment Analysis
- Wollmer, M.¹ Weninger, F.² Knaup, T.³ Schuller, B.⁴ Sun, C.⁵ Sagae, K.⁶ Morency, L.-P.⁷

19
- 60749097551
- Cambridge University Engineering Department, Cambridge, UK
- S. J. Young, G. Evermann, M. J. F. Gales, D. Kershaw, G. Moore, J. J. Odell, D. G. Ollason, D. Povey, V. Valtchev, and P. C. Woodland, The HTK book version 3.4, Cambridge University Engineering Department, Cambridge, UK, 2006.
- (2006) The HTK Book Version 3.4
- Young, S.J.¹ Evermann, G.² Gales, M.J.F.³ Kershaw, D.⁴ Moore, G.⁵ Odell, J.J.⁶ Ollason, D.G.⁷ Povey, D.⁸ Valtchev, V.⁹ Woodland, P.C.¹⁰

20
- 78650977476
- OpenSMILE\-The Munich versatile and fast open-source audio feature extractor
- Florence, Italy, October, ACM
- F. Eyben, M. Wollmer, and B. Schuller, "openSMILE\-The Munich versatile and fast open-source audio feature extractor," in Proc. of ACM Multimedia, Florence, Italy, October 2010, pp. 1459-1462, ACM.
- (2010) Proc. of ACM Multimedia , pp. 1459-1462
- Eyben, F.¹ Wollmer, M.² Schuller, B.³

21
- 0042609977
- Psychological aspects of natural language use: Our words, our selves
- J.W. Pennebaker, M. R. Mehl, and K. G. Niederhoffer, "Psychological aspects of natural language use: Our words, our selves," Annual Review of Psychology, vol. 54, no. 1, pp. 547-577, 2003.
- (2003) Annual Review of Psychology , vol.54 , Issue.1 , pp. 547-577
- Pennebaker, J.W.¹ Mehl, M.R.² Niederhoffer, K.G.³

22
- 77649253939
- The psychological meaning of words: Liwc and computerized text analysis methods
- Y. R. Tausczik and J. W. Pennebaker, "The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods," Journal of Language and Social Psychology, vol. 29, no. 1, pp. 24-54, 2010.
- (2010) Journal of Language and Social Psychology , vol.29 , Issue.1 , pp. 24-54
- Tausczik, Y.R.¹ Pennebaker, J.W.²

23
- 84948481845
- An algorithm for suffix stripping
- October
- M. F. Porter, "An algorithm for suffix stripping," Program, vol. 3, no. 14, pp. 130-137, October 1980.
- (1980) Program , vol.3 , Issue.14 , pp. 130-137
- Porter, M.F.¹

24
- 84870983778
- LSTM-modeling of continuous emotions in an audiovisual affect recognition framework
- M. Wollmer, M. Kaiser, F. Eyben, B. Schuller, and G. Rigoll, "LSTM-modeling of continuous emotions in an audiovisual affect recognition framework," Image and Vision Computing, 2012.
- (2012) Image and Vision Computing
- Wollmer, M.¹ Kaiser, M.² Eyben, F.³ Schuller, B.⁴ Rigoll, G.⁵

25
- 76749092270
- The WEKA data mining software: An update
- M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten, "The WEKA data mining software: an update," ACM SIGKDD Explorations Newsletter, vol. 11, no. 1, pp. 10-18, 2009.
- (2009) ACM SIGKDD Explorations Newsletter , vol.11 , Issue.1 , pp. 10-18
- Hall, M.¹ Frank, E.² Holmes, G.³ Pfahringer, B.⁴ Reutemann, P.⁵ Witten, I.H.⁶

26
- 33747115683
- Normative standards for vocal tract dimensions by race as measured by acoustic pharyngometry
- S. A. Xue and J. G. Hao, "Normative standards for vocal tract dimensions by race as measured by acoustic pharyngometry," Journal of Voice, vol. 20, no. 3, pp. 391-400, 2006.
- (2006) Journal of Voice , vol.20 , Issue.3 , pp. 391-400
- Xue, S.A.¹ Hao, J.G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.